Bias and Variance Analysis of Contemporary Symbolic Regression Methods
Created by W.Langdon from
gp-bibliography.bib Revision:1.8576
- @Article{kammerer:2024:AS,
-
author = "Lukas Kammerer and Gabriel Kronberger and
Stephan Winkler",
-
title = "Bias and Variance Analysis of Contemporary Symbolic
Regression Methods",
-
journal = "Applied Sciences",
-
year = "2024",
-
volume = "14",
-
number = "23",
-
pages = "Article No. 11061",
-
keywords = "genetic algorithms, genetic programming",
-
ISSN = "2076-3417",
-
URL = "
https://www.mdpi.com/2076-3417/14/23/11061",
-
DOI = "
doi:10.3390/app142311061",
-
abstract = "Symbolic regression is commonly used in domains where
both high accuracy and interpretability of models is
required. While symbolic regression is capable to
produce highly accurate models, small changes in the
training data might cause highly dissimilar solution.
The implications in practice are huge, as
interpretability as key-selling feature degrades when
minor changes in data cause substantially different
behaviour of models. We analyse those perturbations
caused by changes in training data for ten contemporary
symbolic regression algorithms. We analyse existing
machine learning models from the SRBench benchmark
suite, a benchmark that compares the accuracy of
several symbolic regression algorithms. We measure the
bias and variance of algorithms and show how algorithms
like Operon and GP-GOMEA return highly accurate models
with similar behaviour despite changes in training
data. Our results highlight that larger model sizes do
not imply different behaviour when training data
change. On the contrary, larger models effectively
prevent systematic errors. We also show how other
algorithms like ITEA or AIFeynman with the declared
goal of producing consistent results meet up to their
expectation of small and similar models.",
-
notes = "also known as \cite{app142311061}",
- }
Genetic Programming entries for
Lukas Kammerer
Gabriel Kronberger
Stephan M Winkler
Citations