GP with a Hybrid Tree-vector Representation for Instance Selection and Symbolic Regression on Incomplete Data
Created by W.Langdon from
gp-bibliography.bib Revision:1.8051
- @InProceedings{Al-Helali:2021:CEC,
-
author = "Baligh Al-Helali and Qi Chen and Bing Xue and
Mengjie Zhang",
-
booktitle = "2021 IEEE Congress on Evolutionary Computation (CEC)",
-
title = "{GP} with a Hybrid Tree-vector Representation for
Instance Selection and Symbolic Regression on
Incomplete Data",
-
year = "2021",
-
editor = "Yew-Soon Ong",
-
pages = "604--611",
-
address = "Krakow, Poland",
-
month = "28 " # jun # "-1 " # jul,
-
isbn13 = "978-1-7281-8393-0",
-
abstract = "Data incompleteness is a pervasive problem in symbolic
regression, and machine learning in general.
Unfortunately, most symbolic regression methods are
only applicable when the given data is complete. One
common approach to handling this situation is data
imputation. It works by estimating missing values based
on existing data. However, which existing data should
be used for imputing the missing values? The answer to
this question is important when dealing with incomplete
data. To address this question, this work proposes a
mixed tree-vector representation for genetic
programming to perform instance selection and symbolic
regression on incomplete data. In this representation,
each individual has two components: an expression tree
and a bit vector. While the tree component constructs
symbolic regression models, the vector component
selects the instances that are used to impute missing
values by the weighted k-nearest neighbour (WKNN)
imputation method. The complete imputed instances are
then used to evaluate the GP-based symbolic regression
model. The obtained experimental results show the
applicability of the proposed method on real-world data
sets with different missingness scenarios. When
compared with existing methods, the proposed method not
only produces more effective symbolic regression models
but also achieves more efficient imputations.",
-
keywords = "genetic algorithms, genetic programming, Computational
modeling, Machine learning, Evolutionary computation,
Regression tree analysis, Symbolic Regression,
Incomplete Data, Imputation, Instance Selection",
-
DOI = "doi:10.1109/CEC45853.2021.9504767",
-
notes = "Also known as \cite{9504767}",
- }
Genetic Programming entries for
Baligh Al-Helali
Qi Chen
Bing Xue
Mengjie Zhang
Citations