Semantic-aware Surrogate-assisted Genetic Programming for Feature Construction in Classification
Created by W.Langdon from
gp-bibliography.bib Revision:1.8638
- @InProceedings{DBLP:conf/cec/LiM25,
-
author = "Jiayi Li and Jianbin Ma",
-
title = "Semantic-aware Surrogate-assisted Genetic Programming
for Feature Construction in Classification",
-
booktitle = "2025 IEEE Congress on Evolutionary Computation (CEC)",
-
year = "2025",
-
editor = "Yaochu Jin and Thomas Baeck",
-
address = "Hangzhou, China",
-
month = "8-12 " # jun,
-
publisher = "IEEE",
-
keywords = "genetic algorithms, genetic programming, Measurement,
Machine learning algorithms, Computational modeling,
Semantics, Evolutionary computation, Machine learning,
Surrogate Model, Feature Construction, Classification",
-
isbn13 = "979-8-3315-3432-5",
-
timestamp = "Tue, 05 Aug 2025 01:00:00 +0200",
-
biburl = "
https://dblp.org/rec/conf/cec/LiM25.bib",
-
bibsource = "dblp computer science bibliography, https://dblp.org",
-
URL = "
https://doi.org/10.1109/CEC65147.2025.11042980",
-
DOI = "
10.1109/CEC65147.2025.11042980",
-
abstract = "Feature construction is a crucial preprocessing
technique in machine learning tasks, which aims at
constructing high-level features to improve the
performance of learning algorithms. Genetic Programming
(GP) is widely used in feature construction tasks due
to its flexible representation and powerful search
capabilities. Preselection strategy holds promise for
enhancing the performance of evolutionary algorithms.
It seeks to preselect better individuals prior to
fitness evaluation in order to improve population
quality. However, since GP individuals are represented
as non-numeric programs, it is challenging to
accurately assess whether an individual is promising.
This paper introduces a semantic-aware
surrogate-assisted method aimed at enhancing the
efficiency of GP in feature construction. We first
defined the semantics of multi-tree GP and propose
three distance metrics to measure similarity between
individuals. A KNN-based surrogate model is developed
to estimate the performance of candidate individuals.
Furthermore, a dynamic preselection strategy is
proposed to identify promising individuals from the
generated candidates. Experimental results on 10
classification datasets demonstrate significant
performance advantages of the proposed method over
three GP-based feature construction methods. Further
experiments demonstrated the value of the proposed
surrogate-assisted method and its key components.",
-
notes = "also known as \cite{li:2025:CEC9} \cite{11042980}",
- }
Genetic Programming entries for
Jiayi Li
Jianbin Ma
Citations