A Filter-Based Feature Selection and Ranking Approach to Enhance Genetic Programming for High-Dimensional Data Analysis

Mohammad Sadegh Khorshidi, Danial Yazdani, Jacek Mandziuk, Mohammad Reza Nikoo, Amir H. Gandomi

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Genetic programming (GP), as a predictive data analytic tool, has difficulties dealing with high-dimensional problems. Therefore, some GP variants have been proposed for this type of problem, such as multi-stage GP (MSGP). Filter-based feature selection is commonly used in the literature for various machine learning purposes. However, its application for GP is overlooked due to GP's capability to operate as a wrapper-based feature selection while trying to find an optimal expression of the target variable via a functional combination of predictors. The effectiveness of wrapper- and filer-based feature selection approaches in machine learning has been the subject of a long-standing debate in the literature. This study aims to introduce an efficient feature selection approach and couple it with MSGP in order to handle high-dimensional problems. In addition, the stages of the GP are systematically ordered based on the variables' information. The proposed approach is tested against five real high-dimensional datasets. The results show that GP's inherent wrapper feature selection ability can be advanced further by using a filter-based feature selection approach to shrink the search space, which results in improving computational costs, expression complexity and the accuracy of MSGP.

Original languageEnglish
Title of host publication2023 IEEE Congress on Evolutionary Computation, CEC 2023
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages1-9
Number of pages9
ISBN (Electronic)9798350314588
ISBN (Print)9798350314588
DOIs
Publication statusPublished - Jul 1 2023
Event2023 IEEE Congress on Evolutionary Computation, CEC 2023 - Chicago, United States
Duration: Jul 1 2023Jul 5 2023

Publication series

Name2023 IEEE Congress on Evolutionary Computation (CEC)

Conference

Conference2023 IEEE Congress on Evolutionary Computation, CEC 2023
Country/TerritoryUnited States
CityChicago
Period7/1/237/5/23

Keywords

  • Data Analytics
  • Feature Ranking
  • Feature Selection
  • High-Dimensional Data
  • Information Theory
  • Multi-Stage Genetic Programming

ASJC Scopus subject areas

  • Computer Science Applications
  • Computational Mathematics
  • Control and Optimization
  • Modelling and Simulation

Cite this