skip to main content
10.1145/3377929.3390078acmconferencesArticle/Chapter ViewAbstractPublication PagesgeccoConference Proceedingsconference-collections
poster

Feature engineering for improving robustness of crossover in symbolic regression

Published:08 July 2020Publication History

ABSTRACT

Isolating the fitness-contribution of substructures is typically a difficult task in Genetic Programming (GP). Hence, useful substructures are lost when the overall structure (model) performs poorly. Furthermore, while crossover is heavily used in GP, it typically produces offspring models with significantly lower fitness than that of the parents. In symbolic regression, this degradation also occurs because the coefficients of an evolving model lose utility after crossover. This paper proposes isolating the fitness-contribution of various substructures and reducing the negative impact of crossover by evolving a set of features instead of monolithic models. The method then leverages multiple linear regression (MLR) to optimise the coefficients of these features. Since adding new features cannot degrade the accuracy of an MLR produced model, MLR-aided GP models can bloat. To penalise such additions, we use Adjusted R2 as the fitness function. The paper compares the proposed method with standard GP and GP with linear scaling. Experimental results show that the proposed method matches the accuracy of the competing methods within only 1/10th of the number of generations. Also, the method significantly decreases the rate of post-crossover fitness degradation.

References

  1. Cesar L. Alonso, Jose Luis Montana, and Cruz Enrique Borges. 2009. Evolution Strategies for Constants Optimization in Genetic Programming. In 21st International Conference on Tools with Artificial Intelligence, ICTAI '09. IEEE, Newark, NJ, USA, 703--707. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Peter J. Angeline. 1997. Subtree Crossover: Building Block Engine or Macro-mutation?. In Genetic Programming 1997: Proceedings of the Second Annual Conference, Koza et al (Ed.). Morgan Kaufmann, Stanford University, CA, USA, 9--17. http://ncra.ucd.ie/COMP41190/SubtreeXoverBuildingBlockorMacromutation_angeline_gp97.psGoogle ScholarGoogle Scholar
  3. Raja Muhammad Atif Azad and Conor Ryan. 2014. A Simple Approach to Lifetime Learning in Genetic Programming based Symbolic Regression. Evolutionary Computation 22, 2 (jul 2014), 287--317. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Wolfgang Banzhaf, Peter Nordin, Robert E. Keller, and Frank D. Francone. 1998. Genetic Programming - An Introduction; On the Automatic Evolution of Computer Programs and its Applications. Morgan Kaufmann, San Francisco, CA, USA. https://www.amazon.co.uk/Genetic-Programming-Introduction-Artificial-Intelligence/dp/155860510XGoogle ScholarGoogle ScholarDigital LibraryDigital Library
  5. Shu-Heng Chen and Tzu-Wen Kuo. 2003. Overfitting or Poor Learning: A Critique of Current Financial Applications of GP. In Genetic Programming, Proceedings of EuroGP'2003 (LNCS), Ryan et al (Ed.), Vol. 2610. Springer-Verlag, Essex, 34--46. Google ScholarGoogle ScholarCross RefCross Ref
  6. Hammad Majeed, Conor Ryan, and R. Muhammad Atif Azad. 2005. Evaluating GP schema in context. In Proceedings of the 7th annual conference on Genetic and evolutionary computation. ACM, New York, NY, USA, 1773--1774.Google ScholarGoogle Scholar
  7. Nicholas Freitag McPhee, Brian Ohs, and Tyler Hutchison. 2008. Semantic Building Blocks in Genetic Programming. In Proceedings of the 11th European Conference on Genetic Programming, EuroGP 2008 (Lecture Notes in Computer Science), O'Neill et al (Ed.), Vol. 4971. Springer, Naples, 134--145. Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. Feature engineering for improving robustness of crossover in symbolic regression

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Conferences
          GECCO '20: Proceedings of the 2020 Genetic and Evolutionary Computation Conference Companion
          July 2020
          1982 pages
          ISBN:9781450371278
          DOI:10.1145/3377929

          Copyright © 2020 Owner/Author

          Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 8 July 2020

          Check for updates

          Qualifiers

          • poster

          Acceptance Rates

          Overall Acceptance Rate1,669of4,410submissions,38%

          Upcoming Conference

          GECCO '24
          Genetic and Evolutionary Computation Conference
          July 14 - 18, 2024
          Melbourne , VIC , Australia

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader