skip to main content
10.1145/3583133.3595830acmconferencesArticle/Chapter ViewAbstractPublication PagesgeccoConference Proceedingsconference-collections
abstract

Evolutionary Symbolic Regression: Mechanisms from the Perspectives of Morphology and Adaptability

Published:24 July 2023Publication History

ABSTRACT

Symbolic Regression (SR) is the task of finding closed-form analytical expressions that describe the relationship between variables in a dataset. In this work, werethink SR and introduce mechanisms from two perspectives: morphology and adaptability. Morphology: Man-made heuristics are typically utilized in SR algorithms to influence the morphology (or structure) of candidate expressions, potentially introducing unintentional bias and data leakage. To address this issue, we create a depth-aware mathematical language model trained on terminal walks of expression trees, as a replacement to these heuristics. Adaptability: We promote alternating fitness functions across generations, eliminating equations that perform well in only one fitness function and as a result, discover expressions that are closer to the true functional form. We demonstrate this by alternating fitness functions that quantify faithfulness to values (via MSE) and empirical derivatives (via a novel theoretically justified fitness metric coined MSEDI). Proof-of-concept: We combine these ideas into a minimalistic evolutionary SR algorithm that outperforms a suite of benchmark and state of-the-art SR algorithms in problems with unknown constants added, which we claim are more reflective of SR performance for real-world applications. Our claim is then strengthened by reproducing the superior performance on real-world regression datasets from SRBench. This Hot-of-the-Press paper summarizes the work K.S. Fong, S. Wongso and M. Motani, "Rethinking Symbolic Regression: Morphology and Adaptability in the Context of Evolutionary Algorithms", The Eleventh International Conference on Learning International Conference on Learning Representations (ICLR'23).

References

  1. John R Koza. Genetic programming. on the programming of computers by means of natural selection. Complex adaptive systems, 1992.Google ScholarGoogle Scholar
  2. Kei Sen Fong, et al. Rethinking symbolic regression: Morphology and adaptability in the context of evolutionary algorithms. In International Conference on Learning Representations, 2023.Google ScholarGoogle Scholar
  3. Tony Worm and Kenneth Chiu. Prioritized grammar enumeration: symbolic regression by dynamic programming. In Annual Conference on Genetic and Evolutionary Computation, pages 1021--1028, 2013.Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Brenden K Petersen et al. Deep symbolic regression: Recovering mathematical expressions from data via risk-seeking policy gradients. In International Conference on Learning Representations, 2021.Google ScholarGoogle Scholar
  5. Iwo Bladek and Krzysztof Krawiec. Solving symbolic regression problems with formal constraints. In Genetic and Evolutionary Computation Conference, pages 977--984, 2019.Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Mengjie Zhang, et al. Online program simplification in genetic programming. In Asia-Pacific Conference on Simulated Evolution and Learning, pages 592--600. Springer, 2006.Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Christian Loftis, et al. Lattice thermal conductivity prediction using symbolic regression and machine learning. The Journal of Physical Chemistry A, 125(1):435--450, 2020.Google ScholarGoogle ScholarCross RefCross Ref
  8. Michael F Korns. A baseline symbolic regression algorithm. In Genetic Programming Theory and Practice X, pages 117--137. Springer, 2013.Google ScholarGoogle ScholarCross RefCross Ref
  9. Patryk Orzechowski, et al. Where are we now? a large benchmark study of recent symbolic regression methods. In Genetic and Evolutionary Computation Conference, pages 1183--1190, 2018.Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Terrell Mundhenk et al. Symbolic regression via deep reinforcement learning enhanced genetic programming seeding. Advances in Neural Information Processing Systems, 34:24912--24923, 2021.Google ScholarGoogle Scholar
  11. Ying Jin, et al. Bayesian symbolic regression. arXiv:1910.08892, 2019.Google ScholarGoogle Scholar
  12. Nguyen Quang Uy et al. Semantically-based crossover in genetic programming: application to real-valued symbolic regression. Genetic Programming and Evolvable Machines, 12(2):91--119, 2011.Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Krzysztof Krawiec and Tomasz Pawlak. Approximating geometric crossover by semantic backpropagation. In Annual conference on Genetic and evolutionary computation, pages 941--948, 2013.Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Michael Schmidt and Hod Lipson. Symbolic regression of implicit equations. In Genetic Programming Theory and Practice VII, pages 73--85. Springer, 2010.Google ScholarGoogle ScholarCross RefCross Ref
  15. Michael Schmidt and Hod Lipson. Distilling free-form natural laws from experimental data. Science, 324(5923):81--85, 2009.Google ScholarGoogle ScholarCross RefCross Ref
  16. Patrick Bateson. Adaptability and evolution. Interface Focus, 7(5):20160126, 2017.Google ScholarGoogle ScholarCross RefCross Ref
  17. William La Cava et al. Contemporary symbolic regression methods and their relative performance. arXiv:2107.14351, 2021.Google ScholarGoogle Scholar

Index Terms

  1. Evolutionary Symbolic Regression: Mechanisms from the Perspectives of Morphology and Adaptability

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        GECCO '23 Companion: Proceedings of the Companion Conference on Genetic and Evolutionary Computation
        July 2023
        2519 pages
        ISBN:9798400701207
        DOI:10.1145/3583133

        Copyright © 2023 Owner/Author(s)

        Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the owner/author(s).

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 24 July 2023

        Check for updates

        Qualifiers

        • abstract

        Acceptance Rates

        Overall Acceptance Rate1,669of4,410submissions,38%

        Upcoming Conference

        GECCO '24
        Genetic and Evolutionary Computation Conference
        July 14 - 18, 2024
        Melbourne , VIC , Australia
      • Article Metrics

        • Downloads (Last 12 months)51
        • Downloads (Last 6 weeks)9

        Other Metrics

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader