skip to main content
10.1145/3512290.3528695acmconferencesArticle/Chapter ViewAbstractPublication PagesgeccoConference Proceedingsconference-collections
research-article

Transformation-interaction-rational representation for symbolic regression

Published:08 July 2022Publication History

ABSTRACT

Symbolic Regression searches for a function form that approximates a dataset often using Genetic Programming. Since there is usually no restriction to what form the function can have, Genetic Programming may return a hard to understand model due to non-linear function chaining or long expressions. A novel representation called Interaction-Transformation was recently proposed to alleviate this problem. In this representation, the function form is restricted to an affine combination of terms generated as the application of a single univariate function to the interaction of selected variables. This representation obtained competing solutions on standard benchmarks. Despite the initial success, a broader set of benchmarking functions revealed the limitations of the constrained representation. In this paper we propose an extension to this representation, called Transformation-Interaction-Rational representation that defines a new function form as the rational of two Interaction-Transformation functions. Additionally, the target variable can also be transformed with an univariate function. The main goal is to improve the approximation power while still constraining the overall complexity of the expression. We tested this representation with a standard Genetic Programming with crossover and mutation. The results show a great improvement when compared to its predecessor and a state-of-the-art performance for a large benchmark.

References

  1. Guilherme Seidyo Imai Aldeia and Fabricio Olivetti de Franca. 2020. A Parametric Study of Interaction-Transformation Evolutionary Algorithm for Symbolic Regression. In 2020 IEEE Congress on Evolutionary Computation (CEC). IEEE, New York, 8 pages. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Guilherme Seidyo Imai Aldeia and Fabrício Olivetti de França. 2018. Lightweight Symbolic Regression with the Interaction - Transformation Representation. In 2018 IEEE Congress on Evolutionary Computation (CEC). IEEE, New York, 8 pages. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Ignacio Arnaldo, Krzysztof Krawiec, and Una-May O'Reilly. 2014. Multiple regression genetic programming. In Proceedings of the 2014 Annual Conference on Genetic and Evolutionary Computation. ACM, 879--886.Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Bogdan Burlacu, Gabriel Kronberger, and Michael Kommenda. 2020. Operon C++ an eficient genetic programming framework for symbolic regression. In Proceedings of the 2020 Genetic and Evolutionary Computation Conference Companion. 1562--1570.Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. F. O. de Franca and G. S. I. Aldeia. 2020. Interaction-Transformation Evolutionary Algorithm for Symbolic Regression. Evolutionary Computation (12 2020), 1--25. arXiv:https://direct.mit.edu/evco/article-pdf/doi/10.1162/evco_a_00285/1888497/evco_a_00285.pdf Google ScholarGoogle ScholarCross RefCross Ref
  6. Fabricio Olivetti de Franca and Maira Zabuscha de Lima. 2021. Interaction-transformation symbolic regression with extreme learning machine. Neurocomputing 423 (2021), 609--619.Google ScholarGoogle ScholarCross RefCross Ref
  7. Grant Dick. 2014. Bloat and generalisation in symbolic regression. In Asia-Pacific Conference on Simulated Evolution and Learning. Springer, 491--502.Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Norbert Gaffke and Berthold Heiligers. 1996. 30 Approximate designs for polynomial regression: Invariance, admissibility, and optimality. Handbook of Statistics 13 (1996), 1149--1199.Google ScholarGoogle ScholarCross RefCross Ref
  9. Andrew Gelman, Jennifer Hill, and Aki Vehtari. 2020. Regression and other stories. Cambridge University Press.Google ScholarGoogle Scholar
  10. Alexandre Goldsztejn. 2008. Modal intervals revisited part 1: A generalized interval natural extension. (2008).Google ScholarGoogle Scholar
  11. Frank E Harrell. 2017. Regression modeling strategies. Bios 330, 2018 (2017), 14.Google ScholarGoogle Scholar
  12. Timothy Hickey, Qun Ju, and Maarten H Van Emden. 2001. Interval arithmetic: From principles to implementation. Journal of the ACM (JACM) 48, 5 (2001), 1038--1068.Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Kevin Jamieson and Ameet Talwalkar. 2016. Non-stochastic best arm identification and hyperparameter optimization. In Artificial Intelligence and Statistics. PMLR, 240--248.Google ScholarGoogle Scholar
  14. Daniel Kantor, Fernando J Von Zuben, and Fabricio Olivetti de Franca. 2021. Simulated annealing for symbolic regression. In Proceedings of the Genetic and Evolutionary Computation Conference. 592--599.Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Robert E Kass. 1990. Nonlinear regression analysis and its applications. J. Amer. Statist. Assoc. 85, 410 (1990), 594--596.Google ScholarGoogle ScholarCross RefCross Ref
  16. Maarten Keijzer. 2004. Scaled symbolic regression. Genetic Programming and Evolvable Machines 5, 3 (2004), 259--269.Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Michael Kommenda, Bogdan Burlacu, Gabriel Kronberger, and Michael Affenzeller. 2020. Parameter identification for symbolic regression using nonlinear least squares. Genetic Programming and Evolvable Machines 21, 3 (2020), 471--501.Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. John R Koza et al. 1994. Genetic programming II. Vol. 17. MIT press Cambridge, MA.Google ScholarGoogle Scholar
  19. William La Cava and Jason H. Moore. 2019. Semantic Variation Operators for Multidimensional Genetic Programming. In Proceedings of the Genetic and Evolutionary Computation Conference (Prague, Czech Republic) (GECCO '19). ACM, New York, NY, USA, 1056--1064. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. William La Cava, Patryk Orzechowski, Bogdan Burlacu, Fabricio Olivetti de França, Marco Virgolin, Ying Jin, Michael Kommenda, and Jason H. Moore. 2021. Contemporary Symbolic Regression Methods and their Relative Performance. In Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks. https://openreview.net/pdf?id=xVQMrDLyGstGoogle ScholarGoogle Scholar
  21. William La Cava, Tilak Raj Singh, James Taggart, Srinivas Suri, and Jason Moore. 2019. Learning concise representations for regression by evolving networks of trees. In International Conference on Learning Representations. https://openreview.net/forum?id=Hke-JhA9Y7Google ScholarGoogle Scholar
  22. William B Langdon. 1999. Size fair and homologous tree genetic programming crossovers. In Proceedings of the 1st Annual Conference on Genetic and Evolutionary Computation-Volume 2. Morgan Kaufmann Publishers Inc., 1092--1097.Google ScholarGoogle Scholar
  23. SH Alizadeh Moghaddam, M Mokhtarzade, A Alizadeh Naeini, and SA Alizadeh Moghaddama. 2017. Statistical method to overcome overfitting issue in rational function models. International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences 42, 4/W4 (2017).Google ScholarGoogle ScholarCross RefCross Ref
  24. Fabrício Olivetti de França. 2018. A greedy search tree heuristic for symbolic regression. Information Sciences 442-443 (2018), 18 -- 32. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Riccardo Poli, William B Langdon, Nicholas F McPhee, and John R Koza. 2008. A field guide to genetic programming. Lulu. com.Google ScholarGoogle Scholar
  26. Veli-Matti Taavitsainen. 2010. Ridge and PLS based rational function regression. Journal of chemometrics 24, 11-12 (2010), 665--673.Google ScholarGoogle ScholarCross RefCross Ref
  27. Veli-Matti Taavitsainen. 2013. Rational function ridge regression in kinetic modeling: A case study. Chemometrics and Intelligent Laboratory Systems 120 (2013), 136--141.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. Transformation-interaction-rational representation for symbolic regression

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      GECCO '22: Proceedings of the Genetic and Evolutionary Computation Conference
      July 2022
      1472 pages
      ISBN:9781450392372
      DOI:10.1145/3512290

      Copyright © 2022 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 8 July 2022

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      Overall Acceptance Rate1,669of4,410submissions,38%

      Upcoming Conference

      GECCO '24
      Genetic and Evolutionary Computation Conference
      July 14 - 18, 2024
      Melbourne , VIC , Australia

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader