Symbolic Regression Is Not Enough: It Takes a Village to Raise a Model

Genetic Programming Theory and Practice X

Abstract

From a real-world perspective, the core representations and evolutionary strategies of genetic programming are good enough, assuming state-of-the-art algorithms and implementations are being used. What industrial symbolic regression needs are tools to (a) explore and refine the data, (b) explore the developed model space and extract insight and guidance from the available sample of the infinite possibilities of model forms, and (c) identify appropriate models for deployment as predictors, emulators, etc. This chapter focuses on the approaches used in DataModeler to address the modeling life cycle, with a special focus on the identification of driving variables and metavariables. Exploiting the diversity of search paths followed during independent evolutions, and then looking at the distributions of variable and metavariable usage, also provides an opportunity to gather key insights. The goal of this framework, however, is not to replace the modeler but rather to augment the inclusion of context and the collection of insight by removing mechanistic requirements and facilitating the ability to think. We believe that the net result is higher-quality and more robust models.
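
As a rough illustration of looking at the distribution of variable usage across independent evolutions, the sketch below counts the fraction of models in which each candidate variable appears. It is not DataModeler's implementation: the function name `variable_presence`, the string representation of models, and the example expressions are all hypothetical and chosen only to make the idea concrete.

```python
import re
from collections import Counter


def variable_presence(models, variables):
    """Return, for each variable, the fraction of models that use it.

    `models` is a list of expression strings (an assumed representation);
    `variables` is the list of candidate input names.
    """
    counts = Counter()
    for expr in models:
        # Identifiers found in the expression; numeric constants are skipped.
        tokens = set(re.findall(r"[A-Za-z_]\w*", expr))
        for v in variables:
            if v in tokens:
                counts[v] += 1
    n = len(models)
    if n == 0:
        return {v: 0.0 for v in variables}
    return {v: counts[v] / n for v in variables}


# Hypothetical models pooled from several independent evolutions.
models = [
    "x1 + 2.0 * x3",
    "x1 * x3 - 0.5",
    "x1 / (1.0 + x2)",
    "x3 * x3 + x1",
]
presence = variable_presence(models, ["x1", "x2", "x3", "x4"])

# Rank variables by how often they were selected.
ranking = sorted(presence, key=presence.get, reverse=True)
```

Under these assumptions, a variable that appears in most of the pooled models is a candidate driving variable, while one that never appears is a candidate for removal; the modeler still decides what the ranking means in context.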

Author information

Correspondence to Mark E. Kotanchek.

Copyright information

© 2013 Springer Science+Business Media New York

About this chapter

Cite this chapter

Kotanchek, M.E., Vladislavleva, E., Smits, G. (2013). Symbolic Regression Is Not Enough: It Takes a Village to Raise a Model. In: Riolo, R., Vladislavleva, E., Ritchie, M., Moore, J. (eds) Genetic Programming Theory and Practice X. Genetic and Evolutionary Computation. Springer, New York, NY. https://doi.org/10.1007/978-1-4614-6846-2_13

  • DOI: https://doi.org/10.1007/978-1-4614-6846-2_13

  • Publisher Name: Springer, New York, NY

  • Print ISBN: 978-1-4614-6845-5

  • Online ISBN: 978-1-4614-6846-2

  • eBook Packages: Computer Science, Computer Science (R0)
