A Preliminary Investigation of Overfitting in Evolutionary Driven Model Induction: Implications for Financial Modelling

Tuite, Clíodhna; Agapitos, Alexandros; O’Neill, Michael; Brabazon, Anthony

doi:10.1007/978-3-642-20520-0_13

A Preliminary Investigation of Overfitting in Evolutionary Driven Model Induction: Implications for Financial Modelling

Clíodhna Tuite³⁰,
Alexandros Agapitos³⁰,
Michael O’Neill³⁰ &
…
Anthony Brabazon³⁰

Conference paper

1614 Accesses
12 Citations

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 6625))

Abstract

This paper investigates the effects of early stopping as a method to counteract overfitting in evolutionary data modelling using Genetic Programming. Early stopping has been proposed as a method to avoid model overtraining, which has been shown to lead to a significant degradation of out-of-sample performance. If we assume some sort of performance metric maximisation, the most widely used early training stopping criterion is the moment within the learning process that an unbiased estimate of the performance of the model begins to decrease after a strictly monotonic increase through the earlier learning iterations. We are conducting an initial investigation on the effects of early stopping in the performance of Genetic Programming in symbolic regression and financial modelling. Empirical results suggest that early stopping using the above criterion increases the extrapolation abilities of symbolic regression models, but is by no means the optimal training-stopping criterion in the case of a real-world financial dataset.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Becker, L.A., Seshadri, M.: Comprehensibility and overfitting avoidance in genetic programming for technical trading rules. Worcester Polytechnic Institute, Computer Science Technical Report (2003)
Google Scholar
Bishop, C.M.: Neural Networks for Pattern Recognition. Oxford University Press, Oxford (1996)
MATH Google Scholar
Brabazon, A., Dang, J., Dempsey, I., O’Neill, M., Edelman, D.: Natural computing in finance: a review (2010)
Google Scholar
Brabazon, A., O’Neill, M.: Biologically inspired algorithms for financial modelling. Springer, Heidelberg (2006)
MATH Google Scholar
Chauvin, Y.: Generalisation performance of overtrained back-propagation networks. In: EUROSIP Workshop, pp. 46–55 (1990)
Google Scholar
Dempsey, I., O’Neill, M., Brabazon, A.: Foundations in Grammatical Evolution for Dynamic Environments. Springer, Heidelberg (2009)
Book Google Scholar
Duda, R., Hart, P., Stork, D.: Pattern Classification, 2nd edn. John Wiley and Sons, Chichester (2001)
MATH Google Scholar
Mckay, R.I., Hoai, N.X., Whigham, P.A., Shan, Y., O’Neill, M.: Grammar-based Genetic Programming: a survey. Genetic Programming and Evolvable Machines 11(3-4), 365–396 (2010)
Article Google Scholar
Mitchell, T.: Machine Learning. McGraw-Hill, New York (1997)
MATH Google Scholar
Newman, D.J., Hettich, S., Blake, C.L., Merz, C.J.: UCI repository of machine learning databases (1998)
Google Scholar
O’Neill, M., Hemberg, E., Gilligan, C., Bartley, E., McDermott, J., Brabazon, A.: GEVA: grammatical evolution in Java. ACM SIGEVOlution 3(2), 17–22 (2008)
Article Google Scholar
O’Neill, M., Ryan, C.: Grammatical Evolution: Evolutionary automatic programming in an arbitrary language. Springer, Netherlands (2003)
Book MATH Google Scholar
O’Neill, M., Vanneschi, L., Gustafson, S., Banzhaf, W.: Open issues in genetic programming. Genetic Programming and Evolvable Machines 11(3-4), 339–363 (2010)
Article Google Scholar
Paris, G., Robilliard, D., Fonlupt, C.: Exploring overfitting in genetic programming. In: Artificial Evolution, pp. 267–277. Springer, Heidelberg (2004)
Chapter Google Scholar
Prechelt, L.: Early stopping-but when? In: Neural Networks: Tricks of the trade, pp. 553–553 (1998)
Google Scholar
Sarle, W.S.: Stopped training and other remedies for overfitting. In: Proceedings of the 27th Symposium on the Interface of Computing Science and Statistics, pp. 352–360 (1995)
Google Scholar

Download references

Author information

Authors and Affiliations

Financial Mathematics and Computation Cluster Natural Computing Research and Applications Group Complex and Adaptive Systems Laboratory, University College Dublin, Ireland
Clíodhna Tuite, Alexandros Agapitos, Michael O’Neill & Anthony Brabazon

Authors

Clíodhna Tuite
View author publications
You can also search for this author in PubMed Google Scholar
Alexandros Agapitos
View author publications
You can also search for this author in PubMed Google Scholar
Michael O’Neill
View author publications
You can also search for this author in PubMed Google Scholar
Anthony Brabazon
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Mathematics and Statistics, University of Strathclyde, 16 Richmond Street, G1 1XQ, Glasgow, UK
Cecilia Di Chio
School of Business, University College Dublin, 4, Belfield, Dublin, UK
Anthony Brabazon
Istituto Dalle Molle di Studi sull’Intelligenza Artificiale (IDSIA), Galleria 2, 6928, Manno-Lugano, Switzerland
Gianni A. Di Caro
Institute of Computer Science, University of Bremen, 28359, Bremen, Germany
Rolf Drechsler
Next Generation Intelligent Networks Research Center (nexGIN RC), National University of Computer & Emerging Sciences (FAST-NU), Islamabad, Pakistan
Muddassar Farooq
Department of Information Systems, Johannes Gutenberg-Universität, 55099, Mainz, Germany
Jörn Grahl
Mathematics and Computer Science Depatment, University of Richmond, 23173, Richmond, VA, USA
Gary Greenfield
ICD - OSI, University of Technology of Troyes, 12, Rue Marie Curie, BP 2060, 10010, Troyes, France
Christian Prins
Facultad de Informatica, Universidade de A Coruña, 15071, Coruña, CP, Spain
Juan Romero
Dipartimento di Automatica e Informatica, Politecnico di Torino, Italy
Giovanni Squillero
Consiglio Nazionale delle Ricerche (CNR), Istituto di Calcolo e Reti ad Alte Prestazioni (ICAR), Via P. Castellino 111, 80131, Naples, Italy
Ernesto Tarantino
Dipartimento di Tecnologie dell’Informazione, Università degli Studi di Milano, Via Bramante 65, 26013, Crema (CR), Italy
Andrea G. B. Tettamanzi
School of Computing, Edinburgh Napier University, 10 Clinton Road, EH10 5DT, Edinburgh, UK
Neil Urquhart
Computer Engineering Department, Istanbul Technical University, Room Nr. 3302, Maslak, 34469, Istanbul, Turkey
A. Şima Uyar

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Tuite, C., Agapitos, A., O’Neill, M., Brabazon, A. (2011). A Preliminary Investigation of Overfitting in Evolutionary Driven Model Induction: Implications for Financial Modelling. In: Di Chio, C., et al. Applications of Evolutionary Computation. EvoApplications 2011. Lecture Notes in Computer Science, vol 6625. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-20520-0_13

Download citation

DOI: https://doi.org/10.1007/978-3-642-20520-0_13
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-20519-4
Online ISBN: 978-3-642-20520-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics