Ensemble Learning and Pruning in Multi-Objective Genetic Programming for Classification with Unbalanced Data

Bhowan, Urvesh; Johnston, Mark; Zhang, Mengjie

doi:10.1007/978-3-642-25832-9_20

Ensemble Learning and Pruning in Multi-Objective Genetic Programming for Classification with Unbalanced Data

Urvesh Bhowan²¹,
Mark Johnston²¹ &
Mengjie Zhang²¹

Conference paper

2339 Accesses
13 Citations

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 7106))

Abstract

Machine learning algorithms can suffer a performance bias when data sets are unbalanced. This paper develops a multi-objective genetic programming approach to evolving accurate and diverse ensembles of non-dominated solutions where members vote on class membership. We explore why the ensembles can also be vulnerable to the learning bias using a range of unbalanced data sets. Based on the notion that smaller ensembles can be better than larger ensembles, we develop a new evolutionary-based pruning method to find groups of highly-cooperative individuals that can improve accuracy on the important minority class.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Weiss, G.M., Provost, F.: Learning when training data are costly: The effect of class distribution on tree induction. Journal of Artificial Intelligence Research 19, 315–354 (2003)
MATH Google Scholar
Chawla, N.V., Sylvester, J.: Exploiting Diversity in Ensembles: Improving the Performance on Unbalanced Datasets. In: Haindl, M., Kittler, J., Roli, F. (eds.) MCS 2007. LNCS, vol. 4472, pp. 397–406. Springer, Heidelberg (2007)
Chapter Google Scholar
Mclntyre, A., Heywood, M.: Multi-objective competitive coevolution for efficient GP classifier problem decomposition. In: IEEE International Conference on Systems, Man and Cybernetics, pp. 1930–1937 (2007)
Google Scholar
Wang, S., Tang, K., Yao, X.: Diversity exploration and negative correlation learning on imbalanced data sets. In: International Joint Conference on Neural Networks, pp. 3259–3266 (2009)
Google Scholar
Koza, J.R.: Genetic Programming: On the Programming of Computers by Means of Natural Selection. MIT Press (1992)
Google Scholar
Holmes, J.H.: Differential negative reinforcement improves classifier system learning rate in two-class problems with unequal base rates. In: Koza, J.R., Banzhaf, W., Chellapilla, K., et al. (eds.) Genetic Programming 1998: Proceedings of the Third Annual Conference, pp. 635–644 (1998)
Google Scholar
Bhowan, U., Zhang, M., Johnston, M.: Evolving ensembles in multi-objective genetic programming for classification with unbalanced data. In: Proceedings of 2011 Genetic and Evolutionary Computation Conference, pp. 1331–1339. ACM (2011)
Google Scholar
Zitzler, E., Laumanns, M., Thiele, L.: Spea2: Improving the strength pareto evolutionary algorithm for multiobjective optimization. Technical report (2001), TIK-Report 103, Department of Electrical Engineering, Swiss Federal Institute of Technology
Google Scholar
Jin, Y., Sendhoff, B.: Pareto-based multiobjective machine learning: An overview and case studies. IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews 38, 397–415 (2008)
Google Scholar
Chandra, A., Yao, X.: Ensemble learning using multi-objective evolutionary algorithms. Journal of Mathematical Modelling and Algorithms 5, 417–445 (2006)
Article MathSciNet MATH Google Scholar
Chen, H., Tino, P., Yao, X.: Predictive ensemble pruning by expectation propagation. IEEE Transactions on Knowledge and Data Engineering 21, 999–1013 (2009)
Article Google Scholar
Asuncion, A., Newman, D.: UCI Machine Learning Repository, University of California, Irvine, School of Information and Computer Sciences (2007), http://www.ics.uci.edu/~mlearn/MLRepository.html
Hall, M., Frank, E., Holmes, G., Pfahringer, B., Reutemann, P., Witten, I.H.: The WEKA data mining software: An update. SIGKDD Explorations 11 (1) (2009)
Google Scholar

Download references

Author information

Authors and Affiliations

Evolutionary Computation Research Group, Victoria University of Wellington, New Zealand
Urvesh Bhowan, Mark Johnston & Mengjie Zhang

Authors

Urvesh Bhowan
View author publications
You can also search for this author in PubMed Google Scholar
Mark Johnston
View author publications
You can also search for this author in PubMed Google Scholar
Mengjie Zhang
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

School of Engineering and Mathematical Sciences, La Trobe University, 3086, Melbourne, VIC, Australia
Dianhui Wang
School of Computer Science and Software Engineering, The University of Western Australia, 6009, Perth, WA, Australia
Mark Reynolds

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Bhowan, U., Johnston, M., Zhang, M. (2011). Ensemble Learning and Pruning in Multi-Objective Genetic Programming for Classification with Unbalanced Data. In: Wang, D., Reynolds, M. (eds) AI 2011: Advances in Artificial Intelligence. AI 2011. Lecture Notes in Computer Science(), vol 7106. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-25832-9_20

Download citation

DOI: https://doi.org/10.1007/978-3-642-25832-9_20
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-25831-2
Online ISBN: 978-3-642-25832-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics