Abstract
An extension of Cellular Genetic Programming for data classification with the boosting technique is presented and a comparison with the bagging-like majority voting approach is performed. The method is able to deal with large data sets that do not fit in main memory since each classifier is trained on a subset of the overall training data. Experiments showed that, by using a sample of reasonable size, the extension with these voting algorithms enhances classification accuracy at a much lower computational cost.
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Bauer, E., Kohavi, R.: An empirical comparison of voting classification algorithms: Bagging, boosting, and variants. Machine Learning (36), 105–139 (1999)
Breiman, L.: Bagging predictors. Machine Learning 24(2), 123–140 (1996)
Fayyad, U.M., Piatesky-Shapiro, G., Smith, P.: From data mining to knowledge discovery: an overview. In: Fayyad, U.M., et al. (eds.) Advances in Knowledge Discovery and Data Mining, pp. 1–34. AAAI/MIT Press (1996)
Folino, G., Pizzuti, C., Spezzano, G.: A cellular genetic programming approach to classification. In: Proc. Of the Genetic and Evolutionary Computation Conference GECCO 1999, Orlando, Florida, July 1999, pp. 1015–1020. Morgan Kaufmann, San Francisco (1999)
Folino, G., Pizzuti, C., Spezzano, G.: Ensemble techniques for parallel genetic programming based classifiers. In: Ryan, C., Soule, T., Keijzer, M., Tsang, E.P.K., Poli, R., Costa, E. (eds.) EuroGP 2003. LNCS, vol. 2610, pp. 59–69. Springer, Heidelberg (2003)
Folino, G., Pizzuti, C., Spezzano, G.: A scalable cellular implementation of parallel genetic programming. IEEE Transaction on Evolutionary Computation 7(1), 37–53 (2003)
Freitas, A.A.: A genetic programming framework for two data mining tasks: Classification and generalised rule induction. In: Proceedings of the 2nd Int. Conference on Genetic Programming, Stanford University, CA, USA, pp. 96–101 (1997)
Freund, Y., Scapire, R.: Experiments with a new boosting algorithm. In: Proceedings of the 13th Int. Conference on Machine Learning, pp. 148–156 (1996)
Iba, H.: Bagging, boosting, and bloating in genetic programming. In: Proc. Of the Genetic and Evolutionary Computation Conference GECCO 1999, Orlando, Florida, July 1999, pp. 1053–1060. Morgan Kaufmann, San Francisco (1999)
Koza, J.R.: Genetic Programming: On the Programming of Computers by means of Natural Selection. MIT Press, Cambridge (1992)
Marmelstein, R.E., Lamont, G.B.: Pattern classification using a hybbrid genetic program - decision tree approach. In: Proceedings of the Third Annual Conference on Genetic Programming. Morgan Kaufmann, San Francisco (1998)
Merz, C.J., Murphy, P.M.: In: UCI repository of Machine Learning (1996), http://www.ics.uci/mlearn/MLRepository.html
Nikolaev, N.I., Slavov, V.: Inductive genetic programming with decision trees. In: Proceedings of the 9th International Conference on Machine Learning, Prague, Czech Republic (1997)
Ross Quinlan, J.: Bagging, boosting, and c4.5. In: Proceedings of the 13th National Conference on Artificial Intelligence AAAI 1996, pp. 725–730. Mit Press, Cambridge (1996)
Ryan, M.D., Rayward-Smith, V.J.: The evolution of decision trees. In: Proceedings of the Third Annual Conference on Genetic Programming, Morgan Kaufmann, San Francisco (1998)
Schapire, R.E.: The strength of weak learnability. Machine Learning 5(2), 197–227 (1990)
Sreenivas, M., AlSabti, K., Ranka, S.: Parallel out-of-core decision tree classifiers. In: Kargupta, H., Chan, P. (eds.) in Advances in Distributed Data Mining, Menlo Park, AAAI Press, Menlo Park (1996)
Srivastava, E., Han, V.: Kumar, and V. Singh. Parallel formulations of decision tree classification algorithms. Data Mining and Knowledge Discovery 3(2), 237–261 (1999)
Tomassini, M.: Parallel and distributed evolutionary algorithms: A review. In: Neittaanmäki, P., Miettinen, K., Mäkelä, M., Periaux, J. (eds.) Evolutionary Algorithms in Engineering and Computer Science. J. Wiley and Sons, Chichester (1999)
Zaki, M., Parthasarathy, S., Ogihara, M., Li, W.: Parallel algorithms for discovery of association rules. Data Mining and Knowledge Discovery, special issue on Scalable High Performance Computing, 1(4), 343–373 (1997)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Folino, G., Pizzuti, C., Spezzano, G. (2004). Boosting Technique for Combining Cellular GP Classifiers. In: Keijzer, M., O’Reilly, UM., Lucas, S., Costa, E., Soule, T. (eds) Genetic Programming. EuroGP 2004. Lecture Notes in Computer Science, vol 3003. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-24650-3_5
Download citation
DOI: https://doi.org/10.1007/978-3-540-24650-3_5
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-21346-8
Online ISBN: 978-3-540-24650-3
eBook Packages: Springer Book Archive