Abstract
Visual analytics is a human-machine collaboration to data modeling where extraction of the most informative features plays an important role. Although feature extraction is a multi-objective task, the traditional algorithms either only consider one objective or aggregate the objectives into one scalar criterion to optimize. In this paper, we propose a Pareto-based multi-objective approach to feature extraction for visual analytics applied to data classification problems. We identify classifiability, visual interpretability and semantic interpretability as the three equally important objectives for feature extraction in classification problems and define various measures to quantify these objectives. Our results on a number of benchmark datasets show consistent improvement compared to three standard dimensionality reduction techniques. We also argue that exploration of the multiple Pareto-optimal models provide more insight about the classification problem as opposed to a single optimal solution.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Bellman, R.: Adaptive Control Processes. Princeton University Press, Princeton (1961)
Duda, R.O., Hart, P.E., Stork, D.G.: Pattern Classification. Wiley, Chichester (2000)
Estèbanez, C., Aler, R., Valls, J.M.: A method based on genetic programming for improving the quality of datasets in classification problems. International Journal of Computer Science and Applications 4, 69–80 (2007)
Luke, S., et al.: Ecj: A java-based evolutionary computation research system., http://cs.gmu.edu/~eclab/projects/ecj/
Garcia-Osorio, C., Fyfe, C.: Comparing exploratory projection pursuit artificial neural networks. In: 3rd International Workshop on Practical Applications of Agents and Multiagent Systems, IWPAAMS 2004, pp. 201–210 (2004)
Guo, H., Nandi, A.K.: Breast cancer diagnosis using genetic programming generated feature. Pattern Recognition 39(5), 980–987 (2006)
Guyon, I., Gunn, S., Nikravesh, M., Zadeh, L. (eds.): Feature Extraction, Foundations and Applications. STUD FUZZ. Physica-Verlag, Springer, Heidelberg (2006)
Icke, I., Rosenberg, A.: Multi-objective genetic programming projection pursuit for exploratory data modeling. In: 5th Annual Machine Learning Symposium, New York Academy of Sciences (2010)
Icke, I., Rosenberg, A.: Symbolic regression for dimensionality reduction. In: Late Breaking Abstracts, GECCO 2010 (2010)
Jin, Y., Sendhoff, B.: Pareto-based multiobjective machine learning: An overview and case studies. IEEE Transactions on Systems, Man, and Cybernetics, Part C 38(3), 397–415 (2008)
Krawiec, K.: Genetic programming-based construction of features for machine learning and knowledge discovery tasks. Genetic Programming and Evolvable Machines 3(4), 329–343 (2002)
Lee, E.-K., Cook, D., Klinke, S., Lumley, T.: Projection pursuit for exploratory supervised classification. Journal of Computational and Graphical Statistics 14(4), 831–846 (2005)
Muharram, M., Smith, G.D.: Evolutionary constructive induction. IEEE Trans. on Knowl. and Data Eng. 17(11), 1518–1528 (2005)
Otero, F.E.B., Silva, M.M.S., Freitas, A.A., Nievola, J.C.: Genetic programming for attribute construction in data mining. In: Ryan, C., Soule, T., Keijzer, M., Tsang, E.P.K., Poli, R., Costa, E. (eds.) EuroGP 2003. LNCS, vol. 2610, pp. 384–393. Springer, Heidelberg (2003)
Sips, M., Neubert, B., Lewis, J.P., Hanrahan, P.: Selecting good views of high-dimensional data using class consistency. In: Hege, H.-C., Hotz, I., Munzner, T. (eds.) EuroVis 2009: Eurographics/ IEEE-VGTC Symposium on Visualization 2009. Computer Graphics Forum, vol. 28, pp. 831–838. Blackwell, Berlin (2009)
Tatu, A., Bak, P., Bertini, E., Keim, D.A., Schneidewind, J.: Visual quality metrics and human perception: an initial study on 2d projections of large multidimensional data. In: AVI-Proceedings of AVI Working Conference on Advanced Visual Interfaces, pp. 49–56 (2010)
Vladislavleva, E., Smits, G., den Hertog, D.: Order of nonlinearity as a complexity measure for models generated by symbolic regression via pareto genetic programming. IEEE Trans. Evolutionary Computation 13(2), 333–349 (2009)
Zhang, Y., Rockett, P.: A generic multi-dimensional feature extraction method using multiobjective genetic programming. Evolutionary Computation 17(1), 89–115 (2009)
Zitzler, E., Laumanns, M., Thiele, L.: Spea2: Improving the strength pareto evolutionary algorithm. Technical report, ETH Zentrum, Gloriastrasse 35, CH-8092 Zurich, Switzerland (2001)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Icke, I., Rosenberg, A. (2011). Multi-objective Genetic Programming for Visual Analytics. In: Silva, S., Foster, J.A., Nicolau, M., Machado, P., Giacobini, M. (eds) Genetic Programming. EuroGP 2011. Lecture Notes in Computer Science, vol 6621. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-20407-4_28
Download citation
DOI: https://doi.org/10.1007/978-3-642-20407-4_28
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-20406-7
Online ISBN: 978-3-642-20407-4
eBook Packages: Computer ScienceComputer Science (R0)