Skip to main content

A Genetic Programming Approach to Data Clustering

  • Conference paper
Multimedia, Computer Graphics and Broadcasting (MulGraB 2011)

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 263))

Abstract

This paper presents a genetic programming (GP) to data clustering. The aim is to accurately classify a set of input data into their genuine clusters. The idea lies in discovering a mathematical function on clustering regularities and then utilize the rule to make a correct decision on the entities of each cluster. To this end, GP is incorporated into the clustering procedures. Each individual is represented by a parsing tree on the program set. Fitness function evaluates the quality of clustering with regard to similarity criteria. Crossover exchanges sub-trees between parental candidates in a positionally independent fashion. Mutation introduces (in part) a new sub-tree with a low probability. The variation operators (i.e., crossover, mutation) offer an effective search capability to obtain the improved quality of solution and the enhanced speed of convergence. Experimental results demonstrate that the proposed approach outperforms a well-known reference.

This research was supported by the MKE, Korea under the ITRC NIPA-2011-(C1090-1121-0008).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Jain, A.K., Murty, M.N., Flynn, P.J.: Data Clustering: A Review. ACM Computing Surveys 31(3) (September 1999)

    Google Scholar 

  2. Han, J., Kamber, M.: Data mining: Concepts and techniques. Morgan Kaufmann Publishers, San Francisco (2001)

    MATH  Google Scholar 

  3. Park, N.H., Ahn, C.W., Ramakrishna, R.S.: Adaptive Clustering Technique Using Genetic Algorithms. IEICE Trans. Inf. and Syst. E88-D(12), 2880–2882 (2005)

    Article  Google Scholar 

  4. Koza, J.R.: Genetic Programming On the programming of Computers by Means of Natural Selection. The MIT Press (1992)

    Google Scholar 

  5. Langdon, W.B.: Genetic Programming + Data Structures = Automatic Programming. The Kluwer International Series in Engineering and Computer Science. Kluwer Academic Publishers (1998)

    Google Scholar 

  6. Mitchell, T.M.: Machine Learning. Computer Science Series. McGRAW-HILL International Editions (1997)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2011 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Ahn, C.W., Oh, S., Oh, M. (2011). A Genetic Programming Approach to Data Clustering. In: Kim, Th., et al. Multimedia, Computer Graphics and Broadcasting. MulGraB 2011. Communications in Computer and Information Science, vol 263. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-27186-1_15

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-27186-1_15

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-27185-4

  • Online ISBN: 978-3-642-27186-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics