Skip to main content

A Genetic Programming Approach Applied to Feature Selection from Medical Data

  • Conference paper
  • First Online:

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 803))

Abstract

Genetic programming represents a flexible and powerful evolutionary technique in machine learning. The use of genetic programming for rule induction has generated interesting results in classification problems. This paper proposes an evolutionary approach for logical rule induction, which is applied to clinical data. Since logical rules disclose knowledge from the analyzed data, we use such a knowledge to filter features from the target dataset. The results reached by the used dataset have been very promising when used in classification tasks and compared with other methods.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Bandyopadhyay, S., Pal, S.K.: Classification and Learning Using Genetic Algorithms: Applications in Bioinformatics and Web Intelligence. Natural Computing Series. Springer, Heidelberg (2007). https://doi.org/10.1007/3-540-49607-6

    Book  MATH  Google Scholar 

  2. Bonelli, P., Parodi, A.: An efficient classifier system and its experimental comparison with two representative learning methods on three medical domains. In: Proceedings of the 4th International Conference on Genetic Algorithms (ICGA), pp. 288–295 (1991)

    Google Scholar 

  3. Hong, J.H., Cho, S.B.: The classification of cancer based on DNA microarray data that uses diverse ensemble genetic programming. Artif. Intell. Med. 36, 43–58 (2006)

    Article  Google Scholar 

  4. Kumar, T.P., Iba, H.: Prediction of cancer class with majority voting genetic programming classifier using gene expression data. IEEE/ACM Trans. Comput. Biol. Bioinf. 6, 353–367 (2009)

    Article  Google Scholar 

  5. Kumar, R., Verma, R.: Classification rule discovery for diabetes patients by using genetic programming. Int. J. Soft Comput. Eng. (IJSCE) 2, 183–185 (2012)

    Google Scholar 

  6. Larraaga, P., et al.: Machine learning in bioinformatics. Briefings Bioinf. 7, 86–112 (2006)

    Article  Google Scholar 

  7. Liu, K.H., Xu, C.G.: A genetic programming-based approach to the classification of multiclass microarray datasets. Bioinformatics 25, 331–337 (2009)

    Article  Google Scholar 

  8. Maulik, U., Bandyopadhyay, S., Mukhopadhyay, A.: Multiobjective Genetic Algorithms for Clustering: Applications in Data Mining and Bioinformatics. Springer, Heidelberg (2011). https://doi.org/10.1007/978-3-642-16615-0

    Book  MATH  Google Scholar 

  9. Pea-Reyes, C.A., Sipper, M.: Evolutionary computation in medicine: an overview. Artif. Intell. Med. 19, 1–23 (2000)

    Article  Google Scholar 

  10. Podgorelec, V., Kokol, P., Stiglic, M.M., Hericko, M., Rozrnan, I.: Knowledge discovery with classification rules in a cardiovascular dataset. Comput. Methods Program. Biomed. 1, 539–549 (2005)

    Google Scholar 

  11. Soni, J., Ansari, U., Sharma, D., Soni, S.: Intelligent and effective heart disease prediction system using weighted associative classifiers. Int. J. Comput. Sci. Eng. (IJCSE) 3, 2385–2392 (2011)

    Google Scholar 

  12. Tsakonas, A., Dounias, G., Jantzen, J., Axer, H., Bjerregaard, B., von Keyserlingk, D.G.: Evolving rule-based systems in two medical domains using genetic programming. Artif. Intell. Med. 32, 195–216 (2004)

    Article  Google Scholar 

  13. Vargas, C.M.B., Chidambaram, C., Hembecker, F., Silvério, H.L.: A comparative study of machine learning and evolutionary computation approaches for protein secondary structure classification. In: Computational Biology and Applied Bioinformatics, pp. 239–258. InTech (2011)

    Google Scholar 

  14. Wolberg, W.H., Mangasarian, O.L.: Multisurface method of pattern separation for medical diagnosis applied to breast cytology. Proc. Natl. Acad. Sci. USA 87, 9193–9196 (1990)

    Article  Google Scholar 

  15. Lucas, P.: Analysis of notions of diagnosis. Artif. Intell. 12(105), 295–343 (1998)

    Article  MathSciNet  Google Scholar 

  16. Lucas, P.: Prognostic methods in medicine. Artif. Intell. 15, 105–119 (1999)

    Article  Google Scholar 

  17. Ramos, J., Castellanos-Garzón, J.A., González-Briones, A., de Paz, J.F., Corchado, J.M.: An agent-based clustering approach for gene selection in gene expression microarray. Interdiscip. Sci. Comput. Life Sci. 9, 1–13 (2017)

    Article  Google Scholar 

  18. Castellanos-Garzón, J.A., Ramos, J., González-Briones, A., de Paz, J.F.: A clustering-based method for gene selection to classify tissue samples in lung cancer. In: Saberi Mohamad, M., Rocha, M., Fdez-Riverola, F., Domínguez Mayo, F., De Paz, J. (eds.) PACBB 2016. AISC, vol. 477, pp. 99–107. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-40126-3_11

    Chapter  Google Scholar 

  19. Castellanos-Garzón, J.A., Ramos, J.: A gene selection approach based on clustering for classification tasks in colon cancer. ADCAIJ Adv. Distrib. Comput. Artif. Intell. J. 4(3), 1–10 (2015)

    Google Scholar 

  20. González-Briones, A., Ramos, J., De Paz, J.F.: A drug identification system for intoxicated drivers based on a systematic review. ADCAIJ Adv. Distrib. Comput. Artif. Intell. J. 4(4), 83–101 (2015)

    Google Scholar 

  21. Espejo, P.G., Ventura, S., Herrera, F.: A survey on the application of genetic programming to classification. IEEE Trans. Syst. Man Cybern. Part C Appl. Rev. 40(2), 121–144 (2010)

    Article  Google Scholar 

  22. Pappa, G.L., Freitas, A.A.: Evolving rule induction algorithms with multi-objective grammar-based genetic programming. Knowl. Inf. Syst. 19(3), 283–309 (2009)

    Article  Google Scholar 

  23. Alcalá-Fdez, J., et al.: KEEL: a software tool to assess evolutionary algorithms for data mining problems. Soft. Comput. 13, 307–318 (2009)

    Article  Google Scholar 

  24. Fernández, A., García, S., Luengo, J., Bernadó-Mansilla, E., Herrera, F.: Genetics-based machine learning for rule induction: state of the art, taxonomy, and comparative study. IEEE Trans. Evol. Comput. 14(6), 913–941 (2010)

    Article  Google Scholar 

  25. Oyebode, O.K., Adeyemo, J.A.: Genetic programming: principles, applications and opportunities for hydrological modelling. World Acad. Sci. Eng. Technol. Int. J. Environ. Ecol. Geol. Min. Eng. 8, 310–316 (2014)

    Google Scholar 

  26. Freitas, A.A.: A survey of evolutionary algorithms for data mining and knowledge discovery. In: Ghosh, A., Tsutsui, S. (eds.) Advances in Evolutionary Computation, pp. 819–845. Springer, Heidelberg (2002)

    Google Scholar 

  27. Freitas, A.A.: A review of evolutionary algorithms for data mining. In: Maimon, O., Rokach, L. (eds.) Soft Computing for Knowledge Discovery and Data Mining, Part II, pp. 79–111. Springer, Boston (2008). https://doi.org/10.1007/978-0-387-69935-6_4

    Chapter  Google Scholar 

  28. Witten, I.H., Frank, E.: Data Mining: Practical Machine Learning Tools and Techniques, 2nd edn. Morgan Kaufmann, San Francisco (2005)

    MATH  Google Scholar 

  29. Flach, P.: Machine Learning: The Art and Science of Algorithms that Make Sense of Data. Cambridge University Press, Cambridge (2012)

    Book  Google Scholar 

  30. Pappa, G.L., Freitas, A.A.: Automating the Design of Data Mining Algorithms: An Evolutionary Computation Approach. Springer, Heidelberg (2010). https://doi.org/10.1007/978-3-642-02541-9

    Book  MATH  Google Scholar 

  31. Witten, I.H., Frank, E., Hall, M.A.: Data Mining: Practical Machine Learning, Tools and Techniques, 3rd edn. Elsevier Inc., Waltham (2011)

    Google Scholar 

  32. Bacardit, J., Goldberg, D.E., Butz, M.V.: Improving the performance of a pittsburgh learning classifier system using a default rule. In: Kovacs, T., Llorà, X., Takadama, K., Lanzi, P.L., Stolzmann, W., Wilson, S.W. (eds.) IWLCS 2003-2005. LNCS (LNAI), vol. 4399, pp. 291–307. Springer, Heidelberg (2007). https://doi.org/10.1007/978-3-540-71231-2_20

    Chapter  Google Scholar 

  33. Goldberg, D.E.: Genetic Algorithms in Search, Optimization and Machine Learning. Addison-Wesley, Reading (1989)

    MATH  Google Scholar 

  34. Blake, C., Merz, C.: Repository of machine learning databases (UCI). Center for Machine Learning and Intelligent Systems (1998)

    Google Scholar 

  35. Kononenko, I., Simec, E., Robnik-Sikonja, M.: Overcoming the myopia of inductive learning algorithms with RELIEFF. Appl. Intell. 7(1), 39–55 (1997)

    Article  Google Scholar 

  36. Li, J., Wong, L.: Using rules to analyse bio-medical data: a comparison between C4.5 and PCL. In: Dong, G., Tang, C., Wang, W. (eds.) WAIM 2003. LNCS, vol. 2762, pp. 254–265. Springer, Heidelberg (2003). https://doi.org/10.1007/978-3-540-45160-0_25

    Chapter  Google Scholar 

  37. Zhou, Z.H., Jiang, Y.: NeC4.5: neural ensemble based C4.5. IEEE Trans. Knowl. Data Eng. 16(6), 770–773 (2004)

    Article  Google Scholar 

  38. Smirnov, E., Sprinkhuizen-Kuyper, I.G., Nalbantis, I.: Unanimous voting using support vector machines. Technical report, ERIM and Universiteit Rotterdam, IKAT, Universiteit Maastricht (2004)

    Google Scholar 

Download references

Acknowledgments

This work has been carried out under the iCIS project (CENTRO-07-ST24-FEDER-002003), which has been co-financed by QREN, in the scope of the Mais Centro Program and European Union’s FEDER.

This work has also been partially supported by the Interreg V-A Spain-Portugal Program (PocTep) and the European Regional Development Fund (ERDF) under the IOTEC project (grant 0123_IOTEC_3_E).

The research of Juan Ramos González has been co-financed by the European Social Fund and Junta de Castilla y Len (Operational Programme 2014–2020 for Castilla y Len, BOCYL EDU/602/2016).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to José A. Castellanos-Garzón .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Castellanos-Garzón, J.A., Ramos, J., Martín, Y.M., de Paz, J.F., Costa, E. (2019). A Genetic Programming Approach Applied to Feature Selection from Medical Data. In: Fdez-Riverola, F., Mohamad, M., Rocha, M., De Paz, J., González, P. (eds) Practical Applications of Computational Biology and Bioinformatics, 12th International Conference. PACBB2018 2018. Advances in Intelligent Systems and Computing, vol 803. Springer, Cham. https://doi.org/10.1007/978-3-319-98702-6_24

Download citation

Publish with us

Policies and ethics