Skip to main content

Predicting the Presence of Newt-Amphibian Using Genetic Programming

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Networks and Systems ((LNNS,volume 318))

Abstract

In nature, aquatic ecosystems play a very important aspect. River valleys, wetlands, and water reservoirs are territories for various species of vegetation and wildlife. The prediction of these species is very important for natural resource planning. In this work, a publicly available UCI dataset containing extracted features from satellite imagery is used to classify the presence of newt-amphibians. We convert this multi-class classification problem to the binary classification problem. The transformation leads to being unbalanced classification problem. For the unbalanced classification, in the original form, most machine learning techniques give biased classification results, and their results are inclined in favor of the majority class. We use genetic programming with a newly proposed Euclidean distance and weight-based (EDWB) fitness function to resolve this problem. The result outcomes are compared with original work, support vector machine (SVM), and GP with the standard fitness function. The proposed approach achieves better results than the original work, SVM, and compared GP methods.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Baker J, Beebee T, Buckley J, Gent T, Orchard D (2011) Amphibian habitat management handbook. Amphibian and reptile conservation, Bournemouth

    Google Scholar 

  2. Gardner TA, Barlow J, Peres CA (2007) Paradox, presumption and pitfalls in conservation biology: the importance of habitat change for amphibians and reptiles. Biol Conserv 138(1–2):166–179

    Article  Google Scholar 

  3. Blachnik M, Sołtysiak M, Dąbrowska D (2019) Predicting presence of amphibian species using features obtained from gis and satellite images. ISPRS Int J Geo Inf 8(3):123

    Article  Google Scholar 

  4. Kumar A, Kakkar A, Majumdar R, Singh Baghel A (2015) Spatial data mining: recent trends and techniques. In: 2015 international conference on computer and computational sciences (ICCCS), pages 39–43. IEEE

    Google Scholar 

  5. Mojaddadi H, Pradhan B, Nampak H, Ahmad N, Ghazali AHB (2017) Ensemble machine-learning-based geospatial approach for flood risk assessment using multi-sensor remote-sensing data and gis. Geomatics Natl Hazards Risk 8(2):1080–1102

    Article  Google Scholar 

  6. Kumar A, Sinha N (2020) Classification of forest cover type using random forests algorithm. In: Advances in data and information sciences. Springer, pages 395–402

    Google Scholar 

  7. Avand M, Moradi H et al (2020) Using machine learning models, remote sensing, and gis to investigate the effects of changing climates and land uses on flood probability. J Hydrol, page 125663

    Google Scholar 

  8. Kumar A, Choudhary T (2021) A machine learning approach for the land type classification. In: Innovations in electrical and electronic engineering. Springer

    Google Scholar 

  9. Dua D, Graff C (2017) UCI machine learning repository. URL https://archive.ics.uci.edu/ml/datasets/Amphibians

  10. Beyan C, Fisher R (2015) Classifying imbalanced data sets using similarity based hierarchical decomposition. Pattern Recogn 48(5):1653–1672

    Article  Google Scholar 

  11. Pouyanfar S, Chen S-C (2017) Automatic video event detection for imbalance data using enhanced ensemble deep learning. Int J Semantic Comput 11(01):85–109

    Article  Google Scholar 

  12. Zhu M, Xia J, Jin X, Yan M, Cai G, Yan J, Ning G (2018) Class weights random forest algorithm for processing class imbalanced medical data. IEEE Access 6:4641–4652

    Article  Google Scholar 

  13. Galar M, Fernández A, Barrenechea E, Herrera F (2013) Eusboost: enhancing ensembles for highly imbalanced data-sets by evolutionary undersampling. Pattern Recogn 46(12):3460–3471

    Article  Google Scholar 

  14. Chawla NV, Bowyer KW, Hall LO, Philip W (2002) Kegelmeyersynthetic minority over-sampling technique. Smote J Artif Intell Res 16:321–357

    Article  Google Scholar 

  15. Pei W, Xue B, Shang L, Zhang M (2020) Genetic programming for development of cost-sensitive classifiers for binary high-dimensional unbalanced classification. Appl Soft Comput, page 106989

    Google Scholar 

  16. Mostafizur Rahman M, Davis DN (2013) Addressing the class imbalance problem in medical datasets. Int J Mach Learn Comput 3(2):224

    Article  Google Scholar 

  17. Tahir MAUH, Asghar S, Manzoor A, Asim Noor M (2019) A classification model for class imbalance dataset using genetic programming. IEEE Access 7:71013–71037

    Article  Google Scholar 

  18. Koza JRGP (1992) On the programming of computers by means of natural selection. Genetic Programming

    Google Scholar 

  19. Kumar A, Sinha N, Bhardwaj A (2020) A novel fitness function in genetic programming for medical data classification. J Biomed Inform, page 103623

    Google Scholar 

  20. Asthana M, Gupta KD, Kumar A (2020) Test suite optimization using lion search algorithm. In: Ambient communications and computer systems. Springer, pages 77–90

    Google Scholar 

  21. Koza JR (2010) Human-competitive results produced by genetic programming. Genetic Program Evolvable Mach 11(3–4):251–284

    Article  Google Scholar 

  22. Kumar A, Nagar R, Singh Baghel A (2014) A genetic algorithm approach to release planning in agile environment. In: 2014 international conference on information systems and computer networks (ISCON). IEEE, pages 118–122

    Google Scholar 

  23. Tran B, Xue B, Zhang M (2016) Genetic programming for feature construction and selection in classification on high-dimensional data. Memetic Comput 8(1):3–15

    Article  Google Scholar 

  24. Cuadros-Rodríguez L, Pérez-Castaño E, Ruiz-Samblás C (2016) Quality performance metrics in multivariate classification methods for qualitative analysis. TrAC Trends Anal Chem 80:612–624

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Kumar, A., Sinha, N., Bhardwaj, A. (2022). Predicting the Presence of Newt-Amphibian Using Genetic Programming. In: Tiwari, S., Trivedi, M.C., Kolhe, M.L., Mishra, K., Singh, B.K. (eds) Advances in Data and Information Sciences. Lecture Notes in Networks and Systems, vol 318. Springer, Singapore. https://doi.org/10.1007/978-981-16-5689-7_19

Download citation

Publish with us

Policies and ethics