A genetic programming-based approach and machine learning approaches to the classification of multiclass anti-malarial datasets
Created by W.Langdon from
gp-bibliography.bib Revision:1.8051
- @Article{Kumari:2018:IJCBDD,
-
author = "Madhulata Kumari and Neeraj Tiwari and
Naidu Subbarao",
-
title = "A genetic programming-based approach and machine
learning approaches to the classification of multiclass
anti-malarial datasets",
-
journal = "International Journal of Computational Biology and
Drug Design",
-
year = "2018",
-
volume = "11",
-
number = "4",
-
pages = "275--294",
-
keywords = "genetic algorithms, genetic programming",
-
ISSN = "1756-0764",
-
bibsource = "DBLP,
http://dblp.uni-trier.de/db/journals/ijcbdd/ijcbdd11.html#KumariTS18",
-
DOI = "doi:10.1504/IJCBDD.2018.096125",
-
abstract = "Feature selection approaches have been widely applied
to deal with the various sample size problem in the
classification of activity of datasets. The present
work focuses on the understanding system of descriptors
of anti-malarial inhibitors by Genetic programming (GP)
to understand the impact of descriptors on inhibitory
effects. The experimental dataset of inhibitors of
anti-malarial was used to derive the optimised system
by GP. Additionally, we have developed machine learning
models using the random forest, decision tree, support
vector machine (SVM) and Naive Bayes on an antimalarial
dataset obtained from ChEMBL database and evaluated for
their predictive capability. Based on the statistical
evaluation, Random Forest model showed the higher area
under the curve (AUC), better accuracy, sensitivity,
and specificity in the cross-validation tests as
compared to others. The statistical results indicated
that the RF model was the best predictive model with
82.51percent accuracy, 89.7percent ROC. We deployed the
RF classifier model on three datasets; phytochemical
compound dataset, NCI natural product dataset IV and
approved drugs dataset containing 918, 423 and 1554
compounds resulting 153, 81 and 250 compounds
respectively as anti-malarial compounds. Further, to
prioritise drug-like compounds, Lipinski's rule was
applied on active phytochemicals which resulted in 13
hit anti-malarial molecules. Thus, such predictive
models are useful to find out novel hit anti-malarial
compounds and could also be used to discover novel
drugs for other diseases.",
-
notes = "Also known as \cite{journals/ijcbdd/KumariTS18}",
- }
Genetic Programming entries for
Madhulata Kumari
Neeraj Tiwari
Naidu Subbarao
Citations