Created by W.Langdon from gp-bibliography.bib Revision:1.8178
To date, the best approaches are based on Support Vector Machines (SVMs) that employ standard {"}spectrum{"} features and achieve promoter region classification accuracies from a low of 84percent to a high of 94percent depending on the particular species involved. In this paper, we propose a general and powerful methodology that uses Genetic Programming (GP) techniques to generate more complex and more gene-specific features to be used with a standard SVM for promoter region identification.
We evaluate our methodology on three data sets from different species and observe consistent classification accuracies in the 94-95percent range. In addition, because the GP-generated features are gene-specific, they can be used by biologists to advance their understanding of the architecture of eukaryotic promoter regions.",
Genetic Programming entries for Uday Kamath Kenneth De Jong Amarda Shehu