Created by W.Langdon from gp-bibliography.bib Revision:1.8120
Strongly typed grammar based genetic programming (GP) is used to evolve regular expressions (RE) to classify gene exons with potential alternative mRNA expression from those without.
http://bioinformatics.essex.ac.uk/users/wlangdon/rnanet RNAnet gives us correlations between Affymetrix HG-U133 Plus 2 GeneChip probe measurements for the same exon across 2757 Homo Sapiens tissue samples from NCBI's GEO database. We identify many non-atomic Ensembl exons. I.e. exons with substructure.
Biological patterns can be data mined by a Backus-Naur form (BNF) context-free grammar using a strongly typed GP written in gawk and using egrep. The automatically produced DNA motifs suggest that alternative polyadenylation is not responsible.
The training data is available on the (broken, see data_url) http://bioinformatics.essex.ac.uk/users/wlangdon/tr-09-02.tar.gz internet.",
Genetic Programming entries for William B Langdon J Rowsell Andrew P Harrison