Abstract
The paradigm of identifying genetic risk factors for common human diseases by analyzing one DNA sequence variation at a time is quickly being replaced by research strategies that embrace the multivariate complexity of the genotype to phenotype mapping relationship that is likely due, in part, to nonlinear interactions among many genetic and environmental factors. Embracing the complexity of common diseases such as cancer requires powerful computational methods that are able to model nonlinear interactions in high-dimensional genetic data. Previously, we have addressed this challenge with the development of a computational evolution system (CES) that incorporates greater biological realism than traditional artificial evolution methods, such as genetic programming. Our results have demonstrated that CES is capable of efficiently navigating these large and rugged fitness landscapes toward the discovery of biologically meaningful genetic models of disease predisposition. Further, we have shown that the efficacy of CES is improved dramatically when the system is provided with statistical expert knowledge, derived from a family of machine learning techniques known as Relief, or biological expert knowledge, derived from sources such as protein-protein interaction databases. The goal of the present study was to apply CES to the genetic analysis of prostate cancer aggressiveness in a large sample of European Americans. We introduce here the use of 3D visualization methods to identify interesting patterns in CES results. Information extracted from the visualization through human-computer interaction are then provide as expert knowledge to newCES runs in a cascading framework. We present aCES-derived multivariate classifier and provide a statistical and biological interpretation in the context of prostate cancer prediction. The incorporation of human-computer interaction into CES provides a first step towards an interactive discovery system where the experts can be embedded in the computational discovery process. Our working hypothesis is that this type of human-computer interaction will provide more useful results for complex problem solving than the traditional black box machine learning approach.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Banzhaf, W., Beslon, G., Christensen, S., Foster, J.A., K´ep`es, F., Lefort, V., Miller, J.F., Radman,M., and Ramsden, J.J. (2006). From artificial evolution to computational evolution: a research agenda. Nature Reviews Genetics, 7:729–735.
Banzhaf, W., Nordin, P., Keller, R.E., and Francone, F.D. (1998). Genetic Programming Ð An Introduction; On the Automatic Evolutionof Computer Programs and its Applications. Morgan Kaufmann, San Francisco, CA, USA.
Cordell, H.J. (2009). Detecting gene-gene interactions that underlie human diseases. Nature Reviews Genetics, 10:392–404.
Fogel, G.B. and Corne, D.W. (2003). Evolutionary Computation in Bioinformatics. Morgan Kaufmann Publishers.
Greene, C.S., Hill, D.P., and Moore, J.H. (2009a). Environmental noise improves epistasis models of genetic data discovered using a computational evolution system. In Proceedings of the Genetic and Evolutionary Computation Conference, pages 1785–1786.
Greene, C.S., Hill, D.P., andWhite, B.C. (2010). Genetic Programming Theory and Practice VII, chapter Environmental sensing using expert knowledge in a computational evolution system for complex problem solving in human genetics, pages 195–210. Springer, Ann Arbor.
Greene, C.S., White, B.C., and Moore, J.H. (2009b). An expert knowledgeguided mutation operator for genome-wide genetic analysis using genetic programming. In Lecture Notes in Bioinformatics, volume 4774, pages 30– 40.
Greene, C.S.,White, B.C., and Moore, J.H. (2009c). Sensible initialization using expert knowledge for genome-wide analysis of epistasis using genetic programming. In Proceedings of the IEEE Congress on Evolutionary Computation, pages 1289–1296.
Hastie, T., Tibshirani, R., and Friedman, J. (2001). The Elements of Statistical Learning : Data Mining, Inference, and Prediction. New York: Springer- Verlag.
Heer, J., Bostock, M., and Ogievetsky, V. (2010). A tour through the visualization zoo. Comm ACM, 53:59.
Koza, J.R. (1992). Genetic Programming: On the Programming of Computers by Means of Natural Selection. MIT Press, Cambridge, MA, USA.
Langley, P. (2002). Lessons for the computational discovery of scientific knowledge. In Proceedings of First International Workshop on Data Mining Lessons Learned, volume 1, pages 9–12.
McKinney, B.A., Reif, D.M, Ritchie, M.D., and Moore, J.H. (2006). Machine learning for detecting gene-gene interactions: a review. Appl. Bioinformatics, 5:77–88.
Mitchell, T.M. (1997). Machine Learning. MacGraw-Hill, Boston.
Moore, J.H., Andrews, P.C., Barney, N., andWhite, B.C. (2008). Development and evaluation of an open-ended computational evolution system for the genetic analysis of susceptibility to common human diseases. In Lecture Notes in Computer Science, volume 4973, pages 129–140.
Moore, J.H.,Asselbergs, F.W., andWilliams, S.M. (2010). Bioinformatics challenges for genome-wide association studies. Bioinformatics, 26(4):445–455.
Moore, J.H.,Gilbert, J.C., Tsai, C.-T., Chiang, F.T.,Holden,W., Barney, N., and White, B.C. (2006). A flexible computational framework for detecting, characterizing, and interpreting statistical patterns of epistasis in genetic studies of human disease susceptibility. Journal of Theoretical Biology, 241:252–61.
Moore, J.H., Lari, R.C.,Hill, D.,Hibberd, P.L., andMadan, J.C. (2011). Human microbiome visualization using 3d technology. In Pac Symp Biocomput., pages 154–64.
Moore, J.H., Parker, J.S., Olsen, N.J., and Aune, T.M. (2002). Symbolic discriminant analysis of microarray data in autoimmune disease. Genetic Epidemiology, 23:57–69.
Moore, J.H. and White, B.C. (2007). Tuning relieff for genome-wide genetic analysis. Lec. Notes Comp. Sci., 4447:166–175.
Moore, J.H. and Williams, S.M. (2009). Epistasis and its implications for personal
genetics. American Journal of Human Genetics, 85:309–320.
Motsinger, A.A., Ritchie, M.D., and Reif, D.M (2007). Novel methods for detecting epistasis in pharmacogenomics studies. Pharmacogenomics, 8:1229– 41.
Pattin, K.A., Payne, J.L., Hill, D.P., Caldwell, T., Fisher, J., and Moore, J.H. (2011). Genetic Programming Theory and Practice VIII, chapter Exploiting expert knowledge of protein-protein interactions in a computational evolution system for detecting epistasis in common human disease, pages 195–210. Springer.
Payne, J.L., Greene, C.S., Hill, D.P., and Moore, J.H. (2010). Exploitation of Linkage Learning in Evolutionary Algorithms, chapter 10: Sensible initialization of a computational evolution systemusing expert knowledge for epistasis analysis in human genetics, pages 215–226. Springer.
Spector, L. (2003). Genetic Programming Theory and Practice, chapter An essay concerning human understanding of genetic programming, pages 11– 24. Springer.
Thomas, J. and Cook, K. (2005). Illuminating the Path: Research and Development Agenda for Visual Analytics. IEEE Press.
Thornton-Wells, T.A., Moore, J.H., and Haines, J.L. (2004). Genetics, statistics and human disease: analytical retooling for complexity. Trends Genet., 20:640–7.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer Science+Business Media, LLC
About this chapter
Cite this chapter
Moore, J.H., Hill, D.P., Fisher, J.M., Lavender, N., Kidd, L.C. (2011). Human-Computer Interaction in a Computational Evolution System for the Genetic Analysis of Cancer. In: Riolo, R., Vladislavleva, E., Moore, J. (eds) Genetic Programming Theory and Practice IX. Genetic and Evolutionary Computation. Springer, New York, NY. https://doi.org/10.1007/978-1-4614-1770-5_9
Download citation
DOI: https://doi.org/10.1007/978-1-4614-1770-5_9
Published:
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4614-1769-9
Online ISBN: 978-1-4614-1770-5
eBook Packages: Computer ScienceComputer Science (R0)