Abstract
Computational approaches have been applied in many different biology application domains. When such tools are based on conventional computation, their approach has shown limitations when dealing with complex biological problems. In the present study, a computational evolutionary environment (GASNP) is proposed as a tool to extract classification rules from biological dataset. The main goal of the proposed approach is to allow the discovery of concise, and accurate, high-level rules (from a biological database named dbSNP - Database Single Nucleotide Polymorphism) which can be used as a classification system. More than focusing only on the classification accuracy, the proposed GASNP model aims at balancing prediction precision, interpretability and comprehensibility. The obtained results show that the proposed GASNP has great potential and is capable of extracting useful high-level knowledge that could not be extracted by traditional classification methods such as Decision Trees, One R and the Single Conjunctive Rule Learner, among others, using the same dataset.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Setbal, J.C., Meidanis, J.: Introduction to Computacional Molecular Biology. PWS Publishing Company, Boston (1997)
Baldi, P., Brunak, S.: Bioinformatics: the Machine Learning approach. MIT Press, Cambridge (2001)
Goldberg, D.E.: Genetic Algorithms in Search, Optimization and Machine Learning. Adison-Wesley, Reading (1989)
Fidelis, M.V., Lopes, H.S., Freitas, A.A.: Discovery Comprehensible Classification Rules with a Genetic Algorithm. In: Proceedings of the Congress on Evolutionary Computation, CEC 2000 (2000)
Amaral, L.R., Sadoyama, G., Espindola, F.S., Oliveira, G.M.B.: Oncogenes Classification Measured by Microarray using Genetic Algorithms. In: IASTED International Conference on Artificial Intelligence and Applications, AIA 2008 (2008)
Ross, D.T., Scherf, U., Eisen, M.B., Perou, C.M., Rees, C., Spellman, P., Iyer, V., Jeffrey, S.S., Van de Rijn, M., Waltham, M., Pergamenschikov, A., Lee, J.C.F., Lashkari, D., Shalon, D., Myers, T.G., Weinstein, J.N., Botstein, D., Brown, P.O.: Systematic variation in gene expression patterns in human cancer cell lines. Nature Genetics (2000)
Kitts, A., Sherry, S.: The NCBI Handbook. The National Library of Medicine (2002)
Lopes, H.S., Coutinho, M.S., Lima, W.C.: An evolutionary approach to simulate cognitive feedback learning in medical domain. In: Genetic Algorithms and Fuzzy Logic Systems, World Scientific, Singapore (1997)
Fidelis, M.V., Lopes, H.S., Freitas, A.A.: Discovery comprehensible classification rules with a genetic algorithm. In: Congress on Evolutionary Computation, CEC 2000 (2000)
Holmes, G., Donkin, A., Witten, I.H.: Weka: A machine learning workbench. In: Proceedings of the Second Australia and New Zealand Conference on Intelligent Information Systems (1994)
Collins, F.S., Brooks, L.D., Chakravarti, A.: A DNA Polymorphism Discovery Resource for Research on Human Genetic Variation. Genome Research (1998)
Wang, D.G., Fan, J.B., Siao, C.J., Berno, A., Young, P., Sapolsky, R., Ghandour, G., Perkins, N., Winchester, E., Spencer, J., Kruglyak, L., Stein, L., Hsie, L., Topaloglou, T., Hubbell, E., Robinson, E., Mittmann, M., Morris, M.S., Shen, N., Kilburn, D., Rioux, J., Nusbaum, C., Rozen, S., Hudson, T.J., Lander, E.S., Lipshutz, R., Chee, M.: Large-scale identification, mapping, and genotyping of single-nucleotide polymorphisms in the human genome, Science (1998)
Masood, E.: As consortium plans free SNP map of human genome. Nature (1999)
Wang, Z., Moult, J.: SNPs, Protein Structure, and Disease. Human Mutation (2001)
Mooney, S.: Bioinformatics approaches and resources for single nucleotide polymorphism functional analysis. Briefings in Bioinformatics (2005)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Bevilaqua, A., Rodrigues, F.A., do Amaral, L.R. (2011). SNPs Classification: Building Biological High-Level Knowledge Using Genetic Algorithms. In: Hruschka, E.R., Watada, J., do Carmo Nicoletti, M. (eds) Integrated Computing Technology. INTECH 2011. Communications in Computer and Information Science, vol 165. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-22247-4_5
Download citation
DOI: https://doi.org/10.1007/978-3-642-22247-4_5
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-22246-7
Online ISBN: 978-3-642-22247-4
eBook Packages: Computer ScienceComputer Science (R0)