Abstract
In this chapter we give an introduction to the area of bioinformatics handling the nucleotide sequence analysis problem. We give a brief introduction to the nature of DNA and RNA and use one of many topics - the Translation Initiation Start (TIS) problem - to explain a computational prediction of motifs on biological sequences. Correct identification of the Translation Initiation Start (TIS) in cDNA sequences is an important issue for genome annotation. Here we describe a computational method for TIS identification based in a combination of statistics and Artificial Neural Networks (ANNs). This method makes use of two modules, one sensitive to the conserved motif and the other sensitive to the coding/noncoding potential around the start codon. Finally by applying a method inspired by molecular biology, the simplified method of the ribosome scanning model improves the prediction significantly.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Casey, D. (1992). Primer on molecular genetics. Technical report, Human Genome Managment Information System, Oak Ridge National Laboratory.
Fahlman, S. E. and Lebiere, C. (1990). The cascade-correlation learning architecture. In Touretzky, D., editor, Advances in Neural Information Processing systems II, pages 524–532, Los Altos, California. Morgan Kaufmann.
Hatzigeorgiou, A., Papanikolaou, H., and Reczko, M. (1999). Finding the reading frame in protein coding regions on dna sequences: a combination of statistical and neural network methods. In Mohammadian, M., editor, Computational Intelligence: Neural Networks & Advanced Control Strategies.
Hatzigeorgiou, A. and Reczko, M. (1999). Feature recognition on expressed sequence tags in human dna. In Proc. of the Intern. Joint Conf. on Neural Networks. INNS Press.
Kozak, M. (1984). Compilation and analysis of sequences upstream from the translational start site in eukaryotic mrnas. Nucl. Acids Res., 12:857–872.
Kozak, M. (1996). Interpreting cdna sequences: some insights from studies on translation. Mamalian genome, 7:563–574.
Pain, V. (1990). Initiaton of proteins synthesis in eukaryotic cells. Eur. J. Biochem., 236:747–771.
Riedmiller, M. and Braun, H. (1993). A direct adaptive method for faster back-propagation learning: The RPROP algorithm. In Ruspini, H., editor, Proceedings of the IEEE International Conference on Neural Networks (ICNN 93), pages 586–591. IEEE, San Francisco.
Rumelhart, D. and McClelland, J. (1986). Parallel Distributed Processing: Explorations in the Micro structure of Cognition; Vol. 1: Foundations; Vol. 2: Psychological and Biological Models. MIT Press, Cambridge, Mass.
Waterman, M. (1995). Introduction to Computational Biology: Sequences, Maps and Genomes. Chapman Hall.
Zell, A., Mache, N., Hübner, R., Marnier, G., Vogt, M., Herrmann, K. U., Schmalzl, M., Sommer, T, Hatzigeorgiou, A., Döring, S., Posselt, D., Reczko, M., and Riedmiller, M. (1993). SNNS user manual, version 3.0. Technical report, Universität Stuttgart, Fakultät Informatik.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer Science+Business Media, LLC
About this chapter
Cite this chapter
Hatzigeorgiou, A., Megraw, M. (2006). Computational Analysis of Human DNA Sequences: An Application of Artificial Neural Networks. In: Pintér, J.D. (eds) Global Optimization. Nonconvex Optimization and Its Applications, vol 85. Springer, Boston, MA . https://doi.org/10.1007/0-387-30927-6_7
Download citation
DOI: https://doi.org/10.1007/0-387-30927-6_7
Publisher Name: Springer, Boston, MA
Print ISBN: 978-0-387-30408-3
Online ISBN: 978-0-387-30927-9
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)