Abstract
Accurate gene structure prediction plays a fundamental role in functional annotation of genes. The main focus of gene prediction methods is to find patterns in long DNA sequences that indicate the presence of genes. The problem of gene prediction is an important problem in the field of bioinformatics. With the explosive growth of genomic information there is a need for computational approaches that facilitate gene location, structure and functional prediction. In this paper, we survey various computational approaches for gene predictions
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Wang, Z., Chen, Y., Li, Y.: A Brief Review of Computational Gene Prediction Methods. Geno. Prot. Bioinfo. 2, 216–221 (2004)
Zhang, M.Q.: Computational Prediction of Eukaryotic Protein-Coding Genes. Nature Reviews Genetics 3, 698–709 (2002)
Mathe, C., Sagot, M., Schiex, T., Rouze, P.: Current Methods for Gene Prediction, Their Strengths and Weakness. Nucleic Acid Research 30, 4103–4117 (2002)
Bandyopadhyay, S., Maulik, U., Roy, D.: Gene Identification: Classical and Computational Intelligence Approaches. IEEE Transactions On Systems, Man, And Cybernetics—Part C: Applications And Reviews 38, 55–68 (2008)
Mount, D.W.: Bioinformatics: Genome and Sequence Analysis. Cold Spring Harbor Laboratory Press, New York (2004)
Stormo, G.D.: Gene-Finding Approaches in Eukaryotes. Genome Research 10, 394–397 (2000)
Krogh, A., Mian, I.S., Haussler, D.: A hidden Markov model that finds genes in E. coli DNA. Nucleic Acids Res. 22, 4768–4778 (1994)
Borodovsky, M., McIninch, J.: GENMARK: parallel gene recognition for both DNA strands. Comput. Chem. 17, 123–133 (1993)
Salzberg, S.L., Delcher, A.L., Kasif, S., White, O.: Microbial gene identification using interpolated Markov models. Nucleic Acids Res. 26, 544–548 (1998)
Uberbacher, E.C., Mural, R.J.: Locating protein-coding regions in human DNA sequences by a multiple sensor-neural network approach. Proc. Natl. Acad. Sci. USA 88, 11261–11265 (1991)
Fickett, J.W.: Recognition of protein coding regions in DNA sequences. Nucleic Acids Res. 10, 5303–5318 (1982)
Hutchinson, G.B., Hayden, M.R.: The prediction of exons through an analysis of spliceable open reading frames. Nucleic Acids Res. 20, 3453–3462 (1992)
Thomas, A., Skolnick, M.H.: A probabilistic model for detecting coding regions in DNA sequences. IMA J. Math. Appl. Med. Biol. 11, 149–160 (1994)
Guigo, R., Knudsen, S., Drake, N., Smith, T.: Prediction of gene structure. J. Mol. Biol. 226, 141–157 (1992)
Snyder, E.E., Stormo, G.D.: Identification of coding regions in genomic DNA sequences. J. Mol. Biol. 248, 1–18 (1995)
Dong, S., Searls, D.B.: Gene structure pre-diction by linguistic methods. Genomics 23, 540–551 (1994)
Solovyev, V.V., Salamov, A.A., Lawrence, C.B.: Predicting internal exons by oligonucleotide composition and discriminate analysis of spliceable open reading frames. Nucleic Acids Res. 22, 5156–5163 (1994)
Burge, C., Karlin, S.: Prediction of Complete Gene Structure in Human Genomic DNA. J. Mol. Biol. 268, 78–94 (1997)
Stanke, M., Waack, S.: Gene Prediction With A Hidden Markov Model and A New Intron Submodel. Bioinformatics 19, 215–225 (2003)
Burge, C., Karlin, S.: Finding the Genes in Genomic DNA. Curr. Opin. Struct. Biol. 8, 346–354 (1998)
Haussler, D.: Computational Genefinding. Trends Biochem. Sci., 12–15 (1998)
Zhang, M.Q.: Identifcation of protein coding regions in the human genome by quadratic discriminate analysis. Proc. Natl. Acad. Sci. USA 94, 565–568 (1997)
Milanesi, L., Kolchanov, N.A., Rogozin, I.B., Ischenko, I.V., Kel, A.E., Orlov, Y.L., Ponomarenko, M.P., Vezzoni, P.: GenView: a computing tool for protein-coding regions prediction in nucleotide sequences. In: Second International Conference on Bioinformatics, Supercomputing and Complex Genome Analysis, pp. 573–588. World Scientific Publishing, Singapore (1993)
Xiong, J.: Essential Bioinformatics. Cambridge University Press, New York (2006)
Fogel, D.B., Chellapilla, K., Fogel, D.B.: Identification of Coding Regions in DNA Sequences Using Evolved Nueral Networks. In: Fogel, G.B., Corne, D.W. (eds.) Evolutionary Computation is Bioinformatics, pp. 195–218. Morgan Kaufmann, USA (2003)
Bajic, V.B., Seah, S.H.: Dragon gene start finder: An advanced system for finding approximate locations of the start of gene transcriptional units. Genome Res. 13, 1923–1929 (2003)
Bajic, V.B., Seah, S.H.: Dragon gene start finder identifies approximate locations of the 5‘ ends of genes. Nucleic Acids Res. 31, 3560–3563 (2003)
Wang, L., Trawick, J.D., Yamamoto, R., Zamudio, C.: Genome-Wide Operon Prediction in Staphylococcus Aureus. Nucleic Acids Res. 32, 3689–3702 (2004)
Staden, R.: Graphic Methods to Determine the Function of Nucleic Acid Sequences. Nucleic Acids Research, 521–538 (1984)
Fields, C.A., Soderlund, C.A.: Gm a practical tool for automating DNA Sequence Analysis. Comput. Appl. Biosci. 6, 263–270 (1990)
Rogozin, I.B., Milanesi, L.: Analysis of Donor Splice Signals in Different Organisms. J. Mol. Evol. 45, 50–59 (1997)
Kleffe, J., Hermann, K., Vahrson, W., Wittig, B., Brendel, V.: Logitlinear Models for the Prediction of Splice Sites in Plant pre-mRNA Sequences. Nucleic Acid Research 24, 4709–4718 (1996)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Al-Turaiki, I.M., Mathkour, H., Touir, A., Hammami, S. (2011). Computational Approaches for Gene Prediction: A Comparative Survey. In: Abd Manaf, A., Zeki, A., Zamani, M., Chuprat, S., El-Qawasmeh, E. (eds) Informatics Engineering and Information Science. ICIEIS 2011. Communications in Computer and Information Science, vol 252. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-25453-6_2
Download citation
DOI: https://doi.org/10.1007/978-3-642-25453-6_2
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-25452-9
Online ISBN: 978-3-642-25453-6
eBook Packages: Computer ScienceComputer Science (R0)