Abstract
One of the main challenges in modern biology and genome research is to understand the complex mechanisms that regulate gene expression. Being able to tell when, why, and how one or more genes are activated could provide information of inestimable value for the understanding of the mechanisms of life. The wealth of genomic data now available opens new opportunities to researchers. We present how a method based on genetic algorithms has been applied to the characterization of two regulatory signals in DNA sequences, that help the cellular apparatus to locate the beginning of a gene along the genome, and to start its transcription. The signals have been derived from the analysis of a large number of genomic sequences. Comparisons with related work show that our method presents different improvements, both from the computational viewpoint, and in the biological relevance of the results obtained.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Bellorini, M., Dantonel, J.C., Yoon, J.B., Roeder, R.G., Tora, L., Mantovani, R.: The major histocompatibility complex class II EA promoter requires TFIID binding to an initiator sequence. Mol. Cell. Biol. 16, 503–512 (1996)
Stormo, G.D.: DNA binding sites: representation and discovery. Bioinformatics 16, 16–23 (2000)
Pavesi, G., Mauri, G., Pesole, G.: Methods for pattern discovery in unaligned biological sequences. Briefings in Bioinformatics 2, 417–430 (2001)
Bucher, P.: Weight matrix descriptions of four eukaryotic RNA polymerase II promoter elements derived from 502 unrelated promoter sequences. J. Mol. Biol. 212, 563–578 (1990)
Hertz, G.Z., Stormo, G.D.: Identifying DNA and protein patterns with statistically significant alignments of multiple sequences. Bioinformatics 15, 563–577 (1999)
Bailey, T., Elkan, C.: Unsupervised learning of multiple motifs in biopolymers using expectation maximization. Machine Learning 21, 51–80 (1995)
Lawrence, C., Altschul, S., Boguski, M., Liu, J., Neuwald, A., Wooton, J.: Detecting subtle sequence signals: a Gibbs sampling strategy for multiple alignment. Science 262, 208–214 (1993)
Wall, M.: http://lancet.mit.edu/ga/
Goldberg, D.E.: Genetic Algorithms in Search, Optimization, and Machine Learning. Addison-Wesley, Massachusetts (1989)
Praz, V., Périer, R., Bonnard, C., Bucher, P.: The eukaryotic promoter database, EPD: new entry types and links to gene expression data. Nucleic Acids Res. 30, 322–324 (2002)
Kutach, A., Kadonaga, J.: The downstream promoter element DPE appears to be as widely used as the TATA box in Drosophila core promoters. Mol. Cell Biol. 20, 4754–4764 (2000)
Kim, J.L., Nikolov, D.B., Burley, S.I.: Co-crystal structure of TBP recognizing the minor groove of a TATA element. Nature 365, 520–527 (1993)
Lo, K., Smale, S.T.: Generality of a functional initiator consensus sequence. Gene 182, 13–22 (1996)
Okazaki, Y., Furuno, M., Kasukawa, T., et al.: Analysis of the mouse transcriptome based on functional annotation of 60,770 full-length cDNAs. Nature 420, 563–573 (2000)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Mauri, G., Mosca, R., Pavesi, G. (2004). A GA Approach to the Definition of Regulatory Signals in Genomic Sequences. In: Deb, K. (eds) Genetic and Evolutionary Computation – GECCO 2004. GECCO 2004. Lecture Notes in Computer Science, vol 3102. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-24854-5_39
Download citation
DOI: https://doi.org/10.1007/978-3-540-24854-5_39
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-22344-3
Online ISBN: 978-3-540-24854-5
eBook Packages: Springer Book Archive