Abstract
One major goal of genomics is to identify all the functional sequences in genomes, including sequences that regulate the expression of genes. Sequence conservation is a good, albeit imperfect, guide to these functional elements. We describe how to use publicly available servers (Galaxy, the UCSC Table Browser, and GALA) to find genomic sequences whose alignments (from blastZ and multiZ) show properties associated with cis-regulatory modules, such as high conservation score, high regulatory potential score, and conserved transcription factor binding sites. Links to these servers can be accessed at http:// www.bx.psu.edu/ and http://genome.ucsc.edu/.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Collins, F. S., Green, E. D., Guttmacher, A. E., and Guyer, M. S. (2003) A vision for the future of genomics research. Nature 422, 835–847.
Parra, G., Agarwal, P., Abril, J. F., Wiehe, T., Fickett, J. W., and Guigo, R. (2003) Comparative gene prediction in human and mouse. Genome Res. 13, 108–117.
Pennacchio, L. A. and Rubin, E. M. (2001) Genomic strategies to identify mammalian regulatory sequences. Nat. Rev. Genet. 2, 100–109.
Hardison, R. C. (2003) Primer on comparative genomics. Public Library of Science, Biology 1, 156–160.
Wasserman, W. W. and Sandelin, A. (2004) Applied bioinformatics for the identification of regulatory elements. Nat. Rev. Genet. 5, 276–287.
Gumucio, D., Shelton, D., Zhu, W., et al. (1996) Evolutionary strategies for the elucidation of cis and trans factors that regulate the developmental switching programs of the beta-like globin genes. Mol. Phylog. Evol. 5, 18–32.
Loots, G. G., Locksley, R. M., Blankespoor, C. M., et al. (2000) Identification of a coordinate regulator of interleukins 4, 13, and 5 by cross-species sequence comparisons. Science 288, 136–140.
Hardison, R. C. (2000) Conserved noncoding sequences are reliable guides to regulatory elements. Trends Genet. 16, 369–372.
Nobrega, M. A., Ovcharenko, I., Afzal, V., and Rubin, E. M. (2003) Scanning human gene deserts for long-range enhancers. Science 302, 413.
Miller, W., Makova, K. D., Nekrutenko, A., and Hardison, R. C. (2004) Comparative genomics. Annu. Rev. Genomics Hum. Genet. 5, 15–56.
Schwartz, S., Kent, W. J., Smit, A., et al. (2003) Human-mouse alignments with Blastz. Genome Res. 13, 103–105.
Blanchette, M., Kent, W. J., Riemer, C., et al. (2004) Aligning multiple genomic sequences with the threaded blockset aligner. Genome Res. 14, 708–715.
Frazer, K. A., Elnitski, L., Church, D., Dubchak, I., and Hardison, R. C. (2003) Cross-species sequence comparisons: a review of methods and available resources. Genome Res. 13, 1–12.
Siepel, A. Bejerano, G., Pedersen, J. S., et al. (2005) Evolutionarily conserved elements in vertebrate, fly, worm and yeast genomes. Genome Res. 15, 1034–1050.
Kent, W. J., Sugnet, C. W., Furey, T. S., et al. (2002) The human genome browser at UCSC. Genome Res. 12, 996–1006.
Elnitski, L., Hardison, R. C., Li J., et al. (2003) Distinguishing regulatory DNA from neutral sites. Genome Res. 13, 64–72.
Kolbe, D., Taylor, J., Elnitski, L., et al. (2004) Regulatory potential scores from genome-wide three-way alignments of human, mouse and rat. Genome Res. 14, 700–707.
King, D. C., Taylor, J., Elnitski, L., Chiaromonte, F., Miller, W., and Hardison, R. C. (2005) Evaluation of regulatory potential and conservation scores for detecting cis-regulatory modules in aligned mammalian genome sequences. Genome Res. 15, 1051–1060.
Berman, B. P., Pfeiffer, B. D., Laverty, T. R., et al. (2004) Computational identification of developmental enhancers: conservation and function of transcription factor binding-site clusters in Drosophila melanogaster and Drosophila pseudoobscura. Genome Biol. 5, R61.
Wingender, E., Chen, X., Fricke, E., et al. (2001) The TRANSFAC system on gene expression regulation. Nucleic Acids Res. 29, 281–283.
Sandelin, A., Alkema, W., Engstrom, P., Wasserman, W. W., and Lenhard, B. (2004) JASPAR: an open-access database for eukaryotic transcription factor binding profiles. Nucleic Acids Res. 32, (Database issue) D91–D94.
Tompa, M., Li, N., Bailey, T. L., et al. (2005) Assessing computational tools for the discovery of transcription factor binding sites. Nat. Biotechnol. 23, 137–144.
Gibbs, R. A., Weinstock, G. M., Metzker, M. L., et al. (2004) Genome sequence of the Brown Norway rat yields insights into mammalian evolution. Nature 428, 493–521.
Schwartz, S., Elnitski, L., Li, M., et al. (2003) MultiPipMaker and supporting tools: alignments and analysis of multiple genomic DNA sequences. Nucleic Acids Res. 31, 3518–3524.
Giardine, B. M., Elnitski, L., Riemer, C., et al. (2003) GALA, a database for genomic sequence alignments and annotations. Genome Res. 13, 732–741.
Elnitski, L., Giardine, B., Shah, P., et al. (2005) Improvements to GALA and dbERGEII: Databases featuring genomic sequence alignment, annotation and experimental results. Nucleic Acids Res. 32, (Database issue) D466–D447.
Giardine, B., Riemer, C., Hardison, R. C., et al. (2005) Galaxy: a platform for interactive large-scale genome analysis. Genome Res. 15, 1451–1455.
Karolchik, D., Hinrichs, A. S., Furey, T. S., et al. (2004) The UCSC Table Browser data retrieval tool. Nucleic Acids Res. 32, D493–D496.
Weiss, M. J. and Orkin, S. H. (1995) GATA transcription factors: key regulators of hematopoiesis. Exp. Hematol. 23, 99–107.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Humana Press Inc.
About this protocol
Cite this protocol
Elnitski, L., King, D., Hardison, R.C. (2006). Computational Prediction of cis-Regulatory Modules from Multispecies Alignments Using Galaxy, Table Browser, and GALA. In: Bina, M. (eds) Gene Mapping, Discovery, and Expression. Methods in Molecular Biology, vol 338. Humana Press. https://doi.org/10.1385/1-59745-097-9:91
Download citation
DOI: https://doi.org/10.1385/1-59745-097-9:91
Publisher Name: Humana Press
Print ISBN: 978-1-58829-575-0
Online ISBN: 978-1-59745-097-3
eBook Packages: Springer Protocols