Computational Prediction of cis-Regulatory Modules from Multispecies Alignments Using Galaxy, Table Browser, and GALA

Part of the book series: Methods in Molecular Biology (MIMB, volume 338)

Abstract

One major goal of genomics is to identify all the functional sequences in genomes, including sequences that regulate the expression of genes. Sequence conservation is a good, albeit imperfect, guide to these functional elements. We describe how to use publicly available servers (Galaxy, the UCSC Table Browser, and GALA) to find genomic sequences whose alignments (from blastZ and multiZ) show properties associated with cis-regulatory modules, such as high conservation score, high regulatory potential score, and conserved transcription factor binding sites. Links to these servers can be accessed at http://www.bx.psu.edu/ and http://genome.ucsc.edu/.
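The kind of query summarized in the abstract (intervals that score highly on conservation and on regulatory potential and that also contain a conserved transcription factor binding site) can be illustrated with a minimal Python sketch. This is not the servers' implementation: the `Interval` type, the function names, the thresholds, and the sample coordinates are all hypothetical, and coordinates are assumed to be 0-based, half-open as in BED files.

```python
from typing import List, NamedTuple, Tuple


class Interval(NamedTuple):
    chrom: str
    start: int    # 0-based, inclusive (assumed BED-style convention)
    end: int      # exclusive
    score: float  # conservation or regulatory potential score


Region = Tuple[str, int, int]


def filter_by_score(intervals: List[Interval], min_score: float) -> List[Interval]:
    """Keep intervals whose score meets or exceeds the threshold."""
    return [iv for iv in intervals if iv.score >= min_score]


def intersect(a: List[Interval], b: List[Interval]) -> List[Region]:
    """Return the overlapping portions of intervals in a and b."""
    overlaps = []
    for x in a:
        for y in b:
            if x.chrom != y.chrom:
                continue
            start, end = max(x.start, y.start), min(x.end, y.end)
            if start < end:
                overlaps.append((x.chrom, start, end))
    return sorted(overlaps)


def find_candidates(
    conservation: List[Interval],
    reg_potential: List[Interval],
    tfbs: List[Region],
    min_cons: float,
    min_rp: float,
) -> List[Region]:
    """Regions passing both score thresholds that fully contain a binding site."""
    shared = intersect(
        filter_by_score(conservation, min_cons),
        filter_by_score(reg_potential, min_rp),
    )
    return [
        (chrom, start, end)
        for chrom, start, end in shared
        if any(c == chrom and s >= start and e <= end for c, s, e in tfbs)
    ]


# Hypothetical example data; scores and thresholds are illustrative only.
conservation = [
    Interval("chr1", 100, 300, 0.9),
    Interval("chr1", 500, 700, 0.2),
    Interval("chr2", 50, 150, 0.8),
]
reg_potential = [
    Interval("chr1", 200, 400, 0.05),
    Interval("chr1", 550, 650, 0.3),
    Interval("chr2", 0, 100, 0.001),
]
tfbs = [("chr1", 250, 260), ("chr2", 60, 70)]

candidates = find_candidates(conservation, reg_potential, tfbs, min_cons=0.7, min_rp=0.01)
print(candidates)
```

The nested loop in `intersect` is quadratic and is only meant to make the logic explicit; for genome-scale data, the interval operations provided by the servers themselves would be the appropriate route.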

Copyright information

© 2006 Humana Press Inc.

About this protocol

Cite this protocol

Elnitski, L., King, D., Hardison, R.C. (2006). Computational Prediction of cis-Regulatory Modules from Multispecies Alignments Using Galaxy, Table Browser, and GALA. In: Bina, M. (ed.) Gene Mapping, Discovery, and Expression. Methods in Molecular Biology, vol 338. Humana Press. https://doi.org/10.1385/1-59745-097-9:91

  • DOI: https://doi.org/10.1385/1-59745-097-9:91

  • Publisher Name: Humana Press

  • Print ISBN: 978-1-58829-575-0

  • Online ISBN: 978-1-59745-097-3

  • eBook Packages: Springer Protocols
