Predicting Transcription Factor Binding Sites and Their Cognate Transcription Factors Using Gene Expression Data

Plant Gene Regulatory Networks

Part of the book series: Methods in Molecular Biology (MIMB, volume 1629)

Abstract

A transcription factor (TF) is a DNA-binding protein that targets specific binding sites (TFBSs) to regulate the transcript levels of its downstream genes. Identifying TF-TFBS pairs is therefore a pivotal step in understanding the function of TFs and the regulatory network of an organism. Here, we describe two methods, one for predicting the TFBS of a given TF and one for predicting the cognate TF of a given TFBS, from a set of strongly co-expressed genes, using time-course transcriptome data of developing maize leaves.
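As a rough illustration of what "a set of strongly co-expressed genes" can mean in practice, the sketch below selects genes whose time-course expression profiles correlate strongly with that of a TF. It is a minimal sketch under stated assumptions, not the procedure described in the chapter: the use of the Pearson correlation coefficient, the 0.9 cutoff, the function names, and the toy expression values are all hypothetical choices made for the example.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length profiles."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        # A flat profile carries no co-expression signal; treat it as uncorrelated.
        return 0.0
    return cov / sqrt(var_x * var_y)


def coexpressed_genes(tf_id, expression, threshold=0.9):
    """Return the genes whose profile correlates with the TF at or above threshold.

    `expression` maps a gene ID to its expression values across time points.
    The threshold of 0.9 is an assumed cutoff, not a value from the chapter.
    """
    tf_profile = expression[tf_id]
    return sorted(
        gene
        for gene, profile in expression.items()
        if gene != tf_id and pearson(tf_profile, profile) >= threshold
    )


# Toy time-course data (hypothetical values, four time points).
expression = {
    "TF_A": [1.0, 2.0, 4.0, 8.0],
    "gene_1": [2.0, 4.0, 8.0, 16.0],  # rises with the TF
    "gene_2": [8.0, 4.0, 2.0, 1.0],   # falls as the TF rises
    "gene_3": [3.0, 3.0, 3.0, 3.0],   # flat profile
}

print(coexpressed_genes("TF_A", expression))
```

In this toy data set only `gene_1` tracks the TF closely enough to pass the cutoff. The promoter sequences of such a gene set would be the natural input to a motif discovery step.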

Author information

Correspondence to Wen-Hsiung Li.

Copyright information

© 2017 Springer Science+Business Media LLC

About this protocol

Cite this protocol

Yu, CP., Li, WH. (2017). Predicting Transcription Factor Binding Sites and Their Cognate Transcription Factors Using Gene Expression Data. In: Kaufmann, K., Mueller-Roeber, B. (eds) Plant Gene Regulatory Networks. Methods in Molecular Biology, vol 1629. Humana Press, New York, NY. https://doi.org/10.1007/978-1-4939-7125-1_17

  • DOI: https://doi.org/10.1007/978-1-4939-7125-1_17

  • Publisher Name: Humana Press, New York, NY

  • Print ISBN: 978-1-4939-7124-4

  • Online ISBN: 978-1-4939-7125-1

  • eBook Packages: Springer Protocols
