Skip to main content

A Comprehensive Analysis Workflow for Genome-Wide Screening Data from ChIP-Sequencing Experiments

  • Conference paper
  • 1202 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNBI,volume 5462))

Abstract

ChIP-sequencing is a new technique for generating short DNA sequences useful in analyzing DNA-protein interactions and carrying out genome-wide studies. Although there are some studies to process and analyze ChIP-sequencing data, a complete workflow has not been reported yet. The size of the data and broad range of biological questions are the main challenges to establish a data analysis workflow for ChIP-sequencing data. In this paper, we present the ChIP-sequencing data analysis workflow that we developed at the Ohio State University Comprehensive Cancer Center Bioinformatics Shared Resources. This pipeline utilizes 1) use of different mapping algorithms such as Eland, MapReads, SeqMap, RMAP to align short sequence reads to the reference genome 2) a novel normalization algorithm to detect significant binding densities and to compare binding densities of different experiments 3) gene database mapping and 3D binding density visualization 4) distributed computing and high performance computing (HPC) support.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Johnson, D.S., Mortazavi, A., Myers, R.M., Wold, B.: Genome-wide mapping of in vivo protein-DNA interactions. Science 316(5830), 1497–1502 (2007)

    Article  CAS  PubMed  Google Scholar 

  2. Robertson, G., Hirst, M., Bainbridge, M., Bilenky, M., Zhao, Y., Zeng, T., Euskirchen, G., Bernier, B., Varhol, R., Delaney, A., Thiessen, N., Griffith, O.L., He, A., Marra, M., Snyder, M., Jones, S.: Genome-wide profiles of STAT1 DNA association using chromatin immunoprecipitation and massively parallel sequencing. Nature Methods 4, 651–657 (2007)

    Article  CAS  PubMed  Google Scholar 

  3. Cox, A.: ELAND: Efficient Local Alignment of Nucleotide Data (unpublished)

    Google Scholar 

  4. Zhang, Z., et al.: Fast flexible mapping of AB SOLiD short sequence reads (unpublished)

    Google Scholar 

  5. Jiangi, H., Wong, W.H.: SeqMap: mapping massive amount of oligonucleotides to the Genome. Bioinformatics 24(20), 2395–2396 (2008)

    Article  Google Scholar 

  6. Smith, A.D., Xuan, Z., Zhang, M.Q.: Using quality scores and longer reads improves accuracy of Solexa read mapping. BMC Bioinformatics 9, 128 (2008)

    Article  PubMed  PubMed Central  Google Scholar 

  7. Li, W., Carroll, J.S., Brown, M., Liu, S.: xMAN: extreme MApping of OligoNucleotides. BMC Genomics 9(suppl. 1), S20 (2008)

    Article  Google Scholar 

  8. Bozdag, D., Barbacioru, C., Catalyurek, U.: Parallel Short Sequence Mapping for High Throughput Genome Sequencing. In: 23rd International Parallel and Distributed Processing Symposium (to appear) (2009)

    Google Scholar 

  9. Beynon, M.D., Kurc, T., Catalyurek, U., Chang, C., Sussman, A., Saltz, J.: Distributed processing of very large datasets with DataCutter. Parallel Computing 27(11), 1457–1478 (2001)

    Article  Google Scholar 

  10. Dudoit, S., Yang, Y.H., Callow, M.J., Speed, T.P.: Statistical methods for identifying differentially expressed genes in replicated cDNA microarray experiments. Statistica Sinica 12, 111–140 (2002)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2009 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Ozer, H.G. et al. (2009). A Comprehensive Analysis Workflow for Genome-Wide Screening Data from ChIP-Sequencing Experiments. In: Rajasekaran, S. (eds) Bioinformatics and Computational Biology. BICoB 2009. Lecture Notes in Computer Science(), vol 5462. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-00727-9_30

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-00727-9_30

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-00726-2

  • Online ISBN: 978-3-642-00727-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics