Abstract
Chromatin immunoprecipitation followed by sequencing (ChIP-seq) analysis can detect protein/DNA-binding and histone-modification sites across an entire genome. As there are various factors during sample preparation that affect the obtained results, multilateral quality assessments are essential. Here, we describe a step-by-step protocol using DROMPA, a program for user-friendly ChIP-seq pipelining. DROMPA can be used for quality assessment, data normalization, visualization, peak calling, and multiple statistical analyses.
References
Nakato R, Shirahige K (2017) Recent advances in ChIP-seq analysis: from quality management to whole-genome annotation. Brief Bioinform 18:279. doi:10.1093/bib/bbw023
Gerstein MB, Rozowsky J, Yan KK, Wang D, Cheng C, Brown JB, Davis CA, Hillier L, Sisu C, Li JJ, Pei B, Harmanci AO, Duff MO, Djebali S, Alexander RP, Alver BH, Auerbach R, Bell K, Bickel PJ, Boeck ME, Boley NP, Booth BW, Cherbas L, Cherbas P, Di C, Dobin A, Drenkow J, Ewing B, Fang G, Fastuca M, Feingold EA, Frankish A, Gao G, Good PJ, Guigo R, Hammonds A, Harrow J, Hoskins RA, Howald C, Hu L, Huang H, Hubbard TJ, Huynh C, Jha S, Kasper D, Kato M, Kaufman TC, Kitchen RR, Ladewig E, Lagarde J, Lai E, Leng J, Lu Z, MacCoss M, May G, McWhirter R, Merrihew G, Miller DM, Mortazavi A, Murad R, Oliver B, Olson S, Park PJ, Pazin MJ, Perrimon N, Pervouchine D, Reinke V, Reymond A, Robinson G, Samsonova A, Saunders GI, Schlesinger F, Sethi A, Slack FJ, Spencer WC, Stoiber MH, Strasbourger P, Tanzer A, Thompson OA, Wan KH, Wang G, Wang H, Watkins KL, Wen J, Wen K, Xue C, Yang L, Yip K, Zaleski C, Zhang Y, Zheng H, Brenner SE, Graveley BR, Celniker SE, Gingeras TR, Waterston R (2014) Comparative analysis of the transcriptome across distant species. Nature 512(7515):445–448. doi:10.1038/nature13424
Sutani T, Sakata T, Nakato R, Masuda K, Ishibashi M, Yamashita D, Suzuki Y, Hirano T, Bando M, Shirahige K (2015) Condensin targets and reduces unwound DNA structures associated with transcription in mitotic chromosome condensation. Nat Commun 6:7815. doi:10.1038/ncomms8815
Consortium RE, Kundaje A, Meuleman W, Ernst J, Bilenky M, Yen A, Heravi-Moussavi A, Kheradpour P, Zhang Z, Wang J, Ziller MJ, Amin V, Whitaker JW, Schultz MD, Ward LD, Sarkar A, Quon G, Sandstrom RS, Eaton ML, Wu YC, Pfenning AR, Wang X, Claussnitzer M, Liu Y, Coarfa C, Harris RA, Shoresh N, Epstein CB, Gjoneska E, Leung D, Xie W, Hawkins RD, Lister R, Hong C, Gascard P, Mungall AJ, Moore R, Chuah E, Tam A, Canfield TK, Hansen RS, Kaul R, Sabo PJ, Bansal MS, Carles A, Dixon JR, Farh KH, Feizi S, Karlic R, Kim AR, Kulkarni A, Li D, Lowdon R, Elliott G, Mercer TR, Neph SJ, Onuchic V, Polak P, Rajagopal N, Ray P, Sallari RC, Siebenthall KT, Sinnott-Armstrong NA, Stevens M, Thurman RE, Wu J, Zhang B, Zhou X, Beaudet AE, Boyer LA, De Jager PL, Farnham PJ, Fisher SJ, Haussler D, Jones SJ, Li W, Marra MA, McManus MT, Sunyaev S, Thomson JA, Tlsty TD, Tsai LH, Wang W, Waterland RA, Zhang MQ, Chadwick LH, Bernstein BE, Costello JF, Ecker JR, Hirst M, Meissner A, Milosavljevic A, Ren B, Stamatoyannopoulos JA, Wang T, Kellis M (2015) Integrative analysis of 111 reference human epigenomes. Nature 518(7539):317–330. doi:10.1038/nature14248
Nakato R, Itoh T, Shirahige K (2013) DROMPA: easy-to-handle peak calling and visualization software for the computational analysis and validation of ChIP-seq data. Genes Cells 18(7):589–601. doi:10.1111/gtc.12058
Langmead B, Trapnell C, Pop M, Salzberg SL (2009) Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol 10(3):R25. doi:10.1186/gb-2009-10-3-r25
Langmead B, Salzberg SL (2012) Fast gapped-read alignment with Bowtie 2. Nat Methods 9(4):357–359. doi:10.1038/nmeth.1923
Li H, Durbin R (2009) Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25(14):1754–1760. doi:10.1093/bioinformatics/btp324
Ernst J, Kellis M (2012) ChromHMM: automating chromatin-state discovery and characterization. Nat Methods 9(3):215–216. doi:10.1038/nmeth.1906
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Appendix
Appendix
-
1.
Script for the visualization of human (data obtained from the Sequence Read Archive (SRA) under accession SRP006944)
# convert map files to bin data $ for sample in Control H3K27me3 H3K36me3 H3K4me3 H3K9me3 Pol2_b; do $ for bin in 100 1000 100000;do parse2wig -f BAM -i HeLa-S3_$sample-n2-m1-hg19.sort.bam -o $sample -gt genome_table -binsize $bin; done $ done # make pdf files $ IP1="parse2wigdir/Pol2_b" $ IP2="parse2wigdir/H3K4me3" $ IP3="parse2wigdir/H3K27me3" $ IP4="parse2wigdir/H3K36me3" $ IP5="parse2wigdir/H3K9me3" $ Input="parse2wigdir/Control" # Figure 2a $ s1="-i $IP1,$Input,Pol2_b,,,100" $ s2="-i $IP2,$Input,H3K4me3,,,80" $ s3="-i $IP3,$Input,H3K27me3,,1000,60" $ s4="-i $IP4,$Input,H3K36me3,,1000,60" $ drompa_draw PC_SHARP -gt genome_table -gene refFlat.txt $s1 $s2 $s3 $s4 -p Fig 2a -lpp 1 -chr 1 -ls 1000 -rmchr -show_itag 2 # Figure 2b $ s1="-i $IP2,$Input,H3K4me3" $ s2="-i $IP3,$Input,H3K27me3" $ s3="-i $IP4,$Input,H3K36me3" $ s4="-i $IP5,$Input,H3K9me3" $ drompa_draw GV -gt genome_table $s1 $s2 $s3 $s4 -p Fig 2b_liner -GC GCcontents -gcsize 500000 -GD genedensity -gdsize 500000 $ drompa_draw GV -gt genome_table $s1 $s2 $s3 $s4 -p Fig 2b_log -GC GCcontents -gcsize 500000 -GD genedensity -gdsize 500000 -showratio 2
-
2.
Script for the visualization of S. cerevisiae (data obtained from SRA under accession SRP009385)
$ gt=genome_table_sacCer3 $ index=UCSC-sacCer3-cs # bowtie index for colorspace data # read mapping and parse2wig $ for num in $(seq 398609 398624); do $ prefix=SRR$num $ bowtie -C $index $prefix.fastq -p8 -S > $prefix.sam $ parse2wig -f SAM -i $prefix.sam -o $prefix -gt $gt $done # generate pdf files $ dir=parse2wigdir $ IP1_60="$dir/SRR398612$postfix" # YST1019 Gal 60min $ IP1_0="$dir/SRR398611$postfix" # YST1019 Gal 0min $ IP2_60="$dir/SRR398610$postfix" # YST1019 Raf 60min $ IP2_0="$dir/SRR398609$postfix" # YST1019 Raf 0min $ IP3_60="$dir/SRR398616$postfix" # YST1053 Gal 60min $ IP3_0="$dir/SRR398615$postfix" # YST1053 Gal 0min $ IP4_60="$dir/SRR398614$postfix" # YST1053 Raf 60min $ IP4_0="$dir/SRR398613$postfix" # YST1053 Raf 0min $ IP5_60="$dir/SRR398620$postfix" # YST1076 Gal 60min $ IP5_0="$dir/SRR398618$postfix" # YST1076 Gal 0min $ IP6_60="$dir/SRR398619$postfix" # YST1076 Raf 60min $ IP6_0="$dir/SRR398617$postfix" # YST1076 Raf 0min $ IP7_60="$dir/SRR398624$postfix" # YST1287 Gal 60min $ IP7_0="$dir/SRR398623$postfix" # YST1287 Gal 0min $ IP8_60="$dir/SRR398622$postfix" # YST1287 Raf 60min $ IP8_0="$dir/SRR398621$postfix" # YST1287 Raf 0min $ s1="-i $IP1_60,$IP1_0,YST1019_Gal" $ s2="-i $IP2_60,$IP2_0,YST1019_Raf" $ s3="-i $IP3_60,$IP3_0,YST1053_Gal" $ s4="-i $IP4_60,$IP4_0,YST1053_Raf" $ s5="-i $IP5_60,$IP5_0,YST1076_Gal" $ s6="-i $IP6_60,$IP6_0,YST1076_Raf" $ s7="-i $IP7_60,$IP7_0,YST1287_Gal" $ s8="-i $IP8_60,$IP8_0,YST1287_Raf" $ drompa_draw PC_ENRICH -p SRP009385 -gt $gt -ars ARS-oriDB_scer.txt -gene SGD_features.tab -gftype 3 $s1 $s2 $s3 $s4 $s5 $s6 $s7 $s8 -ls 250 -lpp 2 -scale_ratio 1 -bn 3 -ystep 14 -ethre 1.5
Rights and permissions
Copyright information
© 2018 Springer Science+Business Media LLC
About this protocol
Cite this protocol
Nakato, R., Shirahige, K. (2018). Statistical Analysis and Quality Assessment of ChIP-seq Data with DROMPA. In: Muzi-Falconi, M., Brown, G. (eds) Genome Instability. Methods in Molecular Biology, vol 1672. Humana Press, New York, NY. https://doi.org/10.1007/978-1-4939-7306-4_41
Download citation
DOI: https://doi.org/10.1007/978-1-4939-7306-4_41
Published:
Publisher Name: Humana Press, New York, NY
Print ISBN: 978-1-4939-7305-7
Online ISBN: 978-1-4939-7306-4
eBook Packages: Springer Protocols