Abstract
Genome-wide approaches, and specially high-throughput sequencing technologies, have changed the way biotechnology projects are planned and interpreted. Results are no more restricted to a family of genes but a whole transcriptome or more. Similarly, DNA analyses would not be focused on a small genomic region. Instead, complete chromosomes can be inspected on a single experiment. Therefore, current DNA or RNA sequence-related projects require appropriate ways of storing, visualizing, and integrating both intermediate data files and final results, what made them strongly dependent on specific computational tools. The workflow protocol presented in this chapter describes how to convert raw sequence data obtained with deep-sequencing technologies into standard genomic data file formats, ready to be used with popular genomic visualization programs. Methods to integrate bacterial genomic annotation tables with quantitative differential gene expression results are explained. Finally, a simple yet helpful procedure to sort and filter relevant quantitative results and to qualitatively (graphically) evaluate them is presented. All methods described can be applied using free and widely available software tools.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Metzker ML (2010) Sequencing technologies - the next generation. Nat Rev Genet 11(1):31–46. doi:10.1038/nrg2626
Hawkins RD, Hon GC, Ren B (2010) Next-generation genomics: an integrative approach. Nat Rev Genet 11(7):476–486. doi:10.1038/nrg2795
Wang Z, Gerstein M, Snyder M (2009) RNA-Seq: a revolutionary tool for transcriptomics. Nat Rev Genet 10(1):57–63. doi:10.1038/nrg2484
Zhou L, Li X, Liu Q, Zhao F, Wu J (2011) Small RNA transcriptome investigation based on next-generation sequencing technology. J Genet Genomics 38(11):505–513. doi:10.1016/j.jgg.2011.08.006
Bamshad MJ, Ng SB, Bigham AW, Tabor HK, Emond MJ, Nickerson DA, Shendure J (2011) Exome sequencing as a tool for Mendelian disease gene discovery. Nat Rev Genet 12(11):745–755. doi:10.1038/nrg3031
Bioconductor. Open source software for bioinformatics. http://www.bioconductor.org. Accessed 26 Oct 2014
Kim J, Oliveros JC, Nikel PI, de Lorenzo V, Silva-Rocha R (2013) Transcriptomic fingerprinting of Pseudomonas putida under alternative physiological regimes. Environ Microbiol Rep 5(6):883–891. doi:10.1111/1758-2229.12090
Robinson MD, McCarthy DJ, Smyth GK (2010) edgeR: a bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics 26(1):139–140. doi:10.1093/bioinformatics/btp616
Anders S, Huber W (2010) Differential expression analysis for sequence count data. Genome Biol 11(10):R106. doi:10.1186/gb-2010-11-10-r106
Law CW, Chen Y, Shi W, Smyth GK (2014) Voom: precision weights unlock linear model analysis tools for RNA-seq read counts. Genome Biol 15(2):R29. doi:10.1186/gb-2014-15-2-r29
Ensemble Bacteria (EMBL-EBI). http://bacteria.ensembl.org/index.html. Accessed 26 Oct 2014
Sequence Read Archive (NCBI-NLM). http://www.ncbi.nlm.nih.gov/sra. Accessed 26 Oct 2014
Langmead B, Salzberg SL (2012) Fast gapped-read alignment with Bowtie 2. Nat Methods 9(4):357–359. doi:10.1038/nmeth.1923
Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, Durbin R, 1000 Genome Project Data Processing Subgroup (2009) The sequence alignment/map format and SAMtools. Bioinformatics 25(16):2078–2079. doi:10.1093/bioinformatics/btp352
Anders S, Pyl PT, Huber W (2014) HTSeq-a Python framework to work with high-throughput sequencing data. Bioinformatics. doi:10.1093/bioinformatics/btu638
Robinson JT, Thorvaldsdóttir H, Winckler W, Guttman M, Lander ES, Getz G, Mesirov JP (2011) Integrative genomics viewer. Nat Biotechnol 29(1):24–26. doi:10.1038/nbt.1754
Wikipedia. http://en.wikipedia.org/wiki/FASTA_format. Accessed 26 Oct 2014
The Sequence Ontology Project. http://www.sequenceontology.org/gff3.shtml. Accessed 26 Oct 2014
Wikipedia. http://en.wikipedia.org/wiki/FASTQ_format. Accessed 26 Oct 2014
SAM Format Specification (Samtools). http://samtools.github.io/hts-specs/SAMv1.pdf Accessed 26 Oct 2014
BED Format Specification (UCSC). http://genome.ucsc.edu/FAQ/FAQformat.html#format1. Accessed 26 Oct 2014
GFF (Version 2) Format Specification (Wellcome Trust Sanger Institute). http://www.sanger.ac.uk/resources/software/gff/spec.html. Accessed 26 Oct 2014
van Dijk EL, Jaszczyszyn Y, Thermes C (2014) Library preparation methods for next-generation sequencing: tone down the bias. Exp Cell Res 322(1):12–20. doi:10.1016/j.yexcr.2014.01.008
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer-Verlag Berlin Heidelberg
About this protocol
Cite this protocol
Oliveros, J.C. (2015). Approaches for Displaying Complete Transcriptomes of Environmental Bacteria. In: McGenity, T., Timmis, K., Nogales , B. (eds) Hydrocarbon and Lipid Microbiology Protocols. Springer Protocols Handbooks. Springer, Berlin, Heidelberg. https://doi.org/10.1007/8623_2015_59
Download citation
DOI: https://doi.org/10.1007/8623_2015_59
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-662-50433-8
Online ISBN: 978-3-662-50435-2
eBook Packages: Springer Protocols