AbundanceBin, Metagenomic Sequencing

Ye, Yuzhen

doi:10.1007/978-1-4614-6418-1_29-4


Definition

Binning is the unsupervised clustering of metagenomic sequences into an unknown set of species.

AbundanceBin is a binning tool that exploits the differing abundances of the species in a community.
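
To make the idea of abundance-based binning concrete, the following is a minimal sketch and not the AbundanceBin implementation. It assumes, purely for illustration, that each observed count (for example, how often a short sequence word occurs in the reads) comes from a mixture of Poisson distributions with one component per abundance level, and it fits that mixture with expectation-maximization. The function names, the toy data, and the choice of two bins are all hypothetical.

```python
import math
import random


def poisson_logpmf(k, lam):
    # Log of the Poisson probability P(K = k) for mean lam > 0.
    return k * math.log(lam) - lam - math.lgamma(k + 1)


def fit_poisson_mixture(counts, n_bins=2, n_iter=100):
    """Fit a Poisson mixture to integer counts with EM (illustrative only).

    Returns (weights, means) with components sorted by increasing mean.
    """
    lo, hi = min(counts), max(counts)
    # Assumption: spread the starting means evenly over the observed range.
    means = [max(lo + (hi - lo) * (i + 0.5) / n_bins, 1e-9) for i in range(n_bins)]
    weights = [1.0 / n_bins] * n_bins
    for _ in range(n_iter):
        # E-step: posterior probability of each bin for each count.
        resp = []
        for k in counts:
            logs = [math.log(w) + poisson_logpmf(k, m) for w, m in zip(weights, means)]
            top = max(logs)
            probs = [math.exp(v - top) for v in logs]
            total = sum(probs)
            resp.append([p / total for p in probs])
        # M-step: re-estimate mixing weights and Poisson means.
        for j in range(n_bins):
            rj = sum(r[j] for r in resp)
            weights[j] = max(rj / len(counts), 1e-12)
            weighted_sum = sum(r[j] * k for r, k in zip(resp, counts))
            means[j] = max(weighted_sum / max(rj, 1e-12), 1e-9)
    order = sorted(range(n_bins), key=lambda j: means[j])
    return [weights[j] for j in order], [means[j] for j in order]


def assign_bin(count, weights, means):
    # Pick the bin with the highest posterior score for this count.
    scores = [math.log(w) + poisson_logpmf(count, m) for w, m in zip(weights, means)]
    return scores.index(max(scores))


def sample_poisson(lam, rng):
    # Knuth's multiplication method; adequate for the small means used here.
    limit = math.exp(-lam)
    k, p = 0, rng.random()
    while p > limit:
        k += 1
        p *= rng.random()
    return k


# Toy data: counts drawn from a low-abundance and a high-abundance source.
rng = random.Random(0)
counts = [sample_poisson(4, rng) for _ in range(300)]
counts += [sample_poisson(30, rng) for _ in range(300)]

weights, means = fit_poisson_mixture(counts, n_bins=2)
print("estimated abundance levels:", [round(m, 2) for m in means])
print("bin for a count of 3:", assign_bin(3, weights, means))
```

In this toy setting the two abundance levels are well separated, so the fitted means land close to the values used to generate the data. Real communities may contain species with similar abundances, in which case counts alone would not be expected to separate them.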

Introduction

Binning is one of the challenging problems in metagenomics, and it has two main applications. The first is studying the structure of microbial communities. The second is improving the downstream analysis of metagenomic sequences, including metagenome assembly, which has been shown to be extremely difficult: assembling reads one bin at a time significantly reduces the complexity of the assembly problem.

Composition-based methods have been the main approaches to unsupervised classification of reads. These approaches rest on the observation that genome composition (G + C content, dinucleotide frequencies, and synonymous codon usage) varies among organisms and is generally characteristic of evolutionary lineages. Tools in this...
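
As a rough illustration of two of the composition features named above, the sketch below computes G + C content and dinucleotide frequencies for a single sequence. The function names are hypothetical, ambiguous bases such as N are simply ignored, and synonymous codon usage is left out because it requires gene calls; this is not the procedure of any particular tool.

```python
from collections import Counter
from itertools import product


def gc_content(seq):
    """Fraction of unambiguous bases (A, C, G, T) that are G or C."""
    seq = seq.upper()
    acgt = sum(seq.count(base) for base in "ACGT")
    if acgt == 0:
        return 0.0
    return (seq.count("G") + seq.count("C")) / acgt


def dinucleotide_frequencies(seq):
    """Relative frequency of each of the 16 dinucleotides (overlapping)."""
    seq = seq.upper()
    pairs = ["".join(p) for p in product("ACGT", repeat=2)]
    # Count every overlapping window of length 2.
    counts = Counter(seq[i:i + 2] for i in range(len(seq) - 1))
    # Only windows made of A, C, G, T contribute to the total.
    total = sum(counts[p] for p in pairs)
    return {p: (counts[p] / total if total else 0.0) for p in pairs}


read = "ATGCGCGTTAACGGCTAGCTAGGCTA"
print("G + C content:", round(gc_content(read), 3))
freqs = dinucleotide_frequencies(read)
print("most frequent dinucleotide:", max(freqs, key=freqs.get))
```

A composition-based method would typically compute such a feature vector per read or contig and cluster the vectors; short reads yield noisy estimates, which is one reason these features are more informative on longer fragments.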



Author information

Authors and Affiliations

Indiana University, School of Informatics and Computing, 301G Lindley Hall, Bloomington, IN, 47408, USA
Yuzhen Ye


Corresponding author

Correspondence to Yuzhen Ye.

Editor information

Editors and Affiliations

J. Craig Venter Institute (JCVI), Rockville, Maryland, USA
Karen E. Nelson


About this entry

Cite this entry

Ye, Y. (2013). AbundanceBin, Metagenomic Sequencing. In: Nelson, K. (eds) Encyclopedia of Metagenomics. Springer, New York, NY. https://doi.org/10.1007/978-1-4614-6418-1_29-4


DOI: https://doi.org/10.1007/978-1-4614-6418-1_29-4
Received: 20 January 2013
Accepted: 20 January 2013
Published: 04 April 2014
Publisher Name: Springer, New York, NY
Online ISBN: 978-1-4614-6418-1
eBook Packages: Springer Reference Biomedicine and Life Sciences; Reference Module Biomedical and Life Sciences
