Bambus 2: scaffolding metagenomes
Open Access
- 16 September 2011
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 27 (21) , 2964-2971
- https://doi.org/10.1093/bioinformatics/btr520
Abstract
Motivation: Sequencing projects increasingly target samples from non-clonal sources. In particular, metagenomics has enabled scientists to begin to characterize the structure of microbial communities. The software tools developed for assembling and analyzing sequencing data for clonal organisms are, however, unable to adequately process data derived from non-clonal sources. Results: We present a new scaffolder, Bambus 2, to address some of the challenges encountered when analyzing metagenomes. Our approach relies on a combination of a novel method for detecting genomic repeats and algorithms that analyze assembly graphs to identify biologically meaningful genomic variants. We compare our software to current assemblers using simulated and real data. We demonstrate that the repeat detection algorithms have higher sensitivity than current approaches without sacrificing specificity. In metagenomic datasets, the scaffolder avoids false joins between distantly related organisms while obtaining long-range contiguity. Bambus 2 represents a first step toward automated metagenomic assembly. Availability: Bambus 2 is open source and available from http://amos.sf.net. Contact:mpop@umiacs.umd.edu Supplementary Information: Supplementary data are available at Bioinformatics online.Keywords
This publication has 47 references indexed in Scilit:
- Opera: Reconstructing Optimal Genomic Scaffolds with High-Throughput Paired-End SequencesJournal of Computational Biology, 2011
- Meta-IDBA: a de Novo assembler for metagenomic dataBioinformatics, 2011
- Succession of microbial consortia in the developing infant gut microbiomeProceedings of the National Academy of Sciences, 2010
- A core gut microbiome in obese and lean twinsNature, 2008
- Aggressive assembly of pyrosequencing reads with matesBioinformatics, 2008
- Velvet: Algorithms for de novo short read assembly using de Bruijn graphsGenome Research, 2008
- ALLPATHS: De novo assembly of whole-genome shotgun microreadsGenome Research, 2008
- Use of simulated data sets to evaluate the fidelity of metagenomic processing methodsNature Methods, 2007
- Community structure and metabolism through reconstruction of microbial genomes from the environmentNature, 2004
- CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choiceNucleic Acids Research, 1994