Generation of Multimillion-Sequence 16S rRNA Gene Libraries from Complex Microbial Communities by Assembling Paired-End Illumina Reads
Top Cited Papers
- 1 June 2011
- journal article
- research article
- Published by American Society for Microbiology in Applied and Environmental Microbiology
- Vol. 77 (11) , 3846-3852
- https://doi.org/10.1128/aem.02772-10
Abstract
Microbial communities host unparalleled taxonomic diversity. Adequate characterization of environmental and host-associated samples remains a challenge for microbiologists, despite the advent of 16S rRNA gene sequencing. In order to increase the depth of sampling for diverse bacterial communities, we developed a method for sequencing and assembling millions of paired-end reads from the 16S rRNA gene (spanning the V3 region; ∼200 nucleotides) by using an Illumina genome analyzer. To confirm reproducibility and to identify a suitable computational pipeline for data analysis, sequence libraries were prepared in duplicate for both a defined mixture of DNAs from known cultured bacterial isolates (>1 million postassembly sequences) and an Arctic tundra soil sample (>6 million postassembly sequences). The Illumina 16S rRNA gene libraries represent a substantial increase in number of sequences over all extant next-generation sequencing approaches (e.g., 454 pyrosequencing), while the assembly of paired-end 125-base reads offers a methodological advantage by incorporating an initial quality control step for each 16S rRNA gene sequence. This method incorporates indexed primers to enable the characterization of multiple microbial communities in a single flow cell lane, may be modified readily to target other variable regions or genes, and demonstrates unprecedented and economical access to DNAs from organisms that exist at low relative abundances.Keywords
This publication has 37 references indexed in Scilit:
- BIPES, a cost-effective high-throughput method for assessing microbial diversityThe ISME Journal, 2010
- Comparison of two next-generation sequencing technologies for resolving highly complex microbiota composition using tandem variable 16S rRNA gene regionsNucleic Acids Research, 2010
- Microbial community resemblance methods differ in their ability to detect biologically relevant patternsNature Methods, 2010
- Global patterns of 16S rRNA diversity at a depth of millions of sequences per sampleProceedings of the National Academy of Sciences, 2010
- QIIME allows analysis of high-throughput community sequencing dataNature Methods, 2010
- Metagenomic study of the oral microbiota by Illumina high-throughput sequencingJournal of Microbiological Methods, 2009
- The Ribosomal Database Project: improved alignments and new tools for rRNA analysisNucleic Acids Research, 2008
- Next-generation DNA sequencingNature Biotechnology, 2008
- Error-correcting barcoded primers for pyrosequencing hundreds of samples in multiplexNature Methods, 2008
- Microbial diversity in the deep sea and the underexplored “rare biosphere”Proceedings of the National Academy of Sciences, 2006