Accurate taxonomy assignments from 16S rRNA sequences produced by highly parallel pyrosequencers
Top Cited Papers
Open Access
- 6 September 2008
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 36 (18) , e120
- https://doi.org/10.1093/nar/gkn491
Abstract
The recent introduction of massively parallel pyrosequencers allows rapid, inexpensive analysis of microbial community composition using 16S ribosomal RNA (rRNA) sequences. However, a major challenge is to design a workflow so that taxonomic information can be accurately and rapidly assigned to each read, so that the composition of each community can be linked back to likely ecological roles played by members of each species, genus, family or phylum. Here, we use three large 16S rRNA datasets to test whether taxonomic information based on the full-length sequences can be recaptured by short reads that simulate the pyrosequencer outputs. We find that different taxonomic assignment methods vary radically in their ability to recapture the taxonomic information in full-length 16S rRNA sequences: most methods are sensitive to the region of the 16S rRNA gene that is targeted for sequencing, but many combinations of methods and rRNA regions produce consistent and accurate results. To process large datasets of partial 16S rRNA sequences obtained from surveys of various microbial communities, including those from human body habitats, we recommend the use of Greengenes or RDP classifier with fragments of at least 250 bases, starting from one of the primers R357, R534, R798, F343 or F517.Keywords
This publication has 33 references indexed in Scilit:
- Phylogenetic classification of short environmental DNA fragmentsNucleic Acids Research, 2008
- Error-correcting barcoded primers for pyrosequencing hundreds of samples in multiplexNature Methods, 2008
- The Human Microbiome ProjectNature, 2007
- Short pyrosequencing reads suffice for accurate microbial community analysisNucleic Acids Research, 2007
- Human gut microbes associated with obesityNature, 2006
- An obesity-associated gut microbiome with increased capacity for energy harvestNature, 2006
- The ribosomal database project (RDP-II): introducing myRDP space and quality controlled public dataNucleic Acids Research, 2006
- NAST: a multiple sequence alignment server for comparative analysis of 16S rRNA genesNucleic Acids Research, 2006
- Obesity alters gut microbial ecologyProceedings of the National Academy of Sciences, 2005
- Basic Local Alignment Search ToolJournal of Molecular Biology, 1990