Effects of Experimental Choices and Analysis Noise on Surveys of the “Rare Biosphere”
- 15 May 2009
- journal article
- research article
- Published by American Society for Microbiology in Applied and Environmental Microbiology
- Vol. 75 (10) , 3263-3270
- https://doi.org/10.1128/aem.01931-08
Abstract
When planning a survey of 16S rRNA genes from a complex environment, investigators face many choices including which primers to use and how to taxonomically classify sequences. In this study, we explored how these choices affected a survey of microbial diversity in a sample taken from the aerobic basin of the activated sludge of a North Carolina wastewater treatment plant. We performed pyrosequencing reactions on PCR products generated from primers targeting the V1-V2, V6, and V6-V7 variable regions of the 16S rRNA gene. We compared these sequences to 16S rRNA gene sequences found in a whole-genome shotgun pyrosequencing run performed on the same sample. We found that sequences generated from primers targeting the V1-V2 variable region had the best match to the whole-genome shotgun reaction across a range of taxonomic classifications from phylum to family. Pronounced differences between primer sets, however, occurred in the "rare biosphere" involving taxa that we observed in fewer than 11 sequences. We also examined the results of analysis strategies comparing a classification scheme using a nearest-neighbor approach to directly classifying sequences with a naïve Bayesian algorithm. Again, we observed pronounced differences between these analysis schemes in infrequently observed taxa. We conclude that if a study is meant to probe the rare biosphere, both the experimental conditions and analysis choices will have a profound impact on the observed results.Keywords
This publication has 31 references indexed in Scilit:
- Molecular Diversity of a North Carolina Wastewater Treatment Plant as Revealed by PyrosequencingApplied and Environmental Microbiology, 2009
- Real-Time DNA Sequencing from Single Polymerase MoleculesScience, 2009
- Naïve Bayesian Classifier for Rapid Assignment of rRNA Sequences into the New Bacterial TaxonomyApplied and Environmental Microbiology, 2007
- Accuracy and quality of massively parallel DNA pyrosequencingGenome Biology, 2007
- A detailed analysis of 16S ribosomal RNA gene segments for the diagnosis of pathogenic bacteriaJournal of Microbiological Methods, 2007
- Optimization of terminal restriction fragment polymorphism (TRFLP) analysis of human gut microbiotaJournal of Microbiological Methods, 2006
- Microbial diversity in the deep sea and the underexplored “rare biosphere”Proceedings of the National Academy of Sciences, 2006
- Greengenes, a Chimera-Checked 16S rRNA Gene Database and Workbench Compatible with ARBApplied and Environmental Microbiology, 2006
- NAST: a multiple sequence alignment server for comparative analysis of 16S rRNA genesNucleic Acids Research, 2006
- MUSCLE: multiple sequence alignment with high accuracy and high throughputNucleic Acids Research, 2004