Removing Noise From Pyrosequenced Amplicons
Top Cited Papers
Open Access
- 28 January 2011
- journal article
- research article
- Published by Springer Nature in BMC Bioinformatics
- Vol. 12 (1) , 38
- https://doi.org/10.1186/1471-2105-12-38
Abstract
In many environmental genomics applications a homologous region of DNA from a diverse sample is first amplified by PCR and then sequenced. The next generation sequencing technology, 454 pyrosequencing, has allowed much larger read numbers from PCR amplicons than ever before. This has revolutionised the study of microbial diversity as it is now possible to sequence a substantial fraction of the 16S rRNA genes in a community. However, there is a growing realisation that because of the large read numbers and the lack of consensus sequences it is vital to distinguish noise from true sequence diversity in this data. Otherwise this leads to inflated estimates of the number of types or operational taxonomic units (OTUs) present. Three sources of error are important: sequencing error, PCR single base substitutions and PCR chimeras. We present AmpliconNoise, a development of the PyroNoise algorithm that is capable of separately removing 454 sequencing errors and PCR single base errors. We also introduce a novel chimera removal program, Perseus, that exploits the sequence abundances associated with pyrosequencing data. We use data sets where samples of known diversity have been amplified and sequenced to quantify the effect of each of the sources of error on OTU inflation and to validate these algorithms.This publication has 26 references indexed in Scilit:
- Rapidly denoising pyrosequencing amplicon reads by exploiting rank-abundance distributionsNature Methods, 2010
- Ironing out the wrinkles in the rare biosphere through improved OTU clusteringEnvironmental Microbiology, 2010
- Hepatitis C Virus Transmission Bottlenecks Analyzed by Deep SequencingJournal of Virology, 2010
- Organismal, genetic, and transcriptional variation in the deeply sequenced gut microbiomes of identical twinsProceedings of the National Academy of Sciences, 2010
- Wrinkles in the rare biosphere: pyrosequencing errors can lead to artificial inflation of diversity estimatesEnvironmental Microbiology, 2009
- Accurate determination of microbial diversity from 454 pyrosequencing dataNature Methods, 2009
- Microbial Population Structures in the Deep Marine BiosphereScience, 2007
- Accuracy and quality of massively parallel DNA pyrosequencingGenome Biology, 2007
- Microbial diversity in the deep sea and the underexplored “rare biosphere”Proceedings of the National Academy of Sciences, 2006
- Genome sequencing in microfabricated high-density picolitre reactorsNature, 2005