UCHIME improves sensitivity and speed of chimera detection
Top Cited Papers
Open Access
- 23 June 2011
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 27 (16) , 2194-2200
- https://doi.org/10.1093/bioinformatics/btr381
Abstract
Motivation: Chimeric DNA sequences often form during polymerase chain reaction amplification, especially when sequencing single regions (e.g. 16S rRNA or fungal Internal Transcribed Spacer) to assess diversity or compare populations. Undetected chimeras may be misinterpreted as novel species, causing inflated estimates of diversity and spurious inferences of differences between populations. Detection and removal of chimeras is therefore of critical importance in such experiments. Results: We describe UCHIME, a new program that detects chimeric sequences with two or more segments. UCHIME either uses a database of chimera-free sequences or detects chimeras de novo by exploiting abundance data. UCHIME has better sensitivity than ChimeraSlayer (previously the most sensitive database method), especially with short, noisy sequences. In testing on artificial bacterial communities with known composition, UCHIME de novo sensitivity is shown to be comparable to Perseus. UCHIME is >100× faster than Perseus and >1000× faster than ChimeraSlayer. Contact:robert@drive5.com Availability: Source, binaries and data: http://drive5.com/uchime. Supplementary information: Supplementary data are available at Bioinformatics online.Keywords
This publication has 21 references indexed in Scilit:
- Removing Noise From Pyrosequenced AmpliconsBMC Bioinformatics, 2011
- Chimeric 16S rRNA sequence formation and detection in Sanger and 454-pyrosequenced PCR ampliconsGenome Research, 2011
- Search and clustering orders of magnitude faster than BLASTBioinformatics, 2010
- An open source chimera checker for the fungal ITS regionMolecular Ecology Resources, 2010
- Recent developments in the MAFFT multiple sequence alignment programBriefings in Bioinformatics, 2008
- New Screening Software Shows that Most Recent Large 16S rRNA Gene Clone Libraries Contain ChimerasApplied and Environmental Microbiology, 2006
- Bellerophon: a program to detect chimeric sequences in multiple sequence alignmentsBioinformatics, 2004
- Areas beneath the relative operating characteristics (ROC) and relative operating levels (ROL) curves: Statistical significance and interpretationQuarterly Journal of the Royal Meteorological Society, 2002
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programsNucleic Acids Research, 1997
- Taxonomic Note: A Place for DNA-DNA Reassociation and 16S rRNA Sequence Analysis in the Present Species Definition in BacteriologyInternational Journal of Systematic and Evolutionary Microbiology, 1994