TaxCollector: Modifying Current 16S rRNA Databases for the Rapid Classification at Six Taxonomic Levels
Open Access
- 20 July 2010
- Vol. 2 (7) , 1015-1025
- https://doi.org/10.3390/d2071015
Abstract
The high level of conservation of 16S ribosomal RNA gene (16S rRNA) in all Prokaryotes makes this gene an ideal tool for the rapid identification and classification of these microorganisms. Databases such as the Ribosomal Database Project II (RDP-II) and the Greengenes Project offer access to sets of ribosomal RNA sequence databases useful in identification of microbes in a culture-independent analysis of microbial communities. However, these databases do not contain all of the taxonomic levels attached to the published names of the bacterial and archaeal sequences. TaxCollector is a set of scripts developed in Python language that attaches taxonomic information to all 16S rRNA sequences in the RDP-II and Greengenes databases. These modified databases are referred to as TaxCollector databases, which when used in conjunction with BLAST allow for rapid classification of sequences from any environmental or clinical source at six different taxonomic levels, from domain to species. The TaxCollector database prepared from the RDP-II database is an important component of a new 16S rRNA pipeline called PANGEA. The usefulness of TaxCollector databases is demonstrated with two very different datasets obtained using samples from a clinical setting and an agricultural soil. The six TaxCollector scripts are freely available on http://taxcollector.sourceforge.net and on http://www.microgator.org.Keywords
This publication has 52 references indexed in Scilit:
- Prime-Boost Strategies in Mucosal Immunization Affect Local IgA Production and the Type of Th ResponseFrontiers in Immunology, 2013
- PANGEA: pipeline for analysis of next generation ampliconsThe ISME Journal, 2010
- Fast rise of broadly cross-reactive antibodies after boosting long-lived human memory B cells primed by an MF59 adjuvanted prepandemic vaccineProceedings of the National Academy of Sciences, 2009
- Culture-independent identification of gut bacteria correlated with the onset of diabetes in a rat modelThe ISME Journal, 2009
- A renaissance for the pioneering 16S rRNA genePublished by Elsevier ,2008
- Accurate taxonomy assignments from 16S rRNA sequences produced by highly parallel pyrosequencersNucleic Acids Research, 2008
- Error-correcting barcoded primers for pyrosequencing hundreds of samples in multiplexNature Methods, 2008
- Pyrosequencing enumerates and contrasts soil microbial diversityThe ISME Journal, 2007
- The ribosomal database project (RDP-II): introducing myRDP space and quality controlled public dataNucleic Acids Research, 2006
- A Greedy Algorithm for Aligning DNA SequencesJournal of Computational Biology, 2000