A software pipeline for processing and identification of fungal ITS sequences
Open Access
- 15 January 2009
- journal article
- review article
- Published by Springer Nature in Source Code for Biology and Medicine
- Vol. 4 (1) , 1
- https://doi.org/10.1186/1751-0473-4-1
Abstract
Fungi from environmental samples are typically identified to species level through DNA sequencing of the nuclear ribosomal internal transcribed spacer (ITS) region for use in BLAST-based similarity searches in the International Nucleotide Sequence Databases. These searches are time-consuming and regularly require a significant amount of manual intervention and complementary analyses. We here present software – in the form of an identification pipeline for large sets of fungal ITS sequences – developed to automate the BLAST process and several additional analysis steps. The performance of the pipeline was evaluated on a dataset of 350 ITS sequences from fungi growing as epiphytes on building material.This publication has 38 references indexed in Scilit:
- BOV – a web-based BLAST output visualization toolBMC Genomics, 2008
- DIALIGN-TX: greedy and progressive approaches for segment-based multiple sequence alignmentAlgorithms for Molecular Biology, 2008
- Mining metadata from unidentified ITS sequences in GenBank: A case study in Inocybe (Basidiomycota)BMC Ecology and Evolution, 2008
- GenBankNucleic Acids Research, 2007
- Internal transcribed spacer primers and sequences for improved characterization of basidiomycetous orchid mycorrhizasNew Phytologist, 2007
- A basic analysis toolkit for biological sequencesAlgorithms for Molecular Biology, 2007
- HitKeeper, a generic software package for hit list managementSource Code for Biology and Medicine, 2007
- Prospects for fungus identification usingCO1DNA barcodes, withPenicilliumas a test caseProceedings of the National Academy of Sciences, 2007
- MAFFT version 5: improvement in accuracy of multiple sequence alignmentNucleic Acids Research, 2005
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programsNucleic Acids Research, 1997