TACOA – Taxonomic classification of environmental genomic fragments using a kernelized nearest neighbor approach
Open Access
- 11 February 2009
- journal article
- Published by Springer Nature in BMC Bioinformatics
- Vol. 10 (1) , 56
- https://doi.org/10.1186/1471-2105-10-56
Abstract
Metagenomics, or the sequencing and analysis of collective genomes (metagenomes) of microorganisms isolated from an environment, promises direct access to the "unculturable majority". This emerging field offers the potential to lay a solid basis for our understanding of the entire living world. However, taxonomic classification, an essential task in the analysis of metagenomic data sets, is still far from being solved. We present a novel strategy to predict the taxonomic origin of environmental genomic fragments. The proposed classifier combines the idea of the k-nearest neighbor with strategies from kernel-based learning.
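To illustrate the general idea of combining k-nearest neighbor classification with a kernel, here is a minimal Python sketch. It is not the published TACOA implementation. The abstract does not specify the features, the kernel, or the voting rule, so the dinucleotide frequency profile, the Gaussian kernel, its `width` parameter, and the function names `kmer_profile`, `gaussian_kernel`, and `classify` are all assumptions chosen for illustration.

```python
from collections import defaultdict
from itertools import product
import math


def kmer_profile(seq, k=2):
    """Relative frequency of every k-mer over ACGT in seq.

    Windows containing other characters (e.g. N) are skipped.
    The feature choice is an assumption, not taken from the abstract.
    """
    kmers = ["".join(p) for p in product("ACGT", repeat=k)]
    counts = dict.fromkeys(kmers, 0)
    seq = seq.upper()
    for i in range(len(seq) - k + 1):
        word = seq[i:i + k]
        if word in counts:
            counts[word] += 1
    total = sum(counts.values()) or 1  # avoid division by zero
    return [counts[w] / total for w in kmers]


def gaussian_kernel(x, y, width=0.1):
    """Gaussian similarity between two profiles (assumed kernel)."""
    sq_dist = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-sq_dist / (2 * width ** 2))


def classify(query, labeled_refs, k=3, width=0.1, kmer=2):
    """Predict a label for query from (sequence, label) reference pairs.

    The k references most similar to the query under the kernel each
    vote for their label, weighted by their kernel value.
    """
    q = kmer_profile(query, kmer)
    scored = sorted(
        ((gaussian_kernel(q, kmer_profile(s, kmer), width), label)
         for s, label in labeled_refs),
        reverse=True,
    )
    votes = defaultdict(float)
    for weight, label in scored[:k]:
        votes[label] += weight
    return max(votes, key=votes.get)
```

Calling `classify(fragment, references)` with a list of `(sequence, label)` pairs returns the label with the largest summed kernel weight among the `k` nearest references. Weighting votes by kernel value lets close neighbors count for more than distant ones, which is one common way to soften the hard cutoff of plain k-nearest neighbor voting.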