The COG database: new developments in phylogenetic classification of proteins from complete genomes
Open Access
- 1 January 2001
- journal article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 29 (1) , 22-28
- https://doi.org/10.1093/nar/29.1.22
Abstract
The database of Clusters of Orthologous Groups of proteins (COGs), which represents an attempt at a phylogenetic classification of the proteins encoded in complete genomes, currently consists of 2791 COGs comprising 45,350 proteins from 30 genomes of bacteria, archaea and the yeast Saccharomyces cerevisiae (http://www.ncbi.nlm.nih.gov/COG). In addition, a supplement to the COGs is available that includes proteins encoded in the genomes of two multicellular eukaryotes, the nematode Caenorhabditis elegans and the fruit fly Drosophila melanogaster, that are shared with bacteria and/or archaea. The new features added to the COG database include information pages with structural and functional details and literature references for each COG, improvements to the COGNITOR program, which is used to fit new proteins into the COGs, and classifications of genomes and COGs constructed by principal component analysis.