The Subsystems Approach to Genome Annotation and its Use in the Project to Annotate 1000 Genomes
Top Cited Papers
Open Access
- 25 September 2005
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 33 (17) , 5691-5702
- https://doi.org/10.1093/nar/gki866
Abstract
The release of the 1000th complete microbial genome will occur in the next two to three years. In anticipation of this milestone, the Fellowship for Interpretation of Genomes (FIG) launched the Project to Annotate 1000 Genomes. The project is built around the principle that the key to improved accuracy in high-throughput annotation technology is to have experts annotate single subsystems over the complete collection of genomes, rather than having an annotation expert attempt to annotate all of the genes in a single genome. Using the subsystems approach, all of the genes implementing the subsystem are analyzed by an expert in that subsystem. An annotation environment was created where populated subsystems are curated and projected to new genomes. A portable notion of a populated subsystem was defined, and tools developed for exchanging and curating these objects. Tools were also developed to resolve conflicts between populated subsystems. The SEED is the first annotation environment that supports this model of annotation. Here, we describe the subsystem approach, and offer the first release of our growing library of populated subsystems. The initial release of data includes 180 177 distinct proteins with 2133 distinct functional roles. This data comes from 173 subsystems and 383 different organisms.Keywords
This publication has 27 references indexed in Scilit:
- Automatic detection of subsystem/pathway variants in genome analysisBioinformatics, 2005
- Ancient Origin of the Tryptophan Operon and the Dynamics of Evolutionary ChangeMicrobiology and Molecular Biology Reviews, 2003
- Inhibitors of Pantothenate Kinase: Novel Antibiotics for Staphylococcal InfectionsAntimicrobial Agents and Chemotherapy, 2003
- Phylogenomic Analysis of the Giardia intestinalis Transcarboxylase Reveals Multiple Instances of Domain Fusion and Fission in the Evolution of Biotin-Dependent EnzymesMicrobial Physiology, 2003
- Mutations in theAUH gene cause 3-methylglutaconic aciduria type IHuman Mutation, 2003
- Missing genes in metabolic pathways: a comparative genomics approachCurrent Opinion in Chemical Biology, 2003
- Function prediction and protein networksCurrent Opinion in Cell Biology, 2003
- Comparative Genomics of Thiamin Biosynthesis in ProcaryotesJournal of Biological Chemistry, 2002
- Conservation of the Biotin Regulon and the BirA Regulatory Signal in Eubacteria and ArchaeaGenome Research, 2002
- Complete Reconstitution of the Human Coenzyme A Biosynthetic Pathway via Comparative GenomicsJournal of Biological Chemistry, 2002