AGMIAL: implementing an annotation strategy for prokaryote genomes as a distributed system
Open Access
- 19 July 2006
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 34 (12) , 3533-3545
- https://doi.org/10.1093/nar/gkl471
Abstract
We have implemented a genome annotation system for prokaryotes called AGMIAL. Our approach embodies a number of key principles. First, expert manual annotators are seen as a critical component of the overall system; user interfaces were cyclically refined to satisfy their needs. Second, the overall process should be orchestrated in terms of a global annotation strategy; this facilitates coordination between a team of annotators and automatic data analysis. Third, the annotation strategy should allow progressive and incremental annotation from a time when only a few draft contigs are available, to when a final finished assembly is produced. The overall architecture employed is modular and extensible, being based on the W3 standard Web services framework. Specialized modules interact with two independent core modules that are used to annotate, respectively, genomic and protein sequences. AGMIAL is currently being used by several INRA laboratories to analyze genomes of bacteria relevant to the food-processing industry, and is distributed under an open source license.Keywords
This publication has 68 references indexed in Scilit:
- The complete genome sequence ofLactobacillus bulgaricusreveals extensive and ongoing reductive evolutionProceedings of the National Academy of Sciences, 2006
- The PANTHER database of protein families, subfamilies, functions and pathwaysNucleic Acids Research, 2004
- The FunCat, a functional annotation scheme for systematic classification of proteins from whole genomesNucleic Acids Research, 2004
- Improved Prediction of Signal Peptides: SignalP 3.0Journal of Molecular Biology, 2004
- The Pfam protein families databaseNucleic Acids Research, 2004
- Systems Biology: A Brief OverviewScience, 2002
- Assignment of homology to genome sequences using a library of hidden Markov models that represent all proteins of known structureJournal of Molecular Biology, 2001
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programsNucleic Acids Research, 1997
- tRNAscan-SE: A Program for Improved Detection of Transfer RNA Genes in Genomic SequenceNucleic Acids Research, 1997
- Predicting Coiled Coils from Protein SequencesScience, 1991