VMD: a community annotation database for oomycetes and microbial genomes
Open Access
- 1 January 2006
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 34 (90001), D379-D381
- https://doi.org/10.1093/nar/gkj042
Abstract
The VBI Microbial Database (VMD) is a database system designed to host a range of microbial genome sequences. At present, the database contains genome sequence and annotation data for two plant pathogens, Phytophthora sojae and Phytophthora ramorum. With the completion of the draft genome sequences of these pathogens in collaboration with the DOE Joint Genome Institute (JGI), we have created this resource to make the sequences publicly available. The genome sequences (95 MB for P.sojae and 65 MB for P.ramorum) were annotated with ∼19 000 and ∼16 000 gene models, respectively. We used two different statistical methods to validate these gene models: Fickett's method and a log-likelihood method. Functional annotation of the gene models is based on results from BlastX and InterProScan screens. From the InterProScan results, we could assign putative functions to 17 694 genes in P.sojae and 14 700 genes in P.ramorum. We created an easy-to-use genome browser to view the genome sequence data, which opens to detailed annotation pages for each gene model. A community annotation interface is available for registered community members to add or edit annotations. About 1600 gene models for P.sojae and ∼700 models for P.ramorum have already been manually curated. A toolkit is provided as an additional resource for users to perform a variety of sequence analysis jobs. The database is publicly available at http://phytophthora.vbi.vt.edu/.