Creating the Gene Ontology Resource: Design and Implementation
Top Cited Papers
- 1 August 2001
- journal article
- Published by Cold Spring Harbor Laboratory in Genome Research
- Vol. 11 (8) , 1425-1433
- https://doi.org/10.1101/gr.180801
Abstract
The exponential growth in the volume of accessible biological information has generated a confusion of voices surrounding the annotation of molecular information about genes and their products. The Gene Ontology (GO) project seeks to provide a set of structured vocabularies for specific biological domains that can be used to describe gene products in any organism. This work includes building three extensive ontologies to describe molecular function, biological process, and cellular component, and providing a community database resource that supports the use of these ontologies. The GO Consortium was initiated by scientists associated with three model organism databases: SGD, the Saccharomyces Genome database; FlyBase, the Drosophila genome database; and MGD/GXD, the Mouse Genome Informatics databases. Additional model organism database groups are joining the project. Each of these model organism information systems is annotating genes and gene products using GO vocabulary terms and incorporating these annotations into their respective model organism databases. Each database contributes its annotation files to a shared GO data resource accessible to the public athttp://www.geneontology.org/. The GO site can be used by the community both to recover the GO vocabularies and to access the annotated gene product data sets from the model organism databases. The GO Consortium supports the development of the GO database resource and provides tools enabling curators and researchers to query and manipulate the vocabularies. We believe that the shared development of this molecular annotation resource will contribute to the unification of biological information.Keywords
This publication has 16 references indexed in Scilit:
- The Arabidopsis Information Resource (TAIR): a comprehensive database and web-based information retrieval, analysis, and visualization system for a model plantNucleic Acids Research, 2001
- The Genome Sequence of Drosophila melanogasterScience, 2000
- The Mouse Genome Database (MGD): expanding genetic and genomic resources for the laboratory mouseNucleic Acids Research, 2000
- Integrating functional genomic information into the Saccharomyces Genome DatabaseNucleic Acids Research, 2000
- The EcoCyc and MetaCyc databasesNucleic Acids Research, 2000
- Toward principles for the representation of hierarchical knowledge in formal ontologiesData & Knowledge Engineering, 1999
- The FlyBase Database of the Drosophila Genome Projects and community literatureNucleic Acids Research, 1999
- Toward principles for the design of ontologies used for knowledge sharing?International Journal of Human-Computer Studies, 1995
- A translation approach to portable ontology specificationsKnowledge Acquisition, 1993
- MEDLINE and MeSHMedical Reference Services Quarterly, 1992