THE COMPOSITIONAL STRUCTURE OF GENE ONTOLOGY TERMS
Open Access
- 1 December 2003
- proceedings article
- Published by World Scientific Pub Co Pte Ltd in Pacific Symposium on Biocomputing
Abstract
An analysis of the term names in the Gene Ontology reveals the prevalence of substring relations between terms: 65.3% of all GO terms contain another GO term as a proper substring. This substring relation often coincides with a derivational relationship between the terms. For example, the term regulation of cell proliferation (GO:0042127) is derived from the term cell proliferation (GO:0008283) by addition of the phrase regulation of. Further, we note that particular substrings which are not themselves GO terms (e.g. regulation of in the preceding example) recur frequently and in consistent subtrees of the ontology, and that these frequently occurring substrings often indicate interesting semantic relationships between the related terms. We describe the extent of these phenomena—substring relations between terms, and the recurrence of derivational phrases such as regulation of—and propose that these phenomena can be exploited in various ways to make the information in GO more computationally accessible, to construct a conceptually richer representation of the data encoded in the ontology, and to assist in the analysis of natural language texts.Keywords
This publication has 8 references indexed in Scilit:
- Bringing ontology to the Gene OntologyComparative and Functional Genomics, 2003
- Knowledge acquisition, consistency checking and concurrency control for Gene Ontology (GO)Bioinformatics, 2003
- A METHODOLOGY TO MIGRATE THE GENE ONTOLOGY TO A DESCRIPTION LOGIC ENVIRONMENT USING DAML+OILPacific Symposium on Biocomputing, 2002
- The lexical properties of the gene ontology.2002
- Creating the Gene Ontology Resource: Design and ImplementationGenome Research, 2001
- Effective mapping of biomedical text to the UMLS Metathesaurus: the MetaMap program.2001
- Gene Ontology: tool for the unification of biologyNature Genetics, 2000
- Inhibition of focal adhesion kinase (FAK) signaling in focal adhesions decreases cell motility and proliferation.Molecular Biology of the Cell, 1996