Identification of OBO nonalignments and its implications for OBO enrichment
Open Access
- 7 May 2008
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 24 (12) , 1448-1455
- https://doi.org/10.1093/bioinformatics/btn194
Abstract
Motivation: Existing projects that focus on the semiautomatic addition of links between existing terms in the Open Biomedical Ontologies can take advantage of reasoners. Based on the added formal definitions, these reasoners can make new inferences between terms, and those inferences reflect nonalignments between the linked terms. However, these projects require that the definitions be necessary and sufficient, a strong requirement that often does not hold. If such definitions cannot be added, the reasoners cannot point to the nonalignments by suggesting new inferences.

Results: We describe a methodology by which we have identified over 1900 instances of nonredundant nonalignments between terms from the Gene Ontology (GO) biological process (BP), cellular component (CC) and molecular function (MF) ontologies, Chemical Entities of Biological Interest (ChEBI) and the Cell Type Ontology (CL). Many of the 39.8% of these nonalignments whose object terms are more atomic than the subject terms are not currently examined in other ontology-enrichment projects, because the necessary and sufficient conditions required for the inferences are not currently examined. Analysis of the ratios of nonalignments to the assertions from which they were identified suggests that BP–MF, BP–BP, BP–CL and CC–CC terms are relatively well aligned, while ChEBI–MF, BP–ChEBI and CC–MF terms are relatively poorly aligned. We propose four ways to resolve an identified nonalignment and recommend an analogous implementation of our methodology in ontology-enrichment tools to identify types of nonalignments that are currently not detected.

Availability: The nonalignments discussed in this article may be viewed at http://compbio.uchsc.edu/Hunter_lab/Bada/nonalignments_2008_03_06.html. Code for the generation of these nonalignments is available upon request.

Contact: mike.bada@uchsc.edu