Putting data integration into practice: using biomedical terminologies to add structure to existing data sources.
- 1 January 2003
- journal article
- Vol. 2003, 125-9
Abstract
A major purpose of biomedical terminologies is to provide uniform concept representation, allowing for improved methods of analysis of biomedical information. While this goal is being realized in bioinformatics, with the emergence of the Gene Ontology as a standard, there is still no real standard for the representation of clinical concepts. As discoveries in biology and clinical medicine move from parallel to intersecting paths, standardized representation will become more important. A large portion of significant data, however, is mainly represented as free text, upon which conducting computer-based inferencing is nearly impossible. In order to test our hypothesis that existing biomedical terminologies, specifically the UMLS Metathesaurus and SNOMED CT, could be used as templates to implement semantic and logical relationships over free text data that is important both clinically and biologically, we chose to analyze OMIM (Online Mendelian Inheritance in Man). After finding OMIM entries' conceptual equivalents in each respective terminology, we extracted the semantic relationships that were present and evaluated a subset of them for semantic, logical, and biological legitimacy. Our study reveals the possibility of putting the knowledge present in biomedical terminologies to its intended use, with potentially clinically significant consequences.
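The two steps named in the abstract (finding each entry's conceptual equivalent in a terminology, then extracting the relationships that hold between the mapped concepts) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not say how matching was done, so exact matching after simple string normalization is assumed here. The concept identifiers, synonyms, relation names, and entry identifiers are all hypothetical toy data, not values taken from the UMLS Metathesaurus, SNOMED CT, or OMIM.

```python
# Hypothetical toy terminology: concept id -> set of names and synonyms.
CONCEPTS = {
    "C001": {"cystic fibrosis", "mucoviscidosis"},
    "C002": {"lung"},
    "C003": {"cftr gene"},
}

# Hypothetical relationship triples: (source concept, relation, target concept).
RELATIONS = [
    ("C001", "has_finding_site", "C002"),
    ("C003", "associated_with", "C001"),
]

# Hypothetical free-text entry titles keyed by made-up entry ids.
ENTRIES = {
    "entry-1": "CYSTIC FIBROSIS",
    "entry-2": "Cftr gene",
    "entry-3": "Unmapped disorder",
}


def normalize(term: str) -> str:
    """Lowercase, replace punctuation with spaces, collapse whitespace."""
    cleaned = "".join(
        ch if ch.isalnum() or ch.isspace() else " " for ch in term.lower()
    )
    return " ".join(cleaned.split())


def build_index(concepts):
    """Map every normalized name or synonym to its concept id."""
    return {
        normalize(name): concept_id
        for concept_id, names in concepts.items()
        for name in names
    }


def map_entries(entries, index):
    """Return {entry id: concept id} for titles that match after normalization."""
    mapped = {}
    for entry_id, title in entries.items():
        concept_id = index.get(normalize(title))
        if concept_id is not None:
            mapped[entry_id] = concept_id
    return mapped


def relations_among(mapped, relations):
    """Keep only relations whose source and target are both mapped concepts."""
    concept_ids = set(mapped.values())
    return [
        (src, rel, dst)
        for src, rel, dst in relations
        if src in concept_ids and dst in concept_ids
    ]


mapped = map_entries(ENTRIES, build_index(CONCEPTS))
extracted = relations_among(mapped, RELATIONS)
```

In this toy run, two of the three entries map to concepts, and only the relation whose two ends are both mapped entries survives. Judging the extracted relations for semantic, logical, and biological legitimacy, as the abstract describes, would remain a manual review step outside this sketch.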