Evaluating the UMLS as a source of lexical knowledge for medical language processing.
- 1 January 2001
- journal article
- research article
- p. 189-93
Abstract
Medical language processing (MLP) systems rely on specialized lexicons in order to recognize, classify, and normalize medical terminology, and the performance of an MLP system is dependent on the coverage and quality of such lexicons. However, the acquisition of lexical knowledge is expensive and time-consuming. The UMLS is a comprehensive resource that can be used to acquire lexical knowledge needed for medical language processing. This paper describes methods that use these resources to automatically create lexical entries and generate two lexicons. The first lexicon was created primarily using the UMLS, whereas the second was created by supplementing the lexicon of an existing MLP system called MedLEE with entries based on the UMLS. We subsequently carried out a study, which is the primary focus of this paper, using MedLEE with each of the two lexicons and also the current MedLEE lexicon to measure performance. Overall accuracy, sensitivity, and specificity using the lexicon primarily based on the UMLS were.86,.60, and.96 respectively. Those measures using the MedLEE lexicon alone were.93,.81, and.93, which was significantly better except for specificity; performance using the supplemental lexicon was exactly the same as performance using solely the MedLEE lexicon.This publication has 7 references indexed in Scilit:
- A broad-coverage natural language processing system.2000
- A Semantic Lexicon for Medical Language ProcessingJournal of the American Medical Informatics Association, 1999
- Automating a severity score guideline for community-acquired pneumonia employing medical language processing of discharge summaries.1999
- Acquisition of lexical resources from SNOMED for medical language processing.1998
- Automated access to a large medical dictionary: Online assistance for research and application in natural language processingComputers and Biomedical Research, 1990
- Published by Stichting OpenAccess Foundation