Developing a test collection for biomedical word sense disambiguation.
- 1 January 2001
- journal article
- p. 746-50
Abstract
Ambiguity, the phenomenon that a word has more than one sense, poses difficulties for many current Natural Language Processing (NLP) systems. Algorithms that assist in the resolution of these ambiguities, i.e. which make unambiguous a word, or more generally, a text string, will boost performance of these systems. To test such techniques in the biomedical language domain, we have developed a Word Sense Disambiguation (WSD) test collection that comprises 5,000 unambiguous instances for 50 ambiguous UMLS Metathesaurus strings.This publication has 7 references indexed in Scilit:
- UMLS Concept Indexing for Production Databases: A Feasibility StudyJournal of the American Medical Informatics Association, 2001
- The NLM Indexing Initiative.2000
- A broad-coverage natural language processing system.2000
- Text-based discovery in biomedicine: the architecture of the DAD-system.2000
- The effect of textual variation on concept based information retrieval.1996
- Ambiguity resolution while mapping free text to the UMLS Metathesaurus.1994
- Migraine and Magnesium: Eleven Neglected ConnectionsPerspectives in Biology and Medicine, 1988