Word segmentation processing: a way to exponentially extend medical dictionaries.
- 1 January 1995
- journal article
- p. 28-32
Abstract
One of the most critical problems of automatic natural language processing (NLP) is the size of the medical lexicons. The set of compound medical words and the continual creation of new terms renders medical lexicons exhaustive beyond question. The structure of such dictionaries usually consists of two parts: 1) the morphological and sometimes syntactical information necessary to identify, on a grapheme level, a given word in a sentence, and 2) the part often devoted to conceptual knowledge associated with the recognized word. It is only when these two prerequisites are fulfilled that an attempt to understand the meaning of a whole expression is possible. The approach developed in this paper is a pragmatic way to rapidly increase the lexico-semantic part of medical dictionaries. We developed a semi-automatic tool, as a prototype to demonstrate the feasibility of this approach. This tool is able to translate almost any diagnosis expressed in French into its equivalent in the ICD-9CM coding scheme.This publication has 0 references indexed in Scilit: