Extracting structured information from free text pathology reports.
- 1 January 2003
- journal article
- research article
- Vol. 2003, 584-8
Abstract
We have developed a method that extracts structured information about specimens and their related findings in free-text surgical pathology reports. Our method uses regular expressions that drive a state-automaton on top of XSLT and Java. Text fragments identified are coded against the UMLS. This paper describes the technical approach and reports on a preliminary evaluation study, designed to guide further development. We found that of 275 reviewed reports, 91% were coded at least so that all specimens and their critical pathologic findings were represented in codes.This publication has 7 references indexed in Scilit:
- A successful technique for removing names in pathology reports using an augmented search and replace method.2002
- Use of General-purpose Negation Detection to Augment Concept Indexing of Medical Documents: A Quantitative Study Using the UMLSJournal of the American Medical Informatics Association, 2001
- Effective mapping of biomedical text to the UMLS Metathesaurus: the MetaMap program.2001
- Quality Assurance in Anatomic Pathology: Automated SNOMED CodingJournal of the American Medical Informatics Association, 1996
- Unlocking Clinical Data from Narrative Reports: A Study of Natural Language ProcessingAnnals of Internal Medicine, 1995
- Experience with a mixed semantic/syntactic parser.1995
- Natural Language Processing and the Representation of Clinical DataJournal of the American Medical Informatics Association, 1994