Machine Learning and Rule-based Approaches to Assertion Classification
Open Access
- 1 January 2009
- journal article
- research article
- Published by Oxford University Press (OUP) in Journal of the American Medical Informatics Association
- Vol. 16 (1) , 109-115
- https://doi.org/10.1197/jamia.m2950
Abstract
Objectives: The authors study two approaches to assertion classification. One of these approaches, Extended NegEx (ENegEx), extends the rule-based NegEx algorithm to cover alter-association assertions; the other, Statistical Assertion Classifier (StAC), presents a machine learning solution to assertion classification.
Design: For each mention of each medical problem, both approaches determine whether the problem, as asserted by the context of that mention, is present, absent, or uncertain in the patient, or associated with someone other than the patient. The authors use these two systems to (1) extend negation and uncertainty extraction to recognition of alter-association assertions, (2) determine the contribution of lexical and syntactic context to assertion classification, and (3) test whether a machine learning approach to assertion classification can be as generally applicable and useful as its rule-based counterparts.
Measurements: The authors evaluated assertion classification approaches with precision, recall, and F-measure.
Results: The ENegEx algorithm is a general algorithm that can be directly applied to new corpora. Despite being based on machine learning, StAC can also be applied out-of-the-box to new corpora and achieve similar generality.
Conclusion: The StAC models that are developed on discharge summaries can be successfully applied to radiology reports. These models benefit the most from words found in the ±4-word window of the target and can outperform ENegEx.
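The ±4-word window and the evaluation measures named in the abstract can be illustrated with a minimal Python sketch. It is not the StAC implementation: the function names `window_features` and `per_class_scores`, the `L:`/`R:` feature prefixes, and the whitespace tokenization are assumptions made for illustration only, and the example sentence and labels are invented.

```python
# Illustrative sketch only; names and feature encoding are hypothetical,
# not taken from the paper.

# The four assertion classes described in the abstract.
LABELS = ("present", "absent", "uncertain", "alter-association")


def window_features(tokens, start, end, size=4):
    """Collect lowercased words up to `size` tokens to the left and right
    of the problem mention tokens[start:end]."""
    left = tokens[max(0, start - size):start]
    right = tokens[end:end + size]
    return [f"L:{w.lower()}" for w in left] + [f"R:{w.lower()}" for w in right]


def per_class_scores(gold, predicted, label):
    """Return (precision, recall, F-measure) for one assertion class."""
    pairs = list(zip(gold, predicted))
    tp = sum(1 for g, p in pairs if g == label and p == label)
    fp = sum(1 for g, p in pairs if g != label and p == label)
    fn = sum(1 for g, p in pairs if g == label and p != label)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    return precision, recall, 2 * precision * recall / (precision + recall)


# Invented example: the mention "chest pain" occupies tokens[4:6].
tokens = "The patient denies any chest pain at this time .".split()
features = window_features(tokens, 4, 6)

# Invented gold and predicted labels for four mentions.
gold = ["absent", "present", "present", "uncertain"]
predicted = ["absent", "present", "absent", "uncertain"]
absent_scores = per_class_scores(gold, predicted, "absent")
```

In this sketch the word "denies" falls inside the left window of "chest pain", which is the kind of lexical cue a rule-based or statistical classifier could use to assign the absent class.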