Description of a Rule-based System for the i2b2 Challenge in Natural Language Processing for Clinical Data
Open Access
- 1 July 2009
- journal article
- Published by Oxford University Press (OUP) in Journal of the American Medical Informatics Association
- Vol. 16 (4) , 571-575
- https://doi.org/10.1197/jamia.M3083
Abstract
The Obesity Challenge, sponsored by Informatics for Integrating Biology and the Bedside (i2b2), a National Center for Biomedical Computing, asked participants to build software systems that could “read” a patient's clinical discharge summary and replicate the judgments of physicians in evaluating presence or absence of obesity and 15 comorbidities. The authors describe their methodology and discuss the results of applying Lockheed Martin's rule-based natural language processing (NLP) capability, ClinREAD. We tailored ClinREAD with medical domain expertise to create assigned default judgments based on the most probable results as defined in the ground truth. It then used rules to collect evidence similar to the evidence that the human judges likely relied upon, and applied a logic module to weigh the strength of all evidence collected to arrive at final judgments. The Challenge results suggest that rule-based systems guided by human medical expertise are capable of solving complex problems in machine processing of medical text.Keywords
This publication has 8 references indexed in Scilit:
- Recognizing Obesity and Comorbidities in Sparse DataJournal of the American Medical Informatics Association, 2009
- NLP-based identification of pneumonia cases from free-text radiological reports.2008
- Identifying Smokers with a Medical Extraction SystemJournal of the American Medical Informatics Association, 2008
- Medical i2b2 NLP Smoking Challenge: The A-Life System Architecture and MethodologyJournal of the American Medical Informatics Association, 2008
- Mayo Clinic NLP System for Patient Smoking Status IdentificationJournal of the American Medical Informatics Association, 2008
- Identifying Patient Smoking Status from Medical Discharge RecordsJournal of the American Medical Informatics Association, 2008
- Epidemiology of angina pectoris: Role of natural language processing of the medical recordAmerican Heart Journal, 2007
- A Simple Algorithm for Identifying Negated Findings and Diseases in Discharge SummariesJournal of Biomedical Informatics, 2001