Creating a text classifier to detect radiology reports describing mediastinal findings associated with inhalational anthrax and other disorders
Open Access
- 31 October 2003
- journal article
- research article
- Published by Oxford University Press (OUP) in Journal of the American Medical Informatics Association
- Vol. 10 (5) , 494-503
- https://doi.org/10.1197/jamia.m1330
Abstract
Objective: The aim of this study was to create a classifier for automatic detection of chest radiograph reports consistent with the mediastinal findings of inhalational anthrax. Design: The authors used the Identify Patient Sets (IPS) system to create a key word classifier for detecting reports describing mediastinal findings consistent with anthrax and compared their performances on a test set of 79,032 chest radiograph reports. Measurements: Area under the ROC curve was the main outcome measure of the IPS classifier. Sensitivity and specificity of an initial IPS model were calculated based on an existing key word search and were compared against a Boolean version of the IPS classifier. Results: The IPS classifier received an area under the ROC curve of 0.677 (90% CI = 0.628 to 0.772) with a specificity of 0.99 and maximum sensitivity of 0.35. The initial IPS model attained a specificity of 1.0 and a sensitivity of 0.04. Conclusion: The IPS system is a useful tool for helping domain experts create a statistical key word classifier for textual reports that is a potentially useful component in surveillance of radiographic findings suspicious for anthrax.Keywords
This publication has 34 references indexed in Scilit:
- Detection of Pediatric Respiratory and Diarrheal Outbreaks from Sales of Over-the-counter Electrolyte ProductsJournal of the American Medical Informatics Association, 2003
- Bioterrorism-Related Inhalational Anthrax: The First 10 Cases Reported in the United StatesEmerging Infectious Diseases, 2001
- A Comparison of Classification Algorithms to Automatically Identify Chest X-Ray Reports That Support PneumoniaJournal of Biomedical Informatics, 2001
- The Emerging Science of Very Early Detection of Disease OutbreaksJournal of Public Health Management & Practice, 2001
- Coding Neuroradiology Reports for the Northern Manhattan Stroke Study: A Comparison of Natural Language Processing and Manual ReviewComputers and Biomedical Research, 2000
- AnthraxNew England Journal of Medicine, 1999
- Text-learning and related intelligent agents: a surveyIEEE Intelligent Systems and their Applications, 1999
- The Sverdlovsk Anthrax Outbreak of 1979Science, 1994
- Sensitivity, Specificity and predictive Values of Health Service Based Indicators for the Surveillance of Influenza A EpidemicsInternational Journal of Epidemiology, 1994
- Basic principles of ROC analysisSeminars in Nuclear Medicine, 1978