Use and Misuse of the Receiver Operating Characteristic Curve in Risk Prediction
Top Cited Papers
- 20 February 2007
- journal article
- review article
- Published by Wolters Kluwer Health in Circulation
- Vol. 115 (7) , 928-935
- https://doi.org/10.1161/circulationaha.106.672402
Abstract
The c statistic, or area under the receiver operating characteristic (ROC) curve, achieved popularity in diagnostic testing, in which the test characteristics of sensitivity and specificity are relevant to discriminating diseased versus nondiseased patients. The c statistic, however, may not be optimal in assessing models that predict future risk or stratify individuals into risk categories. In this setting, calibration is as important to the accurate assessment of risk. For example, a biomarker with an odds ratio of 3 may have little effect on the c statistic, yet an increased level could shift estimated 10-year cardiovascular risk for an individual patient from 8% to 24%, which would lead to different treatment recommendations under current Adult Treatment Panel III guidelines. Accepted risk factors such as lipids, hypertension, and smoking have only marginal impact on the c statistic individually yet lead to more accurate reclassification of large proportions of patients into higher-risk or lower-risk categories. Perfectly calibrated models for complex disease can, in fact, only achieve values for the c statistic well below the theoretical maximum of 1. Use of the c statistic for model selection could thus naively eliminate established risk factors from cardiovascular risk prediction scores. As novel risk factors are discovered, sole reliance on the c statistic to evaluate their utility as risk predictors thus seems ill-advised.Keywords
This publication has 28 references indexed in Scilit:
- Comparative Impact of Multiple Biomarkers and N-Terminal Pro-Brain Natriuretic Peptide in the Context of Conventional Risk Factors for the Prediction of Recurrent Cardiovascular Events in the Heart Outcomes Prevention Evaluation (HOPE) StudyCirculation, 2006
- An Assessment of Incremental Coronary Risk Prediction Using C-Reactive Protein and Other Novel Risk MarkersArchives of internal medicine (1960), 2006
- On criteria for evaluating models of absolute riskBiostatistics, 2005
- A Randomized Trial of Low-Dose Aspirin in the Primary Prevention of Cardiovascular Disease in WomenNew England Journal of Medicine, 2005
- Should Age and Time Be Eliminated From Cardiovascular Risk Prediction Models?Circulation, 2005
- Limitations of the Odds Ratio in Gauging the Performance of a Diagnostic, Prognostic, or Screening MarkerAmerican Journal of Epidemiology, 2004
- Executive Summary of the Third Report of the National Cholesterol Education Program (NCEP) Expert Panel on Detection, Evaluation, and Treatment of High Blood Cholesterol in Adults (Adult Treatment Panel III)JAMA, 2001
- The (In)Validity of sensitivity and specificityStatistics in Medicine, 2000
- What price perfection? Calibration and discrimination of clinical prediction modelsJournal of Clinical Epidemiology, 1992
- Sick Individuals and Sick PopulationsInternational Journal of Epidemiology, 1985