Performance of reclassification statistics in comparing risk prediction models
- 3 February 2011
- journal article
- research article
- Published by Wiley in Biometrical Journal
- Vol. 53 (2) , 237-258
- https://doi.org/10.1002/bimj.201000078
Abstract
Concerns have been raised about the use of traditional measures of model fit in evaluating risk prediction models for clinical use, and reclassification tables have been suggested as an alternative means of assessing the clinical utility of a model. Several measures based on the table have been proposed, including the reclassification calibration (RC) statistic, the net reclassification improvement (NRI), and the integrated discrimination improvement (IDI), but the performance of these in practical settings has not been fully examined. We used simulations to estimate the type I error and power for these statistics in a number of scenarios, as well as the impact of the number and type of categories, when adding a new marker to an established or reference model. The type I error was found to be reasonable in most settings, and power was highest for the IDI, which was similar to the test of association. The relative power of the RC statistic, a test of calibration, and the NRI, a test of discrimination, varied depending on the model assumptions. These tools provide unique but complementary information.Keywords
This publication has 41 references indexed in Scilit:
- Assessment of Clinical Validity of a Breast Cancer Risk Model Combining Genetic and Clinical InformationJNCI Journal of the National Cancer Institute, 2010
- Evaluating health risk modelsStatistics in Medicine, 2010
- Comment: Measures to Summarize and Compare the Predictive Capacity of MarkersThe International Journal of Biostatistics, 2010
- Assessing the Performance of Prediction ModelsEpidemiology, 2010
- A Parametric ROC Model‐Based Approach for Evaluating the Predictiveness of Continuous Markers in Case–Control StudiesBiometrics, 2009
- Criteria for Evaluation of Novel Markers of Cardiovascular RiskCirculation, 2009
- Using Relative Utility Curves to Evaluate Risk PredictionJournal of the Royal Statistical Society Series A: Statistics in Society, 2009
- Measures to Summarize and Compare the Predictive Capacity of MarkersThe International Journal of Biostatistics, 2009
- Integrating the Predictiveness of a Marker with Its Performance as a ClassifierAmerican Journal of Epidemiology, 2007
- Decision Curve Analysis: A Novel Method for Evaluating Prediction ModelsMedical Decision Making, 2006