Sample size considerations for the external validation of a multivariable prognostic model: a resampling study
Open Access
- 9 November 2015
- journal article
- research article
- Published by Wiley in Statistics in Medicine
- Vol. 35 (2) , 214-226
- https://doi.org/10.1002/sim.6787
Abstract
After developing a prognostic model, it is essential to evaluate the performance of the model in samples independent from those used to develop the model, which is often referred to as external validation. However, despite its importance, very little is known about the sample size requirements for conducting an external validation. Using a large real data set and resampling methods, we investigate the impact of sample size on the performance of six published prognostic models. Focussing on unbiased and precise estimation of performance measures (e.g. the c‐index, D statistic and calibration), we provide guidance on sample size for investigators designing an external validation study. Our study suggests that externally validating a prognostic model requires a minimum of 100 events and ideally 200 (or more) events. © 2015 The Authors. Statistics in Medicine Published by John Wiley & Sons Ltd.
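The resampling idea in the abstract can be illustrated with a small sketch. This is not the authors' procedure or data: it assumes a synthetic population, a binary outcome instead of survival times (so the c-index reduces to the area under the ROC curve), and a model whose linear predictor is already known. The function names `simulate_population`, `c_index` and `resample_c_index` are hypothetical. The sketch repeatedly draws validation samples containing a fixed number of events and reports how much the estimated c-index varies across draws.

```python
import math
import random
import statistics


def simulate_population(n, rng):
    """Synthetic stand-in for a large data set: (linear predictor, outcome) pairs."""
    population = []
    for _ in range(n):
        lp = rng.gauss(0.0, 1.0)                     # assumed linear predictor
        p = 1.0 / (1.0 + math.exp(-(lp - 2.0)))      # assumed true event probability
        population.append((lp, 1 if rng.random() < p else 0))
    return population


def c_index(scores, outcomes):
    """Concordance for a binary outcome via average ranks (ties count one half)."""
    pairs = sorted(zip(scores, outcomes))
    n = len(pairs)
    ranks = [0.0] * n
    i = 0
    while i < n:
        j = i
        while j + 1 < n and pairs[j + 1][0] == pairs[i][0]:
            j += 1
        for k in range(i, j + 1):
            ranks[k] = (i + j) / 2 + 1               # average rank within a tie block
        i = j + 1
    n_events = sum(y for _, y in pairs)
    n_nonevents = n - n_events
    rank_sum = sum(r for r, (_, y) in zip(ranks, pairs) if y == 1)
    return (rank_sum - n_events * (n_events + 1) / 2) / (n_events * n_nonevents)


def resample_c_index(population, n_events, n_reps, rng):
    """Draw n_reps samples with n_events events each, keeping the population's
    ratio of non-events to events, and return the c-index from each sample."""
    events = [row for row in population if row[1] == 1]
    nonevents = [row for row in population if row[1] == 0]
    n_non = round(n_events * len(nonevents) / len(events))
    estimates = []
    for _ in range(n_reps):
        sample = rng.sample(events, n_events) + rng.sample(nonevents, n_non)
        estimates.append(c_index([s for s, _ in sample], [y for _, y in sample]))
    return estimates


rng = random.Random(2015)
population = simulate_population(20000, rng)
for n_events in (25, 50, 100, 200):
    est = resample_c_index(population, n_events, 200, rng)
    print(n_events, round(statistics.mean(est), 3), round(statistics.stdev(est), 3))
```

Under these assumptions, the standard deviation printed in the last column shrinks as the number of events grows, which is the kind of precision argument behind an events-based sample size recommendation. The numbers from this toy setup are not comparable with the paper's results.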
Funding Information
- Medical Research Council (G1100513)
- Medical Research Council Prognosis Research Strategy (PROGRESS) Partnership (G0902393/99558)