Combining estimates of interest in prognostic modelling studies after multiple imputation: current practice and guidelines
Top Cited Papers
Open Access
- 28 July 2009
- journal article
- practice guideline
- Published by Springer Nature in BMC Medical Research Methodology
- Vol. 9 (1) , 57
- https://doi.org/10.1186/1471-2288-9-57
Abstract
Multiple imputation (MI) provides an effective approach to handle missing covariate data within prognostic modelling studies, as it can properly account for the missing data uncertainty. The multiply imputed datasets are each analysed using standard prognostic modelling techniques to obtain the estimates of interest. The estimates from each imputed dataset are then combined into one overall estimate and variance, incorporating both the within and between imputation variability. Rubin's rules for combining these multiply imputed estimates are based on asymptotic theory. The resulting combined estimates may be more accurate if the posterior distribution of the population parameter of interest is better approximated by the normal distribution. However, the normality assumption may not be appropriate for all the parameters of interest when analysing prognostic modelling studies, such as predicted survival probabilities and model performance measures.Keywords
This publication has 36 references indexed in Scilit:
- Association of physical activity with cancer incidence, mortality, and survival: a population-based study of menBritish Journal of Cancer, 2008
- High HIV incidence in a community with high HIV prevalence in rural South Africa: findings from a prospective population-based studyAIDS, 2008
- Variable selection under multiple imputation using the bootstrap in a prognostic studyBMC Medical Research Methodology, 2007
- How Many Imputations are Really Needed? Some Practical Clarifications of Multiple Imputation TheoryPrevention Science, 2007
- Explained randomness in proportional hazards modelsStatistics in Medicine, 2004
- A new measure of prognostic separation in survival dataStatistics in Medicine, 2004
- Applied Survival Analysis: Regression Modeling of Time to Event DataJournal of the American Statistical Association, 2000
- CD4+ lymphocytes and tuberculin skin test as survival predictors in pulmonary tuberculosis HIV-infected patientsInternational Journal of Epidemiology, 1998
- Multiple imputation in health‐are databases: An overview and some applicationsStatistics in Medicine, 1991
- Multiple Imputation for Interval Estimation from Simple Random Samples with Ignorable NonresponseJournal of the American Statistical Association, 1986