Multiple imputation: review of theory, implementation and software
Top Cited Papers
- 29 January 2007
- journal article
- review article
- Published by Wiley in Statistics in Medicine
- Vol. 26 (16) , 3057-3077
- https://doi.org/10.1002/sim.2787
Abstract
Missing data is a common complication in data analysis. In many medical settings missing data can cause difficulties in estimation, precision and inference. Multiple imputation (MI) (Multiple Imputation for Nonresponse in Surveys. Wiley: New York, 1987) is a simulation‐based approach to deal with incomplete data. Although there are many different methods to deal with incomplete data, MI has become one of the leading methods. Since the late 1980s we observed a constant increase in the use and publication of MI‐related research. This tutorial does not attempt to cover all the material concerning MI, but rather provides an overview and combines together the theory behind MI, the implementation of MI, and discusses increasing possibilities of the use of MI using commercial and free software. We illustrate some of the major points using an example from an Alzheimer disease (AD) study. In this AD study, while clinical data are available for all subjects, postmortem data are only available for the subset of those who died and underwent an autopsy. Analysis of incomplete data requires making unverifiable assumptions. These assumptions are discussed in detail in the text. Relevant S‐Plus code is provided. Copyright © 2007 John Wiley & Sons, Ltd.Keywords
This publication has 43 references indexed in Scilit:
- Using Data Augmentation to Obtain Standard Errors and Conduct Hypothesis Tests in Latent Class and Latent Transition Analysis.Psychological Methods, 2005
- Finite sample properties of multiple imputation estimatorsThe Annals of Statistics, 2004
- On the performance of random‐coefficient pattern‐mixture models for non‐ignorable drop‐outStatistics in Medicine, 2003
- Multiple Imputation after 18+ YearsJournal of the American Statistical Association, 1996
- Pattern-Mixture Models for Multivariate Incomplete DataJournal of the American Statistical Association, 1993
- Asymptotic Results for Multiple ImputationThe Annals of Statistics, 1988
- The Calculation of Posterior Distributions by Data AugmentationJournal of the American Statistical Association, 1987
- The Calculation of Posterior Distributions by Data Augmentation: Comment: A Noniterative Sampling/Importance Resampling Alternative to the Data Augmentation Algorithm for Creating a Few Imputations When Fractions of Missing Information Are Modest: The SIR AlgorithmJournal of the American Statistical Association, 1987
- Multiple Imputation for Interval Estimation from Simple Random Samples with Ignorable NonresponseJournal of the American Statistical Association, 1986
- Tobit models: A surveyJournal of Econometrics, 1984