Analysis of incomplete longitudinal binary data using multiple imputation
- 11 October 2005
- journal article
- research article
- Published by Wiley in Statistics in Medicine
- Vol. 25 (12) , 2107-2124
- https://doi.org/10.1002/sim.2343
Abstract
We propose a propensity score‐based multiple imputation (MI) method to tackle incomplete data resulting from drop‐outs and/or intermittent skipped visits in longitudinal clinical trials with binary responses. The estimation and inferential properties of the proposed method are contrasted via simulation with those of the commonly used complete‐case (CC) and generalized estimating equations (GEE) methods. Three key results are noted. First, if data are missing completely at random, MI can be notably more efficient than the CC and GEE methods. Second, with small samples, GEE often fails due to ‘convergence problems’, but MI is free of that problem. Finally, if the data are missing at random, while the CC and GEE methods yield results with moderate to large bias, MI generally yields results with negligible bias. A numerical example with real data is provided for illustration. Copyright © 2005 John Wiley & Sons, Ltd.
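The abstract does not spell out the imputation algorithm, so the following is only a minimal sketch of one common way to do propensity score-based MI for a binary response, not the authors' method. It assumes a single follow-up visit and one baseline covariate, and it combines three standard ingredients: a logistic model for the probability of being observed, stratification on the fitted score, and an approximate Bayesian bootstrap within each stratum, pooled with Rubin's rules. All function names, parameter values, and the simulated missing-at-random data are hypothetical and chosen for illustration.

```python
import math
import random


def expit(z):
    """Logistic function, clamped to avoid overflow in math.exp."""
    z = max(-30.0, min(30.0, z))
    return 1.0 / (1.0 + math.exp(-z))


def fit_logistic(x, r, iters=25):
    """Newton-Raphson fit of P(r = 1 | x) = expit(a + b * x), one covariate."""
    a = b = 0.0
    for _ in range(iters):
        p = [expit(a + b * xi) for xi in x]
        w = [pi * (1.0 - pi) for pi in p]
        g0 = sum(ri - pi for ri, pi in zip(r, p))
        g1 = sum((ri - pi) * xi for ri, pi, xi in zip(r, p, x))
        h00 = sum(w)
        h01 = sum(wi * xi for wi, xi in zip(w, x))
        h11 = sum(wi * xi * xi for wi, xi in zip(w, x))
        det = h00 * h11 - h01 * h01
        if abs(det) < 1e-12:
            break
        a += (h11 * g0 - h01 * g1) / det
        b += (h00 * g1 - h01 * g0) / det
    return a, b


def impute_once(x, y, rng, n_strata=5):
    """Fill missing y (None) by an approximate Bayesian bootstrap within
    strata of the estimated propensity to be observed."""
    r = [0 if yi is None else 1 for yi in y]
    a, b = fit_logistic(x, r)
    order = sorted(range(len(y)), key=lambda i: a + b * x[i])
    size = math.ceil(len(y) / n_strata)
    all_donors = [yi for yi in y if yi is not None]
    completed = list(y)
    for start in range(0, len(order), size):
        stratum = order[start:start + size]
        donors = [y[i] for i in stratum if y[i] is not None] or all_donors
        # Step 1: resample the donor pool; step 2: draw imputations from it.
        pool = rng.choices(donors, k=len(donors))
        for i in stratum:
            if y[i] is None:
                completed[i] = rng.choice(pool)
    return completed


def pool_rubin(estimates, variances):
    """Rubin's rules: pooled estimate and total variance over m imputations."""
    m = len(estimates)
    qbar = sum(estimates) / m
    within = sum(variances) / m
    between = sum((q - qbar) ** 2 for q in estimates) / (m - 1)
    return qbar, within + (1.0 + 1.0 / m) * between


# Simulated MAR data (assumption): missingness depends only on observed x.
rng = random.Random(2005)
n = 2000
x = [rng.gauss(0.0, 1.0) for _ in range(n)]
y_full = [1 if rng.random() < expit(xi) else 0 for xi in x]
y_obs = [yi if rng.random() < expit(1.0 - 1.5 * xi) else None
         for xi, yi in zip(x, y_full)]

estimates, variances = [], []
for _ in range(10):
    completed = impute_once(x, y_obs, rng)
    p_hat = sum(completed) / n
    estimates.append(p_hat)
    variances.append(p_hat * (1.0 - p_hat) / n)

mi_estimate, mi_variance = pool_rubin(estimates, variances)
observed = [yi for yi in y_obs if yi is not None]
cc_estimate = sum(observed) / len(observed)
full_mean = sum(y_full) / n
print(f"full data: {full_mean:.3f}  CC: {cc_estimate:.3f}  MI: {mi_estimate:.3f}")
```

In this toy setting subjects with larger `x` are both more likely to respond positively and more likely to be missing, so the complete-case proportion is pulled away from the full-data value, while imputing within propensity strata borrows responses from subjects with a similar chance of being observed. A longitudinal version would repeat the imputation visit by visit and would need a rule for intermittent missingness, which this sketch does not attempt.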