Gaussianization‐based quasi‐imputation and expansion strategies for incomplete correlated binary responses
- 4 April 2006
- journal article
- research article
- Published by Wiley in Statistics in Medicine
- Vol. 26 (4) , 782-799
- https://doi.org/10.1002/sim.2560
Abstract
New quasi‐imputation and expansion strategies for correlated binary responses are proposed by borrowing ideas from random number generation. The core idea is to convert correlated binary outcomes to multivariate normal outcomes in a sensible way so that re‐conversion to the binary scale, after performing multiple imputation, yields the original specified marginal expectations and correlations. This conversion process ensures that the correlations are transformed reasonably which in turn allows us to take advantage of well‐developed imputation techniques for Gaussian outcomes. We use the phrase ‘quasi’ because the original observations are not guaranteed to be preserved. We argue that if the inferential goals are well‐defined, it is not necessary to strictly adhere to the established definition of multiple imputation. Our expansion scheme employs a similar strategy where imputation is used as an intermediate step. It leads to proportionally inflated observed patterns, forcing the data set to a complete rectangular format. The plausibility of the proposed methodology is examined by applying it to a wide range of simulated data sets that reflect alternative assumptions on complete data populations and missing‐data mechanisms. We also present an application using a data set from obesity research. We conclude that the proposed method is a promising tool for handling incomplete longitudinal or clustered binary outcomes under ignorable non‐response mechanisms. Copyright © 2006 John Wiley & Sons, Ltd.Keywords
This publication has 30 references indexed in Scilit:
- Multiple imputation under Bayesianly smoothed pattern-mixture models for non-ignorable drop-outStatistics in Medicine, 2005
- Simulation driven inferences for multiply imputed longitudinal datasets*Statistica Neerlandica, 2004
- On the performance of random‐coefficient pattern‐mixture models for non‐ignorable drop‐outStatistics in Medicine, 2003
- Multiple Imputation after 18+ YearsJournal of the American Statistical Association, 1996
- Illustration of Bayesian Inference in Normal Data Models Using Gibbs SamplingJournal of the American Statistical Association, 1990
- The Calculation of Posterior Distributions by Data AugmentationJournal of the American Statistical Association, 1987
- Longitudinal data analysis using generalized linear modelsBiometrika, 1986
- The central role of the propensity score in observational studies for causal effectsBiometrika, 1983
- Formalizing Subjective Notions about the Effect of Nonrespondents in Sample SurveysJournal of the American Statistical Association, 1977