Demystifying Double Robustness: A Comparison of Alternative Strategies for Estimating a Population Mean from Incomplete Data
Top Cited Papers
Open Access
- 1 November 2007
- journal article
- Published by Institute of Mathematical Statistics in Statistical Science
- Vol. 22 (4) , 523-539
- https://doi.org/10.1214/07-sts227
Abstract
When outcomes are missing for reasons beyond an investigator’s control, there are two different ways to adjust a parameter estimate for covariates that may be related both to the outcome and to missingness. One approach is to model the relationships between the covariates and the outcome and use those relationships to predict the missing values. Another is to model the probabilities of missingness given the covariates and incorporate them into a weighted or stratified estimate. Doubly robust (DR) procedures apply both types of model simultaneously and produce a consistent estimate of the parameter if either of the two models has been correctly specified. In this article, we show that DR estimates can be constructed in many ways. We compare the performance of various DR and non-DR estimates of a population mean in a simulated example where both models are incorrect but neither is grossly misspecified. Methods that use inverse-probabilities as weights, whether they are DR or not, are sensitive to misspecification of the propensity model when some estimated propensities are small. Many DR methods perform better than simple inverse-probability weighting. None of the DR methods we tried, however, improved upon the performance of simple regression-based prediction of the missing values. This study does not represent every missing-data problem that will arise in practice. But it does demonstrate that, in at least some settings, two wrong models are not better than one.Keywords
All Related Versions
This publication has 42 references indexed in Scilit:
- Causal Inference Using Potential OutcomesJournal of the American Statistical Association, 2005
- Semiparametric Efficiency in Multivariate Regression Models with Missing DataJournal of the American Statistical Association, 1995
- Analysis of Semiparametric Regression Models for Repeated Outcomes in the Presence of Missing DataJournal of the American Statistical Association, 1995
- Estimation of Regression Coefficients When Some Regressors are not Always ObservedJournal of the American Statistical Association, 1994
- Bayesian Analysis of Binary and Polychotomous Response DataJournal of the American Statistical Association, 1993
- Statistics and Causal InferenceJournal of the American Statistical Association, 1986
- Constructing a Control Group Using Multivariate Matched Sampling Methods That Incorporate the Propensity ScoreThe American Statistician, 1985
- The central role of the propensity score in observational studies for causal effectsBiometrika, 1983
- Characterizing the Estimation of Parameters in Incomplete-Data ProblemsJournal of the American Statistical Association, 1974
- A Generalization of Sampling Without Replacement from a Finite UniverseJournal of the American Statistical Association, 1952