Review of inverse probability weighting for dealing with missing data
Top Cited Papers
- 10 January 2011
- journal article
- research article
- Published by SAGE Publications in Statistical Methods in Medical Research
- Vol. 22 (3) , 278-295
- https://doi.org/10.1177/0962280210395740
Abstract
The simplest approach to dealing with missing data is to restrict the analysis to complete cases, i.e. individuals with no missing values. This can induce bias, however. Inverse probability weighting (IPW) is a commonly used method to correct this bias. It is also used to adjust for unequal sampling fractions in sample surveys. This article is a review of the use of IPW in epidemiological research. We describe how the bias in the complete-case analysis arises and how IPW can remove it. IPW is compared with multiple imputation (MI) and we explain why, despite MI generally being more efficient, IPW may sometimes be preferred. We discuss the choice of missingness model and methods such as weight truncation, weight stabilisation and augmented IPW. The use of IPW is illustrated on data from the 1958 British Birth Cohort.Keywords
This publication has 57 references indexed in Scilit:
- Improving efficiency and robustness of the doubly robust estimator for a population mean with incomplete dataBiometrika, 2009
- Handling missing data by deleting completely observed recordsJournal of Statistical Planning and Inference, 2009
- Constructing Inverse Probability Weights for Marginal Structural ModelsAmerican Journal of Epidemiology, 2008
- Psychosocial work characteristics and anxiety and depressive disorders in midlife: the effects of prior psychological distressOccupational and Environmental Medicine, 2008
- How many mailouts? Could attempts to increase the response rate in the Iraq war cohort study be counterproductive?BMC Medical Research Methodology, 2007
- Demystifying Double Robustness: A Comparison of Alternative Strategies for Estimating a Population Mean from Incomplete DataStatistical Science, 2007
- Much Ado About NothingThe American Statistician, 2007
- Variable Selection for Propensity Score ModelsAmerican Journal of Epidemiology, 2006
- Semiparametric Regression for Repeated Outcomes with Nonignorable NonresponseJournal of the American Statistical Association, 1998
- Analysis of Semiparametric Regression Models for Repeated Outcomes in the Presence of Missing DataJournal of the American Statistical Association, 1995