Multiple Imputation for Model Checking: Completed‐Data Plots with Missing and Latent Data
- 28 February 2005
- journal article
- Published by Oxford University Press (OUP) in Biometrics
- Vol. 61 (1) , 74-85
- https://doi.org/10.1111/j.0006-341x.2005.031010.x
Abstract
SummaryIn problems with missing or latent data, a standard approach is to first impute the unobserved data, then perform all statistical analyses on thecompleteddataset—corresponding to the observed data and imputed unobserved data—using standard procedures for complete‐data inference. Here, we extend this approach to model checking by demonstrating the advantages of the use of completed‐data model diagnostics on imputed completed datasets. The approach is set in the theoretical framework of Bayesian posterior predictive checks (but, as with missing‐data imputation, our methods of missing‐data model checking can also be interpreted as “predictive inference” in a non‐Bayesian context). We consider the graphical diagnostics within this framework. Advantages of the completed‐data approach include: (1) One can often check model fit in terms of quantities that are of key substantive interest in a natural way, which is not always possible using observed data alone. (2) In problems with missing data, checks may be devised that do not require to model the missingness or inclusion mechanism; the latter is useful for the analysis of ignorable but unknown data collection mechanisms, such as are often assumed in the analysis of sample surveys and observational studies. (3) In many problems with latent data, it is possible to check qualitative features of the model (for example, independence of two variables) that can be naturally formalized with the help of the latent data. We illustrate with several applied examples.Keywords
This publication has 32 references indexed in Scilit:
- Exploratory Data Analysis for Complex ModelsJournal of Computational and Graphical Statistics, 2004
- Not Asked and Not Answered: Multiple Imputation for Multiple SurveysJournal of the American Statistical Association, 1998
- Analysis of Nonrandomly Censored Ordered Categorical Longitudinal Data from Analgesic Trials: CommentJournal of the American Statistical Association, 1997
- Analysis of Nonrandomly Censored Ordered Categorical Longitudinal Data from Analgesic TrialsJournal of the American Statistical Association, 1997
- Multiple Imputation after 18+ YearsJournal of the American Statistical Association, 1996
- Bayes FactorsJournal of the American Statistical Association, 1995
- Inference from Coarse Data via Multiple Imputation with Application to Age HeapingJournal of the American Statistical Association, 1990
- The Calculation of Posterior Distributions by Data AugmentationJournal of the American Statistical Association, 1987
- Estimating a Population of Parameter Values Using Bayes and Empirical Bayes MethodsJournal of the American Statistical Association, 1984
- A Predictive Approach to Model SelectionJournal of the American Statistical Association, 1979