Regression with MissingX's: A Review
- 1 December 1992
- journal article
- review article
- Published by Taylor & Francis in Journal of the American Statistical Association
- Vol. 87 (420) , 1227-1237
- https://doi.org/10.1080/01621459.1992.10476282
Abstract
The literature of regression analysis with missing values of the independent variables is reviewed. Six classes of procedures are distinguished: complete case analysis, available case methods, least squares on imputed data, maximum likelihood, Bayesian methods, and multiple imputation. Methods are compared and illustrated when missing data are confined to one independent variable, and extensions to more general patterns are indicated. Attention is paid to the performance of methods when the missing data are not missing completely at random. Least squares methods that fill in missing X's using only data on the X's are contrasted with likelihood-based methods that use data on the X's and Y. The latter approach is preferred and provides methods for elaboration of the basic normal linear regression model. It is suggested that more widely distributed software is needed that advances beyond complete-case analysis, available-case analysis, and naive imputation methods. Bayesian simulation methods and multiple imputation are reviewed; these provide fruitful avenues for future research.Keywords
This publication has 56 references indexed in Scilit:
- Estimation of parameters and missing values under a regression model with non‐normally distributed and non‐randomly incomplete dataStatistics in Medicine, 1989
- Tobit models: A surveyJournal of Econometrics, 1984
- Small-Sample Properties of Estimators of Regression Coefficients Given a Common Pattern of Missing DataThe Review of Economic Studies, 1983
- Missing Data: A Review of the LiteraturePublished by Elsevier ,1983
- Missing value problems in multiple linear regression with two independent variablesCommunications in Statistics - Theory and Methods, 1982
- The use of incomplete observations in multiple regression analysisJournal of Econometrics, 1973
- Missing Observations in Multivariate Statistics III: Large Sample Analysis of Simple Linear RegressionJournal of the American Statistical Association, 1969
- Missing Observations in Multivariate Statistics II. Point Estimation in Simple Linear RegressionJournal of the American Statistical Association, 1967
- Missing Observations in Multivariate Statistics I. Review of the LiteratureJournal of the American Statistical Association, 1966
- Maximum Likelihood Estimates for a Multivariate Normal Distribution when Some Observations are MissingJournal of the American Statistical Association, 1957