Nonrandomly Missing Data in Multiple Regression: An Empirical Comparison of Common Missing-Data Treatments
- 1 September 1994
- journal article
- Published by SAGE Publications in Educational and Psychological Measurement
- Vol. 54 (3) , 573-593
- https://doi.org/10.1177/0013164494054003001
Abstract
This research is an investigation of the effects of nonrandomly missing data in two-predictor regression analyses and the differences in the effectiveness of five common treatments of missing data on estimates of R2 and of each of the two standardized regression weights. Bootstrap samples of 50, 100, and 200 were drawn from three sets of actual field data. Nonrandomly missing data were created within each sample, and the parameter estimates were compared with those obtained from the same samples with no missing data. The results indicated that three imputation procedures (mean substitution, simple and multiple regression imputation) produced biased estimates of R2 and both regression weights. Two deletion procedures (listwise and pairwise) provided accurate parameter estimates with up to 30% of the data missing.Keywords
This publication has 12 references indexed in Scilit:
- An Introduction to Bootstrap MethodsSociological Methods & Research, 1989
- A Comparison of Methods for Treating Incomplete Data in Selection ResearchEducational and Psychological Measurement, 1987
- Bootstrap Methods for Standard Errors, Confidence Intervals, and Other Measures of Statistical AccuracyStatistical Science, 1986
- Missing data estimators in the general linear model: an evaluation of simulated data as an experimental designCommunications in Statistics - Simulation and Computation, 1985
- Missing value problems in multiple linear regression with two independent variablesCommunications in Statistics - Theory and Methods, 1982
- The Treatment of Missing Data in Multivariate AnalysisSociological Methods & Research, 1977
- Some Simple Procedures for Handling Missing Data in Multivariate AnalysisPsychometrika, 1976
- A Proposal for Handling Missing DataPsychometrika, 1975
- The Estimation of Variance-Covariance and Correlation Matrices from Incomplete DataPsychometrika, 1970
- Comparison of Three Methods of Handling Missing ObservationsPsychological Reports, 1968