Sampling Weights and Regression Analysis
- 1 November 1994
- journal article
- research article
- Published by SAGE Publications in Sociological Methods & Research
- Vol. 23 (2), 230-257
- https://doi.org/10.1177/0049124194023002004
Abstract
Most major population surveys used by social scientists are based on complex sampling designs where sampling units have different probabilities of being selected. Although sampling weights must generally be used to derive unbiased estimates of univariate population characteristics, the decision about their use in regression analysis is more complicated. Where sampling weights are solely a function of independent variables included in the model, unweighted OLS estimates are preferred because they are unbiased, consistent, and have smaller standard errors than weighted OLS estimates. Where sampling weights are a function of the dependent variable (and thus of the error term), we recommend first attempting to respecify the model so that they are solely a function of the independent variables. If this can be accomplished, then unweighted OLS is again preferred. If the model cannot be respecified, then estimation of the model using sampling weights may be appropriate. In this case, however, the formula used by most computer programs for calculating standard errors will be incorrect. We recommend using the White heteroskedastic consistent estimator for the standard errors.

This publication has 9 references indexed in Scilit:
- Models for Sample Selection Bias. Annual Review of Sociology, 1992
- A Model-Based Look at Linear Regression with Survey Data. The American Statistician, 1991
- Using Sample Survey Weights in Multiple Regression Analyses of Stratified Samples. Journal of the American Statistical Association, 1983
- An Introduction to Sample Selection Bias in Sociological Data. American Sociological Review, 1983
- Maximum Likelihood Estimation of Misspecified Models. Econometrica, 1982
- A Heteroskedasticity-Consistent Covariance Matrix Estimator and a Direct Test for Heteroskedasticity. Econometrica, 1980
- Regression Analysis of Data from Complex Surveys. Journal of the Royal Statistical Society. Series A (General), 1980
- The Estimation of Choice Probabilities from Choice Based Samples. Econometrica, 1977
- Social Experimentation, Truncated Distributions, and Efficient Estimation. Econometrica, 1977
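
As an illustration of the comparison described in the abstract, the following is a minimal sketch, not the authors' code. It fits unweighted and weighted OLS on simulated data and computes both the conventional standard errors and White-type (sandwich) standard errors. Everything here is an assumption made for illustration: the function name `fit_ols`, the simulated population, the selection probabilities, and the choice of the basic HC0 form of the White estimator (the abstract does not say which variant is meant). In the simulation the weights depend only on the independent variable `x`, which is the case where the abstract prefers unweighted OLS.

```python
import numpy as np

def fit_ols(X, y, w=None):
    """Fit (weighted) least squares; return (beta, conventional_se, white_se).

    Hypothetical helper for illustration. With w=None it is plain OLS.
    """
    n, k = X.shape
    if w is None:
        w = np.ones(n)
    Xw = X * w[:, None]                      # rows of X scaled by weights: W X
    bread = np.linalg.inv(X.T @ Xw)          # (X' W X)^{-1}
    beta = bread @ (Xw.T @ y)                # (X' W X)^{-1} X' W y
    resid = y - X @ beta

    # Conventional formula: sigma^2 (X' W X)^{-1}, treating weights as
    # inverse error variances rather than as sampling weights.
    sigma2 = (w * resid**2).sum() / (n - k)
    conventional_se = np.sqrt(np.diag(sigma2 * bread))

    # White (HC0) sandwich: (X'WX)^{-1} [X' W diag(e^2) W X] (X'WX)^{-1}
    meat = (Xw * (resid**2)[:, None]).T @ Xw
    white_se = np.sqrt(np.diag(bread @ meat @ bread))
    return beta, conventional_se, white_se

# Simulated population (assumed values): y = 1 + 2x + noise.
rng = np.random.default_rng(0)
N = 20_000
x_pop = rng.normal(size=N)
y_pop = 1.0 + 2.0 * x_pop + rng.normal(size=N)

# Unequal selection probabilities that depend only on x.
p = np.where(x_pop > 0, 0.05, 0.20)
keep = rng.random(N) < p
x, y = x_pop[keep], y_pop[keep]
weights = 1.0 / p[keep]                      # sampling weight = 1 / P(selected)

X = np.column_stack([np.ones_like(x), x])    # intercept + slope

beta_u, conv_u, white_u = fit_ols(X, y)              # unweighted OLS
beta_w, conv_w, white_w = fit_ols(X, y, weights)     # weighted OLS

print("unweighted beta:", beta_u, "White SE:", white_u)
print("weighted beta:  ", beta_w, "White SE:", white_w)
print("weighted conventional SE:", conv_w)
```

Comparing `conv_w` with `white_w` shows how far the conventional weighted formula can drift from the sandwich estimate; the size of the gap in any real survey depends on the design and is not something this toy simulation can establish.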