Multilevel Modelling of Complex Survey Data
Top Cited Papers
- 16 May 2006
- journal article
- Published by Oxford University Press (OUP) in Journal of the Royal Statistical Society Series A: Statistics in Society
- Vol. 169 (4) , 805-827
- https://doi.org/10.1111/j.1467-985x.2006.00426.x
Abstract
Summary: Multilevel modelling is sometimes used for data from complex surveys involving multistage sampling, unequal sampling probabilities and stratification. We consider generalized linear mixed models and particularly the case of dichotomous responses. A pseudolikelihood approach for accommodating inverse probability weights in multilevel models with an arbitrary number of levels is implemented by using adaptive quadrature. A sandwich estimator is used to obtain standard errors that account for stratification and clustering. When level 1 weights are used that vary between elementary units in clusters, the scaling of the weights becomes important. We point out that not only variance components but also regression coefficients can be severely biased when the response is dichotomous. The pseudolikelihood methodology is applied to complex survey data on reading proficiency from the American sample of the ‘Program for international student assessment’ 2000 study, using the Stata program gllamm which can estimate a wide range of multilevel and latent variable models. Performance of pseudo-maximum-likelihood with different methods for handling level 1 weights is investigated in a Monte Carlo experiment. Pseudo-maximum-likelihood estimators of (conditional) regression coefficients perform well for large cluster sizes but are biased for small cluster sizes. In contrast, estimators of marginal effects perform well in both situations. We conclude that caution must be exercised in pseudo-maximum-likelihood estimation for small cluster sizes when level 1 weights are used.Keywords
All Related Versions
This publication has 44 references indexed in Scilit:
- MODEL‐BASED VARIANCE ESTIMATION IN SURVEYS WITH STRATIFIED CLUSTERED DESIGNAustralian Journal of Statistics, 1996
- Analysis of Semiparametric Regression Models for Repeated Outcomes in the Presence of Missing DataJournal of the American Statistical Association, 1995
- Analysis of Semiparametric Regression Models for Repeated Outcomes in the Presence of Missing DataJournal of the American Statistical Association, 1995
- Approximate Inference in Generalized Linear Mixed ModelsJournal of the American Statistical Association, 1993
- Nonlinear Multilevel Models, with an Application to Discrete Response DataBiometrika, 1991
- Some Common Problems in Log-Linear AnalysisSociological Methods & Research, 1987
- Social Class Segregation and Its Relationship to Pupils' Examination Results in ScotlandAmerican Sociological Review, 1986
- Models for Nonresponse in Sample SurveysJournal of the American Statistical Association, 1982
- Models for Nonresponse in Sample SurveysJournal of the American Statistical Association, 1982
- Survey Design under the Regression Superpopulation ModelJournal of the American Statistical Association, 1982