Comparison of statistical methods for analysis of clustered binary observations
- 10 November 2004
- journal article
- research article
- Published by Wiley in Statistics in Medicine
- Vol. 24 (6) , 911-923
- https://doi.org/10.1002/sim.1958
Abstract
When correlated observations are obtained in a randomized controlled trial, the assumption of independence among observations within cluster likely will not hold because the observations share the same cluster (e.g. clinic, physician, or subject). Further, the outcome measurements of interest are often binary. The objective of this paper is to compare the performance of four statistical methods for analysis of clustered binary observations: namely (1) full likelihood method; (2) penalized quasi‐likelihood method; (3) generalized estimating equation method; (4) fixed‐effects logistic regression method. The first three methods take correlations into account in inferential processes whereas the last method does not. Type I error rate, power, bias, and standard error are compared across the four statistical methods through computer simulations under varying effect sizes, intraclass correlation coefficients, number of clusters, and number of observations per cluster, including large numbers 20 and 100 of observations per cluster. The results show that the performance of the full likelihood and the penalized quasi‐likelihood methods is superior for analysis of clustered binary observations, and is not necessarily inferior to that of the fixed‐effects logistic regression fit even when within‐cluster correlations are zero. Copyright © 2004 John Wiley & Sons, Ltd.Keywords
This publication has 24 references indexed in Scilit:
- Sample-Size Requirements for Comparisons of Two Groups on Repeated Observations of a Binary OutcomeEvaluation & the Health Professions, 2004
- Bias Correction in Generalized Linear Mixed Models with Multiple Components of DispersionJournal of the American Statistical Association, 1996
- Small sample characteristics of generalized estimating equationsCommunications in Statistics - Simulation and Computation, 1995
- A comparison of the generalized estimating equation approach with the maximum likelihood approach for repeated measurementsStatistics in Medicine, 1993
- Approximate Inference in Generalized Linear Mixed ModelsJournal of the American Statistical Association, 1993
- On some small sample properties of generalized estimating equationEstimates for multivariate dichotomous outcomesJournal of Statistical Computation and Simulation, 1992
- Generalized Linear Models with Random Effects; a Gibbs Sampling ApproachJournal of the American Statistical Association, 1991
- The Evaluation of Integrals of the Form +∞ -∞ f(t)exp(- t 2 ) dt: Application to Logistic-Normal ModelsJournal of the American Statistical Association, 1990
- Longitudinal data analysis using generalized linear modelsBiometrika, 1986
- Maximum Likelihood Approaches to Variance Component Estimation and to Related ProblemsJournal of the American Statistical Association, 1977