The merits of breaking the matches: a cautionary tale
- 23 August 2006
- journal article
- research article
- Published by Wiley in Statistics in Medicine
- Vol. 26 (9) , 2036-2051
- https://doi.org/10.1002/sim.2662
Abstract
Matched-pair cluster randomization trials are frequently adopted as the design of choice for evaluating an intervention offered at the community level. However, previous research has demonstrated that a strategy of breaking the matches and performing an unmatched analysis may be more efficient than performing a matched analysis on the resulting data, particularly when the total number of communities is small and the matching is judged as relatively ineffective. The research concerning this question has naturally focused on testing the effect of intervention. However, a secondary objective of many community intervention trials is to investigate the effect of individual-level risk factors on one or more outcome variables. Focusing on the case of a continuous outcome variable, we show that the practice of performing an unmatched analysis on data arising from a matched-pair design can lead to bias in the estimated regression coefficient, and a corresponding test of significance which is overly liberal. However, for large-scale community intervention trials, which typically recruit a relatively small number of large clusters, such an analysis will generally be both valid and efficient. We also consider other approaches to testing the effect of an individual-level risk factor in a matched-pair cluster randomization design, including a generalized linear model approach that preserves the matching, a two-stage cluster-level analysis, and an approach based on generalized estimating equations. Copyright © 2006 John Wiley & Sons, Ltd.
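The contrast between an unmatched and a matching-preserving analysis of an individual-level risk factor can be illustrated with a small simulation. The sketch below is not taken from the article: the data-generating model, the parameter values, and the function names (`simulate_trial`, `beta_unmatched`, `beta_matched`, `compare`) are all assumptions made for illustration. In particular, it assumes that the risk factor's mean shifts with a pair-level effect, which is only one way an unmatched analysis might become biased and may differ from the mechanism the authors study. It looks only at the point estimate, not at the significance test.

```python
import numpy as np


def simulate_trial(rng, n_pairs=10, n_per_cluster=50, beta=0.5,
                   delta=1.0, pair_sd=1.0, rho=0.8):
    """Simulate one matched-pair cluster trial with a continuous outcome.

    Hypothetical model: each pair has a shared random effect, one cluster
    per pair receives the intervention, and the individual-level risk
    factor x has a mean that shifts with the pair effect (strength rho).
    """
    pair_effect = rng.normal(0.0, pair_sd, n_pairs)
    # Pair label for every individual: two clusters of equal size per pair.
    pair = np.repeat(np.arange(n_pairs), 2 * n_per_cluster)
    # Within each pair: first cluster is control (0), second is treated (1).
    treat = np.tile(np.repeat([0.0, 1.0], n_per_cluster), n_pairs)
    x = rho * pair_effect[pair] + rng.normal(0.0, 1.0, pair.size)
    noise = rng.normal(0.0, 1.0, pair.size)
    y = delta * treat + beta * x + pair_effect[pair] + noise
    return pair, treat, x, y


def beta_unmatched(treat, x, y):
    """Least-squares coefficient of x when the pairing is ignored."""
    design = np.column_stack([np.ones_like(x), treat, x])
    coef, *_ = np.linalg.lstsq(design, y, rcond=None)
    return coef[2]


def beta_matched(pair, treat, x, y):
    """Least-squares coefficient of x with one indicator column per pair."""
    dummies = (pair[:, None] == np.unique(pair)[None, :]).astype(float)
    design = np.column_stack([dummies, treat, x])
    coef, *_ = np.linalg.lstsq(design, y, rcond=None)
    return coef[-1]


def compare(n_reps=200, seed=0, **kwargs):
    """Average both estimators over repeated simulated trials."""
    rng = np.random.default_rng(seed)
    unmatched, matched = [], []
    for _ in range(n_reps):
        pair, treat, x, y = simulate_trial(rng, **kwargs)
        unmatched.append(beta_unmatched(treat, x, y))
        matched.append(beta_matched(pair, treat, x, y))
    return float(np.mean(unmatched)), float(np.mean(matched))


if __name__ == "__main__":
    mean_unmatched, mean_matched = compare()
    print(f"true beta = 0.5, unmatched mean = {mean_unmatched:.3f}, "
          f"matched mean = {mean_matched:.3f}")
```

Under these assumed settings the matched estimate should average close to the true coefficient, while the unmatched estimate absorbs part of the pair effect. Setting `rho=0.0` removes the assumed link between the risk factor and the pair effect, in which case both estimators target the same value.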