The Essential Role of Pair Matching in Cluster-Randomized Experiments, with Application to the Mexican Universal Health Insurance Evaluation

Open Access

1 February 2009

journal article
research article
Published by Institute of Mathematical Statistics in Statistical Science

Vol. 24 (1), 29-72
https://doi.org/10.1214/08-sts274

Abstract

A basic feature of many field experiments is that investigators are only able to randomize clusters of individuals (such as households, communities, firms, medical practices, schools or classrooms) even when the individual is the unit of interest. To recoup the resulting efficiency loss, some studies pair similar clusters and randomize treatment within pairs. However, many other studies avoid pairing, in part because of claims in the literature, echoed by clinical trials standards organizations, that this matched-pair, cluster-randomization design has serious problems. We argue that all such claims are unfounded. We also prove that the estimator recommended for this design in the literature is unbiased only in situations when matching is unnecessary; its standard error is also invalid. To overcome this problem without modeling assumptions, we develop a simple design-based estimator with much improved statistical properties. We also propose a model-based approach that includes some of the benefits of our design-based estimator as well as the estimator in the literature. Our methods also address individual-level noncompliance, which is common in applications but not allowed for in most existing methods. We show that from the perspective of bias, efficiency, power, robustness or research costs, and in large or small samples, pairing should be used in cluster-randomized experiments whenever feasible; failing to do so is equivalent to discarding a considerable fraction of one's data. We develop these techniques in the context of a randomized evaluation we are conducting of the Mexican Universal Health Insurance Program.

All Related Versions

Version 1, 2009-10-20, arXiv
