Two‐stage methods for the analysis of pooled data
- 28 June 2001
- journal article
- research article
- Published by Wiley in Statistics in Medicine
- Vol. 20 (14) , 2115-2130
- https://doi.org/10.1002/sim.852
Abstract
Epidemiologic studies of disease often produce inconclusive or contradictory results due to small sample sizes or regional variations in the disease incidence or the exposures. To clarify these issues, researchers occasionally pool and reanalyse original data from several large studies. In this paper we explore the use of a two-stage random-effects model for analysing pooled case-control studies and undertake a thorough examination of bias in the pooled estimator under various conditions. The two-stage model analyses each study using the model appropriate to the design with study-specific confounders, and combines the individual study-specific adjusted log-odds ratios using a linear mixed-effects model; it is computationally simple and can incorporate study-level covariates and random effects. Simulations indicate that when the individual studies are large, two-stage methods produce nearly unbiased exposure estimates and standard errors of the exposure estimates from a generalized linear mixed model. By contrast, joint fixed-effects logistic regression produces attenuated exposure estimates and underestimates the standard error when heterogeneity is present. While bias in the pooled regression coefficient increases with interstudy heterogeneity for both models, it is much smaller using the two-stage model. In pooled analyses, where covariates may not be uniformly defined and coded across studies, and occasionally not measured in all studies, a joint model is often not feasible. The two-stage method is shown to be a simple, valid and practical method for the analysis of pooled binary data. The results are applied to a study of reproductive history and cutaneous melanoma risk in women using data from ten large case-control studies. Copyright © 2001 John Wiley & Sons, Ltd.Keywords
This publication has 60 references indexed in Scilit:
- Approximate Inference in Generalized Linear Mixed ModelsJournal of the American Statistical Association, 1993
- Dietary Intake of Fiber and Decreased Risk of Cancers of the Colon and Rectum: Evidence From the Combined Analysis of 13 Case-Control StudiesJNCI Journal of the National Cancer Institute, 1992
- The Danish case‐control study of cutaneous malignant melanoma. III. Hormonal and reproductive factors in womenInternational Journal of Cancer, 1988
- Meta-analysis in clinical trialsControlled Clinical Trials, 1986
- Parametric Empirical Bayes Inference: Theory and ApplicationsJournal of the American Statistical Association, 1983
- Bias and Efficiency in Logistic Analyses of Stratified Case-Control StudiesInternational Journal of Epidemiology, 1980
- Some results on the estimation of logistic models based on retrospective dataBiometrika, 1979