Adjustment for Response Bias Via Two-phase Analysis
- 1 November 2009
- journal article
- research article
- Published by Wolters Kluwer Health in Epidemiology
- Vol. 20 (6) , 872-879
- https://doi.org/10.1097/ede.0b013e3181b2ff66
Abstract
Background: Records-based studies often have limited covariate data, leading some researchers to collect survey data on a subset. Results for survey responders may be biased due to selective nonresponse and will be less precise due to the decreased responder sample size. We use data from a study of air pollution and birth outcomes to illustrate how a 2-phase analysis can yield less biased and more precise results. Methods: Our phase 1 group was a cohort of Los Angeles births from which we obtained a phase 2 group of survey responders. We compared estimates for the odds ratio (OR) between entire pregnancy carbon monoxide (CO) exposure and low birth weight in the first- and second-phase groups, adjusting only for variables available for both groups. Results: For CO exposure of 1 part per million or higher, the conventional adjusted ORs and 95% confidence intervals for low birth weight were 1.15 (1.06–1.25) and 1.33 (1.06–1.68) for the phase 1 and 2 groups, suggesting a possible response bias and decreased precision in the latter estimate. We performed 2-phase analyses of the survey responders and found results similar to those for the cohort when we accounted for possible differential response by CO exposure. In our final analysis, we included both birth record and survey variables in a 2-phase model corrected for possible response bias. The results from weighted-, pseudo-, and maximum-likelihood were similar: 1.13 (1.03–1.25); 1.14 (1.01–1.29); and 1.10 (0.97–1.24), respectively. Conclusion: Our approach provides a means of checking for response bias and adjusting both point and interval estimates to account for differential response.Keywords
This publication has 9 references indexed in Scilit:
- Ambient Air Pollution and Preterm Birth in the Environment and Pregnancy Outcomes Study at the University of California, Los AngelesAmerican Journal of Epidemiology, 2007
- Maximum Likelihood Estimation of Logistic Regression Parameters under Two-phase, Outcome-dependent SamplingJournal of the Royal Statistical Society Series B: Statistical Methodology, 1997
- WEIGHTED LIKELIHOOD, PSEUDO-LIKELIHOOD AND MAXIMUM LIKELIHOOD METHODS FOR LOGISTIC REGRESSION ANALYSIS OF TWO-STAGE DATAStatistics in Medicine, 1997
- Estimation of Regression Coefficients When Some Regressors are not Always ObservedJournal of the American Statistical Association, 1994
- Fitting Logistic Regression Models in Stratified Case-Control StudiesPublished by JSTOR ,1991
- Analytic methods for two‐stage case‐control studies and other stratified designsStatistics in Medicine, 1991
- Logistic regression for two-stage case-control dataBiometrika, 1988
- Anamorphic Analysis: Sampling and Estimation for Covariate Effects when Both Exposure and Disease are KnownPublished by JSTOR ,1982
- A TWO STAGE DESIGN FOR THE STUDY OF THE RELATIONSHIP BETWEEN A RARE EXPOSURE AND A RARE DISEASE1American Journal of Epidemiology, 1982