Using Bivariate Models to Understand between‐ and within‐Cluster Regression Coefficients, with Application to Twin Data
- 6 April 2006
- journal article
- Published by Oxford University Press (OUP) in Biometrics
- Vol. 62 (3) , 745-751
- https://doi.org/10.1111/j.1541-0420.2006.00561.x
Abstract
Summary In the regression analysis of clustered data it is important to allow for the possibility of distinct between- and within-cluster exposure effects on the outcome measure, represented, respectively, by regression coefficients for the cluster mean and the deviation of the individual-level exposure value from this mean. In twin data, the within-pair regression effect represents association conditional on exposures shared within pairs, including any common genetic or environmental influences on the outcome measure. It has therefore been proposed that a comparison of the within-pair regression effects between monozygous (MZ) and dizygous (DZ) twins can be used to examine whether the association between exposure and outcome has a genetic origin. We address this issue by proposing a bivariate model for exposure and outcome measurements in twin-pair data. The between- and within-pair regression coefficients are shown to be weighted averages of ratios of the exposure and outcome variances and covariances, from which it is straightforward to determine the conditions under which the within-pair regression effect in MZ pairs will be different from that in DZ pairs. In particular, we show that a correlation structure in twin pairs for exposure and outcome that appears to be due to genetic factors will not necessarily be reflected in distinct MZ and DZ values for the within-pair regression coefficients. We illustrate these results in a study of female twin pairs from Australia and North America relating mammographic breast density to weight and body mass index.Keywords
This publication has 16 references indexed in Scilit:
- Regression models for twin studies: a critical reviewInternational Journal of Epidemiology, 2005
- Studies of twins: what can they tell us about the fetal origins of adult disease?Paediatric and Perinatal Epidemiology, 2005
- Separating within and between effects in family studies: an application to the study of blood pressure in childrenStatistics in Medicine, 2004
- Separation of individual‐level and cluster‐level covariate effects in regression analysis of correlated dataStatistics in Medicine, 2003
- Twins and fetal origins hypothesis: within-pair analysesThe Lancet, 2002
- Evidence for Genetic Factors Explaining the Birth Weight–Blood Pressure RelationHypertension, 2000
- Inference from Iterative Simulation Using Multiple SequencesStatistical Science, 1992
- The Effect of Two-Stage Sampling on Ordinary Least Squares MethodsJournal of the American Statistical Association, 1982
- The Effect of Two-Stage Sampling on Ordinary Least Squares MethodsJournal of the American Statistical Association, 1982
- The Use of Variance Components Models in Pooling Cross Section and Time Series DataEconometrica, 1971