Clustered Environments and Randomized Genes: A Fundamental Distinction between Conventional and Genetic Epidemiology
Open Access
- 11 December 2007
- journal article
- research article
- Published by Public Library of Science (PLoS) in PLoS Medicine
- Vol. 4 (12) , e352
- https://doi.org/10.1371/journal.pmed.0040352
Abstract
In conventional epidemiology, confounding of the exposure of interest with lifestyle or socioeconomic factors, and reverse causation, whereby disease status influences exposure rather than vice versa, may invalidate causal interpretations of observed associations. Conversely, genetic variants should not be related to the confounding factors that distort associations in conventional observational epidemiological studies. Furthermore, disease onset will not influence genotype. Therefore, it has been suggested that genetic variants that are known to be associated with a modifiable (nongenetic) risk factor can be used to help determine the causal effect of this modifiable risk factor on disease outcomes. This approach, mendelian randomization, is increasingly being applied within epidemiological studies. However, there is debate about the underlying premise that associations between genotypes and disease outcomes are not confounded by other risk factors. We examined the extent to which genetic variants, on the one hand, and nongenetic environmental exposures or phenotypic characteristics, on the other, tend to be associated with each other, to assess the degree of confounding that would exist in conventional epidemiological studies compared with mendelian randomization studies. We estimated pairwise correlations between nongenetic baseline variables and genetic variables in a cross-sectional study, and compared the number of correlations that were statistically significant at the 5%, 1%, and 0.01% levels (α = 0.05, 0.01, and 0.0001, respectively) with the number expected by chance if all variables were in fact uncorrelated, using a two-sided binomial exact test.
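The counting step described above can be illustrated with a minimal sketch on simulated data. Everything in it is an assumption made for illustration and is not taken from the study: the data are independent random draws rather than cohort measurements, the sample size and number of variables are arbitrary, the helper names (`pearson_r`, `two_sided_p`, `count_significant`) are hypothetical, and the p-value uses a Fisher z-transform normal approximation rather than whatever test the authors applied.

```python
import itertools
import math
import random
from statistics import NormalDist


def pearson_r(x, y):
    """Sample Pearson correlation of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def two_sided_p(r, n):
    """Approximate two-sided p-value for r (Fisher z-transform; needs |r| < 1)."""
    z = math.atanh(r) * math.sqrt(n - 3)
    return 2.0 * (1.0 - NormalDist().cdf(abs(z)))


def count_significant(columns, alpha):
    """Return (number of significant pairs, total number of pairs)."""
    n = len(columns[0])
    pairs = list(itertools.combinations(columns, 2))
    hits = sum(1 for x, y in pairs if two_sided_p(pearson_r(x, y), n) < alpha)
    return hits, len(pairs)


# Simulated, mutually independent variables (illustrative sizes only).
random.seed(0)
n_subjects, n_variables = 200, 20
columns = [[random.gauss(0, 1) for _ in range(n_subjects)]
           for _ in range(n_variables)]

alpha = 0.05
observed, total = count_significant(columns, alpha)
expected = alpha * total  # count expected by chance if nothing is correlated
print(f"{observed} of {total} pairs significant; "
      f"about {expected:.1f} expected by chance")
```

With truly independent variables the observed count should stay close to `alpha * total`; a large excess over that figure is the signal the comparison is designed to detect.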
We demonstrate that behavioural, socioeconomic, and physiological factors are strongly interrelated, with 45% of all possible pairwise associations between 96 nongenetic characteristics (n = 4,560 correlations) being significant at the p < 0.01 level (the ratio of observed to expected significant associations was 45; p-value for difference between observed and expected < 0.000001). Similar findings were observed for other levels of significance. In contrast, genetic variants showed no greater association with each other, or with the 96 behavioural, socioeconomic, and physiological factors, than would be expected by chance. These data illustrate why observational studies have produced misleading claims regarding potentially causal factors for disease. The findings demonstrate the potential power of a methodology that utilizes genetic variants as indicators of exposure level when studying environmentally modifiable risk factors.
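The arithmetic behind the reported figures can be checked with a short sketch. The number of pairs and the expected count follow directly from the abstract; the observed count is an assumption, reconstructed here from the rounded 45% figure because the exact count is not given in this text. The helper names `binom_log_pmf` and `binom_two_sided_p` are hypothetical, and the two-sided rule used (summing the probability of all outcomes no more likely than the observed one) is one common definition of an exact binomial test, not necessarily the authors' implementation.

```python
import math


def binom_log_pmf(k, n, p):
    """Log of the Binomial(n, p) probability mass at k."""
    log_choose = (math.lgamma(n + 1) - math.lgamma(k + 1)
                  - math.lgamma(n - k + 1))
    return log_choose + k * math.log(p) + (n - k) * math.log(1 - p)


def binom_two_sided_p(k, n, p):
    """Exact two-sided p-value: total mass of outcomes no more likely than k."""
    log_obs = binom_log_pmf(k, n, p)
    tol = 1e-9  # tolerance for floating-point ties, compared in log space
    total = sum(math.exp(lp)
                for lp in (binom_log_pmf(i, n, p) for i in range(n + 1))
                if lp <= log_obs + tol)
    return min(1.0, total)


n_pairs = math.comb(96, 2)        # all pairs among 96 characteristics
alpha = 0.01
expected = alpha * n_pairs        # significant pairs expected by chance
observed = round(0.45 * n_pairs)  # ASSUMPTION: rebuilt from the rounded 45%
ratio = observed / expected
p_value = binom_two_sided_p(observed, n_pairs, alpha)

print(n_pairs, round(expected, 1), observed, round(ratio, 1))
print(p_value < 1e-6)
```

Under these assumptions the pair count matches the stated n = 4,560, the observed-to-expected ratio comes out at about 45, and the p-value falls far below the reported bound of 0.000001.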