Discovery Properties of Genome-wide Association Signals From Cumulatively Combined Data Sets
Open Access
- 6 October 2009
- journal article
- review article
- Published by Oxford University Press (OUP) in American Journal of Epidemiology
- Vol. 170 (10) , 1197-1206
- https://doi.org/10.1093/aje/kwp262
Abstract
Genetic effects for common variants affecting complex disease risk are subtle. Single genome-wide association (GWA) studies are typically underpowered to detect these effects, and combination of several GWA data sets is needed to enhance discovery. The authors investigated the properties of the discovery process in simulated cumulative meta-analyses of GWA study-derived signals allowing for potential genetic model misspecification and between-study heterogeneity. Variants with null effects on average (but also between-data set heterogeneity) could yield false-positive associations with seemingly homogeneous effects. Random effects had higher than appropriate false-positive rates when there were few data sets. The log-additive model had the lowest false-positive rate. Under heterogeneity, random-effects meta-analyses of 2–10 data sets averaging 1,000 cases/1,000 controls each did not increase power, or the meta-analysis was even less powerful than a single study (power desert). Upward bias in effect estimates and underestimation of between-study heterogeneity were common. Fixed-effects calculations avoided power deserts and maximized discovery of association signals at the expense of much higher false-positive rates. Therefore, random- and fixed-effects models are preferable for different purposes (fixed effects for initial screenings, random effects for generalizability applications). These results may have broader implications for the design and interpretation of large-scale multiteam collaborative studies discovering common gene variants.
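The contrast between fixed- and random-effects pooling described in the abstract can be illustrated with a minimal sketch. It assumes inverse-variance weighting for the fixed-effects estimate and the DerSimonian-Laird moment estimator for the between-study variance; the abstract does not state which estimators the authors used, so both choices are assumptions. The function names and the per-study log odds ratios are hypothetical and are not taken from the article.

```python
import math


def fixed_effects(betas, ses):
    """Inverse-variance weighted pooled estimate and its standard error."""
    weights = [1.0 / se ** 2 for se in ses]
    total = sum(weights)
    pooled = sum(w * b for w, b in zip(weights, betas)) / total
    return pooled, math.sqrt(1.0 / total)


def random_effects_dl(betas, ses):
    """Random-effects pooled estimate, its SE, and tau^2.

    Assumption: tau^2 is estimated with the DerSimonian-Laird
    moment estimator, truncated at zero.
    """
    k = len(betas)
    weights = [1.0 / se ** 2 for se in ses]
    total = sum(weights)
    pooled_fe, _ = fixed_effects(betas, ses)
    # Cochran's Q: weighted squared deviations from the fixed-effects estimate.
    q = sum(w * (b - pooled_fe) ** 2 for w, b in zip(weights, betas))
    c = total - sum(w ** 2 for w in weights) / total
    tau2 = max(0.0, (q - (k - 1)) / c) if c > 0 else 0.0
    # Adding tau^2 to each study variance flattens the weights and widens the SE.
    w_star = [1.0 / (se ** 2 + tau2) for se in ses]
    total_star = sum(w_star)
    pooled = sum(w * b for w, b in zip(w_star, betas)) / total_star
    return pooled, math.sqrt(1.0 / total_star), tau2


def two_sided_p(estimate, se):
    """Two-sided p-value from a normal approximation (Wald test)."""
    z = abs(estimate / se)
    return math.erfc(z / math.sqrt(2.0))


# Hypothetical per-study log odds ratios and standard errors (illustrative only).
betas = [0.30, -0.05, 0.25, 0.02]
ses = [0.07, 0.07, 0.07, 0.07]

fe_est, fe_se = fixed_effects(betas, ses)
re_est, re_se, tau2 = random_effects_dl(betas, ses)

print("fixed effects :", fe_est, fe_se, two_sided_p(fe_est, fe_se))
print("random effects:", re_est, re_se, two_sided_p(re_est, re_se))
print("tau^2         :", tau2)
```

With heterogeneous inputs such as these, the estimated between-study variance is positive, so the random-effects standard error is larger than the fixed-effects one and the p-value is less extreme. This is the mechanism behind the trade-off the abstract reports: fixed effects favor discovery, while random effects are more conservative when effects differ across data sets.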