Rare Variants Create Synthetic Genome-Wide Associations
Top Cited Papers
Open Access
- 26 January 2010
- journal article
- research article
- Published by Public Library of Science (PLoS) in PLoS Biology
- Vol. 8 (1) , e1000294
- https://doi.org/10.1371/journal.pbio.1000294
Abstract
Genome-wide association studies (GWAS) have now identified at least 2,000 common variants that appear associated with common diseases or related traits (http://www.genome.gov/gwastudies), hundreds of which have been convincingly replicated. It is generally thought that the associated markers reflect the effect of a nearby common (minor allele frequency >0.05) causal site, which is associated with the marker, leading to extensive resequencing efforts to find causal sites. We propose as an alternative explanation that variants much less common than the associated one may create “synthetic associations” by occurring, stochastically, more often in association with one of the alleles at the common site versus the other allele. Although synthetic associations are an obvious theoretical possibility, they have never been systematically explored as a possible explanation for GWAS findings. Here, we use simple computer simulations to show the conditions under which such synthetic associations will arise and how they may be recognized. We show that they are not only possible, but inevitable, and that under simple but reasonable genetic models, they are likely to account for or contribute to many of the recently identified signals reported in genome-wide association studies. We also illustrate the behavior of synthetic associations in real datasets by showing that rare causal mutations responsible for both hearing loss and sickle cell anemia create genome-wide significant synthetic associations, in the latter case extending over a 2.5-Mb interval encompassing scores of “blocks” of associated variants. In conclusion, uncommon or rare genetic variants can easily create synthetic associations that are credited to common variants, and this possibility requires careful consideration in the interpretation and follow up of GWAS signals. It has long been assumed that common genetic variants of modest effect make an important contribution to common human diseases, such as most forms of cardiovascular disease, asthma, and neuropsychiatric disease. Genome-wide scans evaluating the role of common variation have now been completed for all common disease using technology that claims to capture greater than 90% of common variants in major human populations. Surprisingly, the proportion of variation explained by common variation appears to be very modest, and moreover, there are very few examples of the actual variant being identified. At the same time, rare variants have been found with very large effects. Now it is demonstrated in a simulation study that even those signals that have been detected for common variants could, in principle, come from the effect of rare ones. This has important implications for our understanding of the genetic architecture of human disease and in the design of future studies to detect causal genetic variants.Keywords
This publication has 39 references indexed in Scilit:
- Personal genomes: The case of the missing heritabilityNature, 2008
- Large recurrent microdeletions associated with schizophreniaNature, 2008
- Forty-six genes causing nonsyndromic hearing impairment: Which ones should be analyzed in DNA diagnostics?Mutation Research - Fundamental and Molecular Mechanisms of Mutagenesis, 2008
- Association Analysis in African Americans of European-Derived Type 2 Diabetes Single Nucleotide Polymorphisms From Whole-Genome Association StudiesDiabetes, 2008
- Genetic analysis of reelin gene (RELN) SNPs: No association with autism spectrum disorder in the Indian populationNeuroscience Letters, 2008
- Deletion of CFHR3 and CFHR1 genes in age-related macular degenerationHuman Molecular Genetics, 2007
- Large-scale genetic fine mapping and genotype-phenotype associations implicate polymorphism in the IL2RA region in type 1 diabetesNature Genetics, 2007
- GENOME: a rapid coalescent-based whole genome simulatorBioinformatics, 2007
- GJB2 (connexin 26) variants and nonsyndromic sensorineural hearing loss: A HuGE reviewGenetics in Medicine, 2002
- A Deletion Involving the Connexin 30 Gene in Nonsyndromic Hearing ImpairmentNew England Journal of Medicine, 2002