Genotype‐based matching to correct for population stratification in large‐scale case‐control genetic association studies
- 23 January 2009
- journal article
- research article
- Published by Wiley in Genetic Epidemiology
- Vol. 33 (6), 508-517
- https://doi.org/10.1002/gepi.20403
Abstract
Genome‐wide association studies are helping to dissect the etiology of complex diseases. Although case‐control association tests are generally more powerful than family‐based association tests, population stratification can lead to spurious disease‐marker association or mask a true association. Several methods have been proposed to match cases and controls prior to genotyping, using family information or epidemiological data, or using genotype data for a modest number of genetic markers. Here, we describe a genetic similarity score matching (GSM) method for efficient matched analysis of cases and controls in a genome‐wide or large‐scale candidate gene association study. GSM comprises three steps: (1) calculating similarity scores for pairs of individuals using the genotype data; (2) matching sets of cases and controls based on the similarity scores so that matched cases and controls have similar genetic background; and (3) using conditional logistic regression to perform association tests. Through computer simulation we show that GSM correctly controls false‐positive rates and improves power to detect true disease predisposing variants. We compare GSM to genomic control using computer simulations, and find improved power using GSM. We suggest that initial matching of cases and controls prior to genotyping combined with careful re‐matching after genotyping is a method of choice for genome‐wide association studies. Genet. Epidemiol. 33:508–517, 2009.
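The three GSM steps named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation, and the abstract does not specify their similarity score or matching algorithm. The sketch assumes genotypes coded as 0/1/2 allele counts, uses mean identity-by-state (IBS) sharing as a stand-in similarity score, uses greedy 1:1 pairing as a stand-in for the matching step, and fits the conditional logistic model for a single test marker by Newton's method. All function names are hypothetical.

```python
import math

def ibs_similarity(g1, g2):
    """Step 1 (assumed score): mean identity-by-state sharing, from 0 to 1,
    between two genotype vectors coded as 0/1/2 allele counts."""
    shared = sum(2 - abs(a - b) for a, b in zip(g1, g2))
    return shared / (2 * len(g1))

def greedy_match(cases, controls):
    """Step 2 (assumed scheme): pair each case with its most similar
    unused control, 1:1, in the order the cases are given."""
    unused = set(range(len(controls)))
    pairs = []
    for i, case in enumerate(cases):
        if not unused:
            break
        best = max(sorted(unused),
                   key=lambda j: ibs_similarity(case, controls[j]))
        unused.remove(best)
        pairs.append((i, best))
    return pairs

def conditional_logit_1to1(diffs, iters=50):
    """Step 3: conditional logistic regression for 1:1 matched pairs.
    diffs[k] is the test-marker genotype of case k minus that of its
    matched control. Each pair contributes 1 / (1 + exp(-beta * d)) to
    the conditional likelihood; beta is the per-allele log odds ratio."""
    beta = 0.0
    for _ in range(iters):
        grad = hess = 0.0
        for d in diffs:
            p = 1.0 / (1.0 + math.exp(-beta * d))
            grad += d * (1.0 - p)
            hess -= d * d * p * (1.0 - p)
        if hess == 0.0:  # no informative (discordant) pairs
            break
        step = grad / hess
        beta -= step
        if abs(step) < 1e-10:
            break
    return beta

# Toy data: background markers drive the matching, then one marker is tested.
cases = [[0, 0, 0], [2, 2, 2]]
controls = [[2, 2, 1], [0, 1, 0], [1, 1, 1]]
pairs = greedy_match(cases, controls)
beta = conditional_logit_1to1([1, 1, 1, -1])
```

In practice the similarity scores would be computed from many markers, and an optimal or variable-ratio matching could replace the greedy pairing. The Newton fit does not converge to a finite estimate when every discordant pair points the same way; a production analysis would use an established conditional logistic regression routine.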