Centralizing the non‐central chi‐square: a new method to correct for population stratification in genetic case‐control association studies

24 February 2006

journal article
research article
Published by Wiley in Genetic Epidemiology

Vol. 30 (4) , 277-289
https://doi.org/10.1002/gepi.20143

Abstract

We present a new method, the δ‐centralization (DC) method, to correct for population stratification (PS) in case‐control association studies. DC works well even when there is considerable confounding due to PS. PS causes overdispersion in the usual chi‐square statistics, which then have non‐central chi‐square distributions. Other methods approach the non‐centrality indirectly, but we deal with it directly, by estimating the non‐centrality parameter τ itself. Specifically: (1) We define a quantity δ, a function of the relevant subpopulation parameters. We show that, for relatively large samples, δ exactly predicts the elevation of the false positive rate due to PS, when there is no true association between marker genotype and disease. (This quantity δ is quite different from Wright's F_ST and can be large even when F_ST is small.) (2) We show how to estimate δ, using a panel of unlinked “neutral” loci. (3) We then show that δ² corresponds to τ, the non‐centrality parameter of the chi‐square distribution. Thus, we can centralize the chi‐square using our estimate of δ; this is the DC method. (4) We demonstrate, via computer simulations, that DC works well with as few as 25–30 unlinked markers, where the markers are chosen to have allele frequencies reasonably close (within ±0.1) to those at the test locus. (5) We compare DC with genomic control and show that, whereas the latter becomes overconservative when there is considerable confounding due to PS (i.e. when δ is large), DC performs well for all values of δ. Genet. Epidemiol. 2006.
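
The relationship δ² = τ in point (3) can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes a 1-df chi-square statistic, so that the statistic behaves like (Z + δ)² with Z standard normal. The abstract does not give the paper's estimator of δ, so a simple moment estimator stands in for it here, using the fact that a 1-df non-central chi-square has mean 1 + τ. All function names are hypothetical. A genomic-control style correction (dividing by an inflation factor based on the median of the neutral statistics) is included only for comparison.

```python
import math
import statistics


def norm_cdf(z):
    """Standard normal cumulative distribution function."""
    return 0.5 * math.erfc(-z / math.sqrt(2.0))


def ncx2_sf_1df(x, tau):
    """P(X > x) for a 1-df non-central chi-square with non-centrality tau.

    Uses X = (Z + delta)^2 with delta = sqrt(tau) and Z ~ N(0, 1).
    With tau = 0 this is the ordinary (central) chi-square tail.
    """
    if x <= 0.0:
        return 1.0
    root = math.sqrt(x)
    delta = math.sqrt(tau)
    return norm_cdf(-(root - delta)) + norm_cdf(-(root + delta))


def estimate_tau_moments(neutral_stats):
    """ASSUMPTION: stand-in moment estimator, not the paper's estimator.

    For a 1-df non-central chi-square, E[X] = 1 + tau, so tau is
    estimated as the mean neutral-locus statistic minus 1, floored at 0.
    """
    mean = sum(neutral_stats) / len(neutral_stats)
    return max(0.0, mean - 1.0)


def dc_style_p_value(test_stat, neutral_stats):
    """P-value of the test statistic against the non-central null."""
    tau = estimate_tau_moments(neutral_stats)
    return ncx2_sf_1df(test_stat, tau)


def gc_style_p_value(test_stat, neutral_stats):
    """Genomic-control style p-value, for comparison only.

    Divides the statistic by lambda = median(neutral) / 0.4549364
    (the median of a central 1-df chi-square), with lambda floored at 1.
    """
    lam = max(1.0, statistics.median(neutral_stats) / 0.4549364)
    return ncx2_sf_1df(test_stat / lam, 0.0)


if __name__ == "__main__":
    # Hypothetical chi-square statistics at unlinked neutral loci.
    neutral = [0.8, 2.5, 1.9, 3.1, 0.4, 2.2, 1.6, 2.9]
    stat = 9.0  # hypothetical statistic at the test locus
    print("uncorrected:", ncx2_sf_1df(stat, 0.0))
    print("DC-style   :", dc_style_p_value(stat, neutral))
    print("GC-style   :", gc_style_p_value(stat, neutral))
```

Calling `ncx2_sf_1df(3.84, tau)` with increasing `tau` shows how the false positive rate at the nominal 5% threshold rises with δ when no correction is applied. The sketch treats all neutral loci as sharing one τ, which is a simplification. The abstract instead specifies markers whose allele frequencies are close to those at the test locus.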

This publication has 19 references indexed in Scilit:

FAST‐TRACK: Integrating QTL mapping and genome scans towards the characterization of candidate loci under parallel selection in the lake whitefish (Coregonus clupeaformis)
Molecular Ecology, 2004
Effect of Population Stratification on Case-Control Association Studies
Human Heredity, 2004
Effect of Population Stratification on Case-Control Association Studies
Human Heredity, 2004
Genomic Control to the extreme
Nature Genetics, 2004
Reply to "Genomic Control to the extreme"
Nature Genetics, 2004
Case-Control Association Studies in Mixed Populations: Correcting Using Genomic Control
Human Heredity, 2004
Accounting for Unmeasured Population Substructure in Case-Control Studies of Genetic Association Using a Novel Latent-Class Model
American Journal of Human Genetics, 2001
Association Mapping in Structured Populations
American Journal of Human Genetics, 2000
Use of Unlinked Genetic Markers to Detect Population Stratification in Association Studies
American Journal of Human Genetics, 1999
THE GENETICAL STRUCTURE OF POPULATIONS
Annals of Eugenics, 1949