Practical population group assignment with selected informative markers: Characteristics and properties of Bayesian clustering via STRUCTURE
- 21 March 2005
- journal article
- research article
- Published by Wiley in Genetic Epidemiology
- Vol. 28 (4) , 302-312
- https://doi.org/10.1002/gepi.20070
Abstract
Population stratification, which is caused by population genetic substructure (PGS), is a critical issue for the design and interpretation of genetic association studies. Methods to address this problem have been devised, but little is known at this point about practical genotyping requirements for resolving PGS based on different marker characteristics. In this report, we seek to (1) identify a small, practical marker set to differentiate African Americans (AAs) from European Americans (EAs), and (2) assess the impact of marker efficiency and sample size on clustering individuals into subgroups by the methods of STRUCTURE (Pritchard et al., [2000a] Genetics 155:945–959). A panel of 37 markers was genotyped for 865 individuals (640 EAs and 225 AAs) from the Northeastern United States. Among EAs, the assignment accuracy reached >99% using only the 4 most efficient markers. Among AAs, the assignment accuracy exceeded 95% when using the 6 most informative markers. Smaller sample size increased the variance in population differentiation, rather than degrading the results consistently. We conclude that the use of marker‐efficiency measures for marker selection yielded a relatively small set of STR markers that were effective at differentiating EA and AA populations. The number of markers required is much lower than has been suggested in previous studies. Genet. Epidemiol. 2005.Keywords
This publication has 34 references indexed in Scilit:
- Detecting the number of clusters of individuals using the software structure: a simulation studyMolecular Ecology, 2005
- FAST‐TRACK: Integrating QTL mapping and genome scans towards the characterization of candidate loci under parallel selection in the lake whitefish (Coregonus clupeaformis)Molecular Ecology, 2004
- Design and Analysis of Admixture Mapping StudiesAmerican Journal of Human Genetics, 2004
- Assessing the impact of population stratification on genetic association studiesNature Genetics, 2004
- Informativeness of Genetic Markers for Inference of Ancestry*American Journal of Human Genetics, 2003
- Genetic Variation Among World Populations: Inferences From 100 Alu Insertion PolymorphismsGenome Research, 2003
- Genetic Structure of Human PopulationsScience, 2002
- Adjusting for population structure in admixed populationsGenetic Epidemiology, 2002
- Accounting for Unmeasured Population Substructure in Case-Control Studies of Genetic Association Using a Novel Latent-Class ModelAmerican Journal of Human Genetics, 2001
- Association Mapping in Structured PopulationsAmerican Journal of Human Genetics, 2000