Number of SNPS Loci Needed to Detect Population Structure
- 1 August 2003
- journal article
- research article
- Published by S. Karger AG in Human Heredity
- Vol. 55 (1) , 37-45
- https://doi.org/10.1159/000071808
Abstract
The study of the association of polymorphic genetic markers with common diseases is one of the most powerful tools in modern genetics. Interest in single nucleotide polymorphisms (SNPs) has steadily grown over the last decade. SNPs are currently the most developed markers in the human genome because they have a number of advantages over other marker types. One of the critical problems responsible for ‘spurious’ association findings in case-control studies is population stratification. There are many statistical approaches developed for detecting population heterogeneity. However the power to detect population structure by known methods is highly dependent on the number of loci utilised. We performed an analysis of SNPs data available in the public domain from The Single Nucleotide Consortia Ltd. (TSCL). Three populations, Afro-American, Asian and Caucasian, were compared. Estimation of the minimum number of SNPs loci necessary for detection of the population structure was performed. Two clustering approaches, distance-based and model-based, were compared. The model-based approach was superior when compared with the distance-based method. We found more than 65 random SNPs loci are required for identifying distinct geographically separated populations. Increasing the number of markers to over 100 raises the probability of correct assignment of a particular individual to an origin group to over 90%, even with conventional clustering methods.Keywords
This publication has 16 references indexed in Scilit:
- Human Population Genetic Structure and Inference of Group MembershipAmerican Journal of Human Genetics, 2003
- Genetic Structure of Human PopulationsScience, 2002
- Interrogating a High-Density SNP Map for Signatures of Natural SelectionGenome Research, 2002
- Testing for Population Subdivision and Association in Four Case-Control StudiesAmerican Journal of Human Genetics, 2002
- The Structure of Haplotype Blocks in the Human GenomeScience, 2002
- Patterns of Human Diversity, within and among Continents, Inferred from Biallelic DNA PolymorphismsGenome Research, 2002
- The Discovery of Single-Nucleotide Polymorphisms—and Inferences about Human Demographic HistoryAmerican Journal of Human Genetics, 2001
- Blocks of Limited Haplotype Diversity Revealed by High-Resolution Scanning of Human Chromosome 21Science, 2001
- A map of human genome sequence variation containing 1.42 million single nucleotide polymorphismsNature, 2001
- Association Mapping in Structured PopulationsAmerican Journal of Human Genetics, 2000