Microsatellites Are Molecular Clocks That Support Accurate Inferences about History
- 12 February 2009
- journal article
- review article
- Published by Oxford University Press (OUP) in Molecular Biology and Evolution
- Vol. 26 (5) , 1017-1027
- https://doi.org/10.1093/molbev/msp025
Abstract
Microsatellite length mutations are often modeled using the generalized stepwise mutation process, which is a type of random walk. If this model is sufficiently accurate, one can estimate the coalescence time between alleles of a locus after a mathematical transformation of the allele lengths. When large-scale microsatellite genotyping first became possible, there was substantial interest in using this approach to make inferences about time and demography, but that interest has waned because it has not been possible to empirically validate the clock by comparing it with data in which the mutation process is well understood. We analyzed data from 783 microsatellite loci in human populations and 292 loci in chimpanzee populations, and compared them with up to one gigabase of aligned sequence data, where the molecular clock based upon nucleotide substitutions is believed to be reliable. We empirically demonstrate a remarkable linearity (r(2) > 0.95) between the microsatellite average square distance statistic and sequence divergence. We demonstrate that microsatellites are accurate molecular clocks for coalescent times of at least 2 million years (My). We apply this insight to confirm that the African populations San, Biaka Pygmy, and Mbuti Pygmy have the deepest coalescent times among populations in the Human Genome Diversity Project. Furthermore, we show that microsatellites support unbiased estimates of population differentiation (F(ST)) that are less subject to ascertainment bias than single nucleotide polymorphism (SNP) F(ST). These results raise the prospect of using microsatellite data sets to determine parameters of population history. When genotyped along with SNPs, microsatellite data can also be used to correct for SNP ascertainment bias.Keywords
This publication has 67 references indexed in Scilit:
- Accelerated genetic drift on chromosome X during the human dispersal out of AfricaNature Genetics, 2008
- ADZE: a rarefaction approach for counting alleles private to combinations of populationsBioinformatics, 2008
- Population differentiation and migration: Coalescence times in a two-sex island model for autosomal and X-linked lociPublished by Elsevier ,2008
- Worldwide Human Relationships Inferred from Genome-Wide Patterns of VariationScience, 2008
- Measurement of the human allele frequency spectrum demonstrates greater genetic drift in East Asians than in EuropeansNature Genetics, 2007
- Genetic Structure of Chimpanzee PopulationsPLoS Genetics, 2007
- A worldwide survey of haplotype variation and linkage disequilibrium in the human genomeNature Genetics, 2006
- Standardized Subsets of the HGDP‐CEPH Human Genome Diversity Cell Line Panel, Accounting for Atypical and Duplicated Samples and Pairs of Close RelativesAnnals of Human Genetics, 2006
- Microsatellites: simple sequences with complex evolutionNature Reviews Genetics, 2004
- Initial sequencing and analysis of the human genomeNature, 2001