Reconstituting the Frequency Spectrum of Ascertained Single-Nucleotide Polymorphism Data
Open Access
- 1 December 2004
- journal article
- Published by Oxford University Press (OUP) in Genetics
- Vol. 168 (4) , 2373-2382
- https://doi.org/10.1534/genetics.104.031039
Abstract
Most of the available SNP data have eluded valid population genetic analysis because most population genetical methods do not correctly accommodate the special discovery process used to identify SNPs. Most of the available SNP data have allele frequency distributions that are biased by the ascertainment protocol. We here show how this problem can be corrected by obtaining maximum-likelihood estimates of the true allele frequency distribution. In simple cases, the ML estimate of the true allele frequency distribution can be obtained analytically, but in other cases computational methods based on numerical optimization or the EM algorithm must be used. We illustrate the new correction method by analyzing some previously published SNP data from the SNP Consortium. Appropriate treatment of SNP ascertainment is vital to our ability to make correct inferences from the data of the International HapMap Project.Keywords
This publication has 23 references indexed in Scilit:
- A 3.9-Centimorgan-Resolution Human Single-Nucleotide Polymorphism Linkage Map and Screening SetAmerican Journal of Human Genetics, 2003
- Correcting for ascertainment biases when analyzing SNP data: applications to the estimation of linkage disequilibriumTheoretical Population Biology, 2003
- The application of molecular genetic approaches to the study of human evolutionNature Genetics, 2003
- Interrogating a High-Density SNP Map for Signatures of Natural SelectionGenome Research, 2002
- Detecting recent positive selection in the human genome from haplotype structureNature, 2002
- The Discovery of Single-Nucleotide Polymorphisms—and Inferences about Human Demographic HistoryAmerican Journal of Human Genetics, 2001
- Haplotype Variation and Linkage Disequilibrium in 313 Human GenesScience, 2001
- SNP frequencies in human genes: an excess of rare alleles and differing modes of selectionTrends in Genetics, 2000
- Large-Scale Identification, Mapping, and Genotyping of Single-Nucleotide Polymorphisms in the Human GenomeScience, 1998
- The coalescentStochastic Processes and their Applications, 1982