Population genetic inference using a fixed number of segregating sites: a reassessment

1 February 2007

journal article
research article
Published by Hindawi Limited in Genetics Research

Vol. 89 (4) , 231-244
https://doi.org/10.1017/s0016672307008877

Abstract

Coalescent theory is commonly used to perform population genetic inference at the nucleotide level. Here, we examine the procedure that fixes the number of segregating sites (henceforth theFSprocedure). In this approach a fixed number of segregating sites (S) are placed on a coalescent tree (independently of the total and internode lengths of the tree). Thus, although widely used, theFSprocedure does not strictly follow the assumptions of coalescent theory and must be considered an approximation of (i) the standard procedure that uses a fixed population mutation parameter θ, and (ii) procedures that condition on the number of segregating sites. We study the differences in the false positive rate for nine statistics by comparing theFSprocedure with the procedures (i) and (ii), using several evolutionary models with single-locus and multilocus data. Our results indicate that for single-locus data theFSprocedure is accurate for the equilibrium neutral model, but problems arise under the alternative models studied; furthermore, for multilocus data, theFSprocedure becomes inaccurate even for the standard neutral model. Therefore, we recommend a procedure that fixes the θ value (or alternatively, procedures that condition onSand take into account the uncertainty of θ) for analysing evolutionary models with multilocus data. With single-locus data, theFSprocedure should not be employed for models other than the standard neutral model.

Keywords

This publication has 26 references indexed in Scilit:

The Effects of Artificial Selection on the Maize Genome
Science, 2005
Detecting Selective Sweeps with Haplotype Tests
Published by Springer Nature ,2005
Demography and Natural Selection Have Shaped Genetic Variation in Drosophila melanogaster: A Multi-locus Approach
Genetics, 2003
Coalescent Simulations and Statistical Tests of Neutrality
Molecular Biology and Evolution, 2001
Haplotype Tests Using Coalescent Simulations Conditional on the Number of Segregating Sites
Molecular Biology and Evolution, 2001
Recombination and the power of statistical tests of neutrality
Genetics Research, 1999
Neutrality tests based on the distribution of haplotypes under an infinite-site model
Molecular Biology and Evolution, 1998
The effect of strongly selected substitutions on neutral polymorphism: Analytical results based on diffusion theory
Theoretical Population Biology, 1992
On the number of segregating sites in genetical models without recombination
Theoretical Population Biology, 1975
ISOLATION BY DISTANCE
Genetics, 1943