Gene Sampling Strategies for Multi-Locus Population Estimates of Genetic Diversity (θ)
Open Access
- 17 January 2007
- journal article
- research article
- Published by Public Library of Science (PLoS) in PLOS ONE
- Vol. 2 (1) , e160
- https://doi.org/10.1371/journal.pone.0000160
Abstract
Theoretical work suggests that data from multiple nuclear loci provide better estimates of population genetic parameters than do single loci, but just how many loci are needed and how much sequence is required from each has been little explored. To investigate how much data is required to estimate the population genetic parameter θ (4Neμ) accurately under ideal circumstances, we simulated datasets of DNA sequences under three values of θ per site (0.1, 0.01, 0.001), varying in both the total number of base pairs sequenced per individual and the number of equal-length loci. From these datasets we estimated θ using the maximum likelihood coalescent framework implemented in the computer program Migrate. Our results corroborated the theoretical expectation that increasing the number of loci impacted the accuracy of the estimate more than increasing the sequence length at single loci. However, when the value of θ was low (0.001), the per-locus sequence length was also important for estimating θ accurately, something that has not been emphasized in previous work. Accurate estimation of θ required data from at least 25 independently evolving loci. Beyond this, there was little added benefit in terms of decreasing the squared coefficient of variation of the coalescent estimates relative to the extra effort required to sample more loci.Keywords
This publication has 66 references indexed in Scilit:
- Multilocus Analysis of Introgression Between Two Sympatric Sister Species of Drosophila:Drosophila yakubaandD. santomeaGenetics, 2005
- Phylogeography of sexualHeteronotia binoei(Gekkonidae) in the Australian arid zone: climatic cycling and repetitive hybridizationMolecular Ecology, 2005
- Genealogical trees, coalescent theory and the analysis of genetic polymorphismsNature Reviews Genetics, 2002
- Revealing the demographic histories of species using DNA sequencesTrends in Ecology & Evolution, 2001
- PERSPECTIVE: GENE DIVERGENCE, POPULATION DIVERGENCE, AND THE VARIANCE IN COALESCENCE TIME IN PHYLOGEOGRAPHIC STUDIESEvolution, 2000
- MECHANISMS OF POPULATION DIFFERENTIATION IN MARBLED MURRELETS: HISTORICAL VERSUS CONTEMPORARY PROCESSESEvolution, 2000
- Testing Models of Migration and Isolation Among Populations of Chinook Salmon (Oncorhynchus tschawytscha)Evolution, 1998
- Inferring Phylogenies from mtDNA Variation: Mitochondrial-Gene Trees Versus Nuclear-Gene TreesEvolution, 1995
- Estimating effective population size from samples of sequences: inefficiency of pairwise and segregating sites as compared to phylogenetic estimatesGenetics Research, 1992
- Biometry.Systematic Zoology, 1982