Using Classical Population Genetics Tools with Heterochroneous Data: Time Matters!
Open Access
- 14 May 2009
- journal article
- research article
- Published by Public Library of Science (PLoS) in PLOS ONE
- Vol. 4 (5) , e5541
- https://doi.org/10.1371/journal.pone.0005541
Abstract
New polymorphism datasets from heterochroneous data have arisen thanks to recent advances in experimental and microbial molecular evolution, and the sequencing of ancient DNA (aDNA). However, classical tools for population genetics analyses do not take into account heterochrony between subsets, despite potential bias on neutrality and population structure tests. Here, we characterize the extent of such possible biases using serial coalescent simulations. We first use a coalescent framework to generate datasets assuming no or different levels of heterochrony and contrast most classical population genetic statistics. We show that even weak levels of heterochrony (∼10% of the average depth of a standard population tree) affect the distribution of polymorphism substantially, leading to overestimate the level of polymorphism θ, to star like trees, with an excess of rare mutations and a deficit of linkage disequilibrium, which are the hallmark of e.g. population expansion (possibly after a drastic bottleneck). Substantial departures of the tests are detected in the opposite direction for more heterochroneous and equilibrated datasets, with balanced trees mimicking in particular population contraction, balancing selection, and population differentiation. We therefore introduce simple corrections to classical estimators of polymorphism and of the genetic distance between populations, in order to remove heterochrony-driven bias. Finally, we show that these effects do occur on real aDNA datasets, taking advantage of the currently available sequence data for Cave Bears (Ursus spelaeus), for which large mtDNA haplotypes have been reported over a substantial time period (22–130 thousand years ago (KYA)). Considering serial sampling changed the conclusion of several tests, indicating that neglecting heterochrony could provide significant support for false past history of populations and inappropriate conservation decisions. We therefore argue for systematically considering heterochroneous models when analyzing heterochroneous samples covering a large time scale.Keywords
This publication has 59 references indexed in Scilit:
- Summary statistics of neutral mutations in longitudinal DNA samplesTheoretical Population Biology, 2008
- Surprising migration and population size dynamics in ancient Iberian brown bears ( Ursus arctos )Proceedings of the National Academy of Sciences, 2008
- Barking up the wrong tree: Modern northern European dogs fail to explain their originBMC Ecology and Evolution, 2008
- Elevated substitution rates estimated from ancient DNA sequencesBiology Letters, 2007
- Ancient Biomolecules from Deep Ice Cores Reveal a Forested Southern GreenlandScience, 2007
- Typing single polymorphic nucleotides in mitochondrial DNA as a way to access Middle Pleistocene DNABiology Letters, 2006
- Serial coalescent simulations suggest a weak genealogical relationship between Etruscans and modern TuscansProceedings of the National Academy of Sciences, 2006
- Bayesian Estimation of the Timing and Severity of a Population Bottleneck from Ancient DNAPLoS Genetics, 2006
- Genetic Response to Climatic Change: Insights from Ancient DNA and PhylochronologyPLoS Biology, 2004
- No Evidence of Neandertal mtDNA Contribution to Early Modern HumansPLoS Biology, 2004