How Robust Are "Isolation with Migration" Analyses to Violations of the IM Model? A Simulation Study
Top Cited Papers
Open Access
- 30 September 2009
- journal article
- research article
- Published by Oxford University Press (OUP) in Molecular Biology and Evolution
- Vol. 27 (2) , 297-310
- https://doi.org/10.1093/molbev/msp233
Abstract
Methods developed over the past decade have made it possible to estimate molecular demographic parameters such as effective population size, divergence time, and gene flow with unprecedented accuracy and precision. However, they make simplifying assumptions about certain aspects of the species’ histories and the nature of the genetic data, and it is not clear how robust they are to violations of these assumptions. Here, we use simulated data sets to examine the effects of a number of violations of the “Isolation with Migration” (IM) model, including intralocus recombination, population structure, gene flow from an unsampled species, linkage among loci, and divergent selection, on demographic parameter estimates made using the program IMA. We also examine the effect of having data that fit a nucleotide substitution model other than the two relatively simple models available in IMA. We find that IMA estimates are generally quite robust to small to moderate violations of the IM model assumptions, comparable with what is often encountered in real-world scenarios. In particular, population structure within species, a condition encountered to some degree in virtually all species, has little effect on parameter estimates even for fairly high levels of structure. Likewise, most parameter estimates are robust to significant levels of recombination when data sets are pared down to apparently nonrecombining blocks, although substantial bias is introduced to several estimates when the entire data set with recombination is included. In contrast, a poor fit to the nucleotide substitution model can result in an increased error rate, in some cases due to a predictable bias and in other cases due to an increase in variance in parameter estimates among data sets simulated under the same conditions.Keywords
This publication has 93 references indexed in Scilit:
- Genomic Patterns of Adaptive Divergence between Chromosomally Differentiated Sunflower SpeciesMolecular Biology and Evolution, 2009
- In situ genetic differentiation in a Hispaniolan lizard (Ameiva chrysolaema): A multilocus perspectiveMolecular Phylogenetics and Evolution, 2008
- The genic view of plant speciation: recent progress and emerging questionsPhilosophical Transactions Of The Royal Society B-Biological Sciences, 2008
- The genomic and epidemiological dynamics of human influenza A virusNature, 2008
- Recent divergence with gene flow in Tennessee cave salamanders (Plethodontidae:Gyrinophilus) inferred from gene genealogiesMolecular Ecology, 2008
- Surprising migration and population size dynamics in ancient Iberian brown bears ( Ursus arctos )Proceedings of the National Academy of Sciences, 2008
- DNA evidence for historic population size and past ecosystem impacts of gray whalesProceedings of the National Academy of Sciences, 2007
- A new approach to estimate parameters of speciation models with application to apesGenome Research, 2007
- Integration within the Felsenstein equation for improved Markov chain Monte Carlo methods in population geneticsProceedings of the National Academy of Sciences, 2007
- Adaptive protein evolution at the Adh locus in DrosophilaNature, 1991