THE AUTOMATION AND EVALUATION OF NESTED CLADE PHYLOGEOGRAPHIC ANALYSIS
- 1 June 2007
- Vol. 61 (6) , 1466-1480
- https://doi.org/10.1111/j.1558-5646.2007.00124.x
Abstract
Nested clade phylogeographic analysis (NCPA) is a popular method for reconstructing the demographic history of spatially distributed populations from genetic data. Although some parts of the analysis are automated, there is no unique and widely followed algorithm for doing this in its entirety, beginning with the data, and ending with the inferences drawn from the data. This article describes a method that automates NCPA, thereby providing a framework for replicating analyses in an objective way. To do so, a number of decisions need to be made so that the automated implementation is representative of previous analyses. We review how the NCPA procedure has evolved since its inception and conclude that there is scope for some variability in the manual application of NCPA. We apply the automated software to three published datasets previously analyzed manually and replicate many details of the manual analyses, suggesting that the current algorithm is representative of how a typical user will perform NCPA. We simulate a large number of replicate datasets for geographically distributed, but entirely random-mating, populations. These are then analyzed using the automated NCPA algorithm. Results indicate that NCPA tends to give a high frequency of false positives. In our simulations we observe that 14% of the clades give a conclusive inference that a demographic event has occurred, and that 75% of the datasets have at least one clade that gives such an inference. This is mainly due to the generation of multiple statistics per clade, of which only one is required to be significant to apply the inference key. We survey the inferences that have been made in recent publications and show that the most commonly inferred processes (restricted gene flow with isolation by distance and contiguous range expansion) are those that are commonly inferred in our simulations. However, published datasets typically yield a richer set of inferences with NCPA than obtained in our random-mating simulations, and further testing of NCPA with models of structured populations is necessary to examine its accuracy.Keywords
This publication has 49 references indexed in Scilit:
- Nested clade analysis statisticsMolecular Ecology Notes, 2006
- Species status and phylogeography of two closely related Coptolabrus species (Coleoptera: Carabidae) in South Korea inferred from mitochondrial and nuclear gene sequencesMolecular Ecology, 2005
- ABDOMINAL PIGMENTATION VARIATION IN DROSOPHILA POLYMORPHA: GEOGRAPHIC VARIATION IN THE TRAIT, AND UNDERLYING PHYLOGEOGRAPHYEvolution, 2005
- Tree ScanningGenetics, 2005
- LINEAR HABITATS AND THE NESTED CLADE ANALYSIS: AN EMPIRICAL EVALUATION OF GEOGRAPHIC VERSUS RIVER DISTANCES USING AN OZARK CRAYFISH (DECAPODA: CAMBARIDAE)Evolution, 2003
- The study of structured populations — new hope for a difficult and divided scienceNature Reviews Genetics, 2003
- PHYLOGEOGRAPHIC HISTORY OF THE LAND SNAIL CANDIDULA UNIFASCIATA (HELICELLINAE, STYLOMMATOPHORA): FRAGMENTATION, CORRIDOR MIGRATION, AND SECONDARY CONTACTEvolution, 2002
- Out of Africa again and againNature, 2002
- Uses of Evolutionary Theory in the Human Genome ProjectAnnual Review of Ecology and Systematics, 1999
- Mitochondrial DNA and human evolutionNature, 1987