Phylogenetic Mixture Models Can Reduce Node-Density Artifacts
Open Access
- 1 April 2008
- journal article
- Published by Oxford University Press (OUP) in Systematic Biology
- Vol. 57 (2) , 286-293
- https://doi.org/10.1080/10635150802044045
Abstract
We investigate the performance of phylogenetic mixture models in reducing a well-known and pervasive artifact of phylogenetic inference known as the node-density effect, comparing them to partitioned analyses of the same data. The node-density effect refers to the tendency for the amount of evolutionary change in longer branches of phylogenies to be underestimated compared to that in regions of the tree where there are more nodes and thus branches are typically shorter. Mixture models allow more than one model of sequence evolution to describe the sites in an alignment without prior knowledge of the evolutionary processes that characterize the data or how they correspond to different sites. If multiple evolutionary patterns are common in sequence evolution, mixture models may be capable of reducing node-density effects by characterizing the evolutionary processes more accurately. In gene-sequence alignments simulated to have heterogeneous patterns of evolution, we find that mixture models can reduce node-density effects to negligible levels or remove them altogether, performing as well as partitioned analyses based on the known simulated patterns. The mixture models achieve this without knowledge of the patterns that generated the data and even in some cases without specifying the full or true model of sequence evolution known to underlie the data. The latter result is especially important in real applications, as the true model of evolution is seldom known. We find the same patterns of results for two real data sets with evidence of complex patterns of sequence evolution: mixture models substantially reduced node-density effects and returned better likelihoods compared to partitioning models specifically fitted to these data. 
We suggest that the presence of more than one pattern of evolution in the data is a common source of error in phylogenetic inference and that mixture models can often detect these patterns even without prior knowledge of their presence in the data. Routine use of mixture models alongside other approaches to phylogenetic inference may often reveal hidden or unexpected patterns of sequence evolution and can improve phylogenetic inference.
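The contrast the abstract draws between mixture models and partitioned analyses can be illustrated with a minimal sketch. It assumes the standard formulation in which a mixture model scores each site as a weighted sum of its likelihoods under every component model, whereas a partitioned analysis scores each site only under the model assigned to it in advance. The function names and the per-site likelihood values are hypothetical and chosen for illustration only; this is not the authors' implementation, and in a real analysis the per-site likelihoods would come from a tree-likelihood calculation.

```python
import math


def mixture_log_likelihood(site_likelihoods, weights):
    """Sum over sites of log(sum_j w_j * L_ij); no site-to-model assignment needed."""
    if abs(sum(weights) - 1.0) > 1e-9:
        raise ValueError("mixture weights must sum to 1")
    return sum(
        math.log(sum(w * lik for w, lik in zip(weights, per_model)))
        for per_model in site_likelihoods
    )


def partitioned_log_likelihood(site_likelihoods, assignment):
    """Each site is scored only under the model it was assigned to beforehand."""
    return sum(
        math.log(per_model[k]) for per_model, k in zip(site_likelihoods, assignment)
    )


def site_posteriors(per_model, weights):
    """Posterior probability that one site was generated by each component model."""
    joint = [w * lik for w, lik in zip(weights, per_model)]
    total = sum(joint)
    return [j / total for j in joint]


# Hypothetical likelihoods of four sites under two models (illustrative numbers only).
site_likelihoods = [
    (0.020, 0.001),
    (0.015, 0.002),
    (0.001, 0.030),
    (0.002, 0.025),
]
weights = (0.5, 0.5)        # assumed mixture weights
assignment = (0, 0, 1, 1)   # assumed a priori partition of the sites

mix = mixture_log_likelihood(site_likelihoods, weights)
part = partitioned_log_likelihood(site_likelihoods, assignment)
posteriors = [site_posteriors(row, weights) for row in site_likelihoods]
```

In this sketch the per-site posteriors are what let a mixture model attribute sites to patterns without being told the partition. In a full analysis the weights and model parameters would be estimated from the data rather than fixed as they are here.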