Phylogenetic Mixture Models Can Reduce Node-Density Artifacts

Open Access

1 April 2008

journal article
Published by Oxford University Press (OUP) in Systematic Biology

Vol. 57 (2) , 286-293
https://doi.org/10.1080/10635150802044045

Abstract

We investigate the performance of phylogenetic mixture models in reducing a well-known and pervasive artifact of phylogenetic inference known as the node-density effect, comparing them to partitioned analyses of the same data. The node-density effect refers to the tendency for the amount of evolutionary change in longer branches of phylogenies to be underestimated compared to that in regions of the tree where there are more nodes and thus branches are typically shorter. Mixture models allow more than one model of sequence evolution to describe the sites in an alignment without prior knowledge of the evolutionary processes that characterize the data or how they correspond to different sites. If multiple evolutionary patterns are common in sequence evolution, mixture models may be capable of reducing node-density effects by characterizing the evolutionary processes more accurately. In gene-sequence alignments simulated to have heterogeneous patterns of evolution, we find that mixture models can reduce node-density effects to negligible levels or remove them altogether, performing as well as partitioned analyses based on the known simulated patterns. The mixture models achieve this without knowledge of the patterns that generated the data and even in some cases without specifying the full or true model of sequence evolution known to underlie the data. The latter result is especially important in real applications, as the true model of evolution is seldom known. We find the same patterns of results for two real data sets with evidence of complex patterns of sequence evolution: mixture models substantially reduced node-density effects and returned better likelihoods compared to partitioning models specifically fitted to these data. We suggest that the presence of more than one pattern of evolution in the data is a common source of error in phylogenetic inference and that mixture models can often detect these patterns even without prior knowledge of their presence in the data. Routine use of mixture models alongside other approaches to phylogenetic inference may often reveal hidden or unexpected patterns of sequence evolution and can improve phylogenetic inference.

Keywords

This publication has 28 references indexed in Scilit:

MODEL MISSPECIFICATION NOT THE NODE-DENSITY ARTIFACT
Evolution, 2008
THE LIKELIHOOD NODE DENSITY EFFECT AND CONSEQUENCES FOR EVOLUTIONARY STUDIES OF MOLECULAR RATES
Evolution, 2007
Incorporating Molecular Evolution into Phylogenetic Analysis, and a New Compilation of Conserved Polymerase Chain Reaction Primers for Animal Mitochondrial DNA
Annual Review of Ecology, Evolution, and Systematics, 2006
Variation in Evolutionary Processes at Different Codon Positions
Molecular Biology and Evolution, 2006
Comment on "Phylogenetic MCMC Algorithms Are Misleading on Mixtures of Trees"
Science, 2006
Heterotachy and Tree Building: A Case Study with Plastids and Eubacteria
Molecular Biology and Evolution, 2005
A Bayesian Mixture Model for Across-Site Heterogeneities in the Amino-Acid Replacement Process
Molecular Biology and Evolution, 2004
Molecular Phylogenies Link Rates of Evolution and Speciation
Science, 2003
Inferring evolutionary processes from phylogenies
Zoologica Scripta, 1997
Reversible jump Markov chain Monte Carlo computation and Bayesian model determination
Biometrika, 1995