Phylogenetic MCMC Algorithms Are Misleading on Mixtures of Trees

30 September 2005

journal article
other
Published by American Association for the Advancement of Science (AAAS) in Science

Vol. 309 (5744) , 2207-2209
https://doi.org/10.1126/science.1115493

Abstract

Markov chain Monte Carlo (MCMC) algorithms play a critical role in the Bayesian approach to phylogenetic inference. We present a theoretical analysis of the rate of convergence of many of the widely used Markov chains. For N characters generated from a uniform mixture of two trees, we prove that the Markov chains take an exponentially long (in N) number of iterations to converge to the posterior distribution. Nevertheless, the likelihood plots for sample runs of the Markov chains deceivingly suggest that the chains converge rapidly to a unique tree. Our results rely on novel mathematical understanding of the log-likelihood function on the space of phylogenetic trees. The practical implications of our work are that Bayesian MCMC methods can be misleading when the data are generated from a mixture of trees. Thus, in cases of data containing potentially conflicting phylogenetic signals, phylogenetic reconstruction should be performed separately on each signal.

Keywords

This publication has 22 references indexed in Scilit:

Should phylogenetic models be trying to ‘fit an elephant’?
Trends in Genetics, 2005
Performance of maximum parsimony and likelihood phylogenetics when evolution is heterogeneous
Nature, 2004
Bayesian Inference of Phylogeny and Its Impact on Evolutionary Biology
Science, 2001
Phylogenetic Tree Construction Using Markov Chain Monte Carlo
Journal of the American Statistical Association, 2000
Complexity of the simplest phylogenetic estimation problem
Proceedings Of The Royal Society B-Biological Sciences, 2000
Markov Chasin Monte Carlo Algorithms for the Bayesian Analysis of Phylogenetic Trees
Molecular Biology and Evolution, 1999
Inconsistency of evolutionary tree topology reconstruction methods when substitution rates vary across characters
Mathematical Biosciences, 1996
Reconstructing Trees When Sequence Sites Evolve at Variable Rates
Journal of Computational Biology, 1994
Taxonomy with confidence
Mathematical Biosciences, 1978
A Probability Model for Inferring Evolutionary Trees
Systematic Zoology, 1973