Phylogenetic MCMC Algorithms Are Misleading on Mixtures of Trees
- 30 September 2005
- journal article
- Published by American Association for the Advancement of Science (AAAS) in Science
- Vol. 309 (5744) , 2207-2209
- https://doi.org/10.1126/science.1115493
Abstract
Markov chain Monte Carlo (MCMC) algorithms play a critical role in the Bayesian approach to phylogenetic inference. We present a theoretical analysis of the rate of convergence of many of the widely used Markov chains. For N characters generated from a uniform mixture of two trees, we prove that the Markov chains take an exponentially long (in N) number of iterations to converge to the posterior distribution. Nevertheless, the likelihood plots for sample runs of the Markov chains deceivingly suggest that the chains converge rapidly to a unique tree. Our results rely on novel mathematical understanding of the log-likelihood function on the space of phylogenetic trees. The practical implications of our work are that Bayesian MCMC methods can be misleading when the data are generated from a mixture of trees. Thus, in cases of data containing potentially conflicting phylogenetic signals, phylogenetic reconstruction should be performed separately on each signal.
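The slow-mixing behavior described in the abstract can be illustrated with a toy model. The sketch below is not the paper's construction and involves no phylogenetic trees: it assumes a hypothetical one-dimensional state space with two equally good modes (standing in for the two trees) separated by a valley whose depth grows linearly with the number of characters N. The names `K`, `log_post`, `run_chain`, and `reached_other_mode` are invented for this illustration.

```python
import math
import random

K = 10  # toy state space 0..K; the two ends stand in for the two trees


def log_post(x, n_chars):
    """Toy log-posterior: equal peaks at 0 and K, valley of depth n_chars at K/2."""
    return -n_chars * min(x, K - x) / (K / 2)


def run_chain(n_chars, steps=20000, seed=0):
    """Metropolis walk with +/-1 proposals, started at the left mode."""
    rng = random.Random(seed)
    x = 0
    trace = []
    for _ in range(steps):
        y = x + rng.choice((-1, 1))
        if 0 <= y <= K:  # proposals outside the state space are rejected
            log_ratio = log_post(y, n_chars) - log_post(x, n_chars)
            if log_ratio >= 0 or rng.random() < math.exp(log_ratio):
                x = y
        trace.append(x)
    return trace


def reached_other_mode(trace):
    """True if the chain ever visited the right-hand mode."""
    return K in trace


if __name__ == "__main__":
    for n in (0, 5, 50):
        trace = run_chain(n)
        print(n, reached_other_mode(trace))
```

In this toy setting, a chain with a deep valley stays near its starting mode, so a plot of its log-posterior would look flat and stable even though half of the posterior mass has never been visited. This mirrors, only by analogy, the abstract's point that likelihood plots can suggest convergence when the chain has not mixed.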
This publication has 22 references indexed in Scilit:
- Should phylogenetic models be trying to ‘fit an elephant’? Trends in Genetics, 2005
- Performance of maximum parsimony and likelihood phylogenetics when evolution is heterogeneous. Nature, 2004
- Bayesian Inference of Phylogeny and Its Impact on Evolutionary Biology. Science, 2001
- Phylogenetic Tree Construction Using Markov Chain Monte Carlo. Journal of the American Statistical Association, 2000
- Complexity of the simplest phylogenetic estimation problem. Proceedings Of The Royal Society B-Biological Sciences, 2000
- Markov Chain Monte Carlo Algorithms for the Bayesian Analysis of Phylogenetic Trees. Molecular Biology and Evolution, 1999
- Inconsistency of evolutionary tree topology reconstruction methods when substitution rates vary across characters. Mathematical Biosciences, 1996
- Reconstructing Trees When Sequence Sites Evolve at Variable Rates. Journal of Computational Biology, 1994
- Taxonomy with confidence. Mathematical Biosciences, 1978
- A Probability Model for Inferring Evolutionary Trees. Systematic Zoology, 1973