Branch-Length Prior Influences Bayesian Posterior Probability of Phylogeny
Open Access
- 1 June 2005
- journal article
- research article
- Published by Oxford University Press (OUP) in Systematic Biology
- Vol. 54 (3) , 455-470
- https://doi.org/10.1080/10635150590945313
Abstract
The Bayesian method for estimating species phylogenies from molecular sequence data provides an attractive alternative to maximum likelihood with nonparametric bootstrap due to the easy interpretation of posterior probabilities for trees and to availability of efficient computational algorithms. However, for many data sets it produces extremely high posterior probabilities, sometimes for apparently incorrect clades. Here we use both computer simulation and empirical data analysis to examine the effect of the prior model for internal branch lengths. We found that posterior probabilities for trees and clades are sensitive to the prior for internal branch lengths, and priors assuming long internal branches cause high posterior probabilities for trees. In particular, uniform priors with high upper bounds bias Bayesian clade probabilities in favor of extreme values. We discuss possible remedies to the problem, including empirical and full Bayesian methods and subjective procedures suggested in Bayesian hypothesis testing. Our results also suggest that the bootstrap proportion and Bayesian posterior probability are different measures of accuracy, and that the bootstrap proportion, if interpreted as the probability that the clade is true, can be either too liberal or too conservative.Keywords
This publication has 44 references indexed in Scilit:
- How Meaningful Are Bayesian Support Values?Molecular Biology and Evolution, 2004
- Comparison of Bayesian and Maximum Likelihood Bootstrap Measures of Phylogenetic ReliabilityMolecular Biology and Evolution, 2003
- Resolution of the Early Placental Mammal Radiation Using Bayesian PhylogeneticsScience, 2001
- Phylogenetic Tree Construction Using Markov Chain Monte CarloJournal of the American Statistical Association, 2000
- Complexity of the simplest phylogenetic estimation problemProceedings Of The Royal Society B-Biological Sciences, 2000
- Bootstrapping phylogenies: Large deviations and dispersion effectsBiometrika, 1996
- Lindley's ParadoxJournal of the American Statistical Association, 1982
- Evolutionary trees from DNA sequences: A maximum likelihood approachJournal of Molecular Evolution, 1981
- Bootstrap Methods: Another Look at the JackknifeThe Annals of Statistics, 1979
- A comment on D. V. Lindley's statistical paradoxBiometrika, 1957