Improving Marginal Likelihood Estimation for Bayesian Phylogenetic Model Selection
Top Cited Papers
Open Access
- 27 December 2010
- journal article
- research article
- Published by Oxford University Press (OUP) in Systematic Biology
- Vol. 60 (2) , 150-160
- https://doi.org/10.1093/sysbio/syq085
Abstract
The marginal likelihood is commonly used for comparing different evolutionary models in Bayesian phylogenetics and is the central quantity used in computing Bayes Factors for comparing model fit. A popular method for estimating marginal likelihoods, the harmonic mean (HM) method, can be easily computed from the output of a Markov chain Monte Carlo analysis but often greatly overestimates the marginal likelihood. The thermodynamic integration (TI) method is much more accurate than the HM method but requires more computation. In this paper, we introduce a new method, steppingstone sampling (SS), which uses importance sampling to estimate each ratio in a series (the “stepping stones”) bridging the posterior and prior distributions. We compare the performance of the SS approach to the TI and HM methods in simulation and using real data. We conclude that the greatly increased accuracy of the SS and TI methods argues for their use instead of the HM method, despite the extra computation needed.Keywords
This publication has 27 references indexed in Scilit:
- Choosing among Partition Models in Bayesian PhylogeneticsMolecular Biology and Evolution, 2010
- Phylogeny and biogeography of bees of the tribe Osmiini (Hymenoptera: Megachilidae)Molecular Phylogenetics and Evolution, 2008
- Marginal Likelihood Estimation via Power PosteriorsJournal of the Royal Statistical Society Series B: Statistical Methodology, 2008
- A General Comparison of Relaxed Molecular Clock ModelsMolecular Biology and Evolution, 2007
- Bayesian Phylogenetic Model Selection Using Reversible Jump Markov Chain Monte CarloMolecular Biology and Evolution, 2004
- Phylogenetic Tree Construction Using Markov Chain Monte CarloJournal of the American Statistical Association, 2000
- Simulating normalizing constants: from importance sampling to bridge sampling to path samplingStatistical Science, 1998
- Estimating the Dimension of a ModelThe Annals of Statistics, 1978
- A new look at the statistical model identificationIEEE Transactions on Automatic Control, 1974
- The Large-Sample Distribution of the Likelihood Ratio for Testing Composite HypothesesThe Annals of Mathematical Statistics, 1938