Computing Bayes Factors Using Thermodynamic Integration
Top Cited Papers
Open Access
- 1 April 2006
- journal article
- Published by Oxford University Press (OUP) in Systematic Biology
- Vol. 55 (2) , 195-207
- https://doi.org/10.1080/10635150500433722
Abstract
In the Bayesian paradigm, a common method for comparing two models is to compute the Bayes factor, defined as the ratio of their respective marginal likelihoods. In recent phylogenetic works, the numerical evaluation of marginal likelihoods has often been performed using the harmonic mean estimation procedure. In the present article, we propose to employ another method, based on an analogy with statistical physics, called thermodynamic integration. We describe the method, propose an implementation, and show on two analytical examples that this numerical method yields reliable estimates. In contrast, the harmonic mean estimator leads to a strong overestimation of the marginal likelihood, which is all the more pronounced as the model is higher dimensional. As a result, the harmonic mean estimator systematically favors more parameter-rich models, an artefact that might explain some recent puzzling observations, based on harmonic mean estimates, suggesting that Bayes factors tend to overscore complex models. Finally, we apply our method to the comparison of several alternative models of amino-acid replacement. We confirm our previous observations, indicating that modeling pattern heterogeneity across sites tends to yield better models than standard empirical matrices.Keywords
This publication has 49 references indexed in Scilit:
- Multigene Analyses of Bilaterian Animals Corroborate the Monophyly of Ecdysozoa, Lophotrochozoa, and ProtostomiaMolecular Biology and Evolution, 2005
- The Intrinsic Bayes Factor for Model Selection and PredictionJournal of the American Statistical Association, 1996
- Reversible jump Markov chain Monte Carlo computation and Bayesian model determinationBiometrika, 1995
- Marginal Likelihood from the Gibbs OutputJournal of the American Statistical Association, 1995
- Bayes FactorsJournal of the American Statistical Association, 1995
- Computing Bayes Factors Using a Generalization of the Savage-Dickey Density RatioJournal of the American Statistical Association, 1995
- CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choiceNucleic Acids Research, 1994
- [Practical Markov Chain Monte Carlo]: Comment: One Long Run with Diagnostics: Implementation Strategies for Markov Chain Monte CarloStatistical Science, 1992
- Evolutionary trees from DNA sequences: A maximum likelihood approachJournal of Molecular Evolution, 1981
- Estimating the Dimension of a ModelThe Annals of Statistics, 1978