Conjugate Gibbs Sampling for Bayesian Phylogenetic Models
- 1 December 2006
- journal article
- research article
- Published by Mary Ann Liebert Inc in Journal of Computational Biology
- Vol. 13 (10) , 1701-1722
- https://doi.org/10.1089/cmb.2006.13.1701
Abstract
We propose a new Markov chain Monte Carlo (MCMC) sampling mechanism for Bayesian phylogenetic inference. This method, which we call conjugate Gibbs, relies on analytical conjugacy properties and is based on an alternation between data augmentation and Gibbs sampling. The data augmentation step consists of sampling a detailed substitution history for each site, across the whole tree, given the current value of the model parameters. Provided convenient priors are used, the parameters of the model can then be directly updated by a Gibbs sampling procedure, conditional on the current substitution history. Alternating between these two sampling steps yields an MCMC device whose equilibrium distribution is the posterior probability density of interest. We show, on real examples, that this conjugate Gibbs method leads to a significant improvement in the mixing behavior of the MCMC. In all cases, the decorrelation times of the resulting chains are smaller than those obtained by standard Metropolis-Hastings procedures by at least one order of magnitude. The method is particularly well suited to heterogeneous models, i.e., models assuming site-specific random variables. In particular, the conjugate Gibbs formalism allows one to propose efficient implementations of complex models, for instance models assuming site-specific substitution processes, that would not be accessible to standard MCMC methods.
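The alternation described in the abstract can be illustrated with a deliberately small toy model, which is an assumption of this sketch and not the implementation from the article. Here each site evolves along a single branch of length `t` under a two-state symmetric process whose jumps follow a Poisson process with an unknown rate, and only the endpoint comparison (same or different) is observed. The data augmentation step samples the number of jumps per site given the current rate; a Gamma(`a`, `b`) prior then makes the rate update a direct Gibbs draw. All function names and default values are hypothetical.

```python
import math
import random


def sample_poisson(lam, rng):
    """Draw from Poisson(lam) with Knuth's multiplication method (fine for small lam)."""
    threshold = math.exp(-lam)
    k, p = 0, rng.random()
    while p > threshold:
        k += 1
        p *= rng.random()
    return k


def sample_jump_count(lam, differs, rng):
    """Data augmentation for one site: Poisson(lam) conditioned on parity.

    In a two-state symmetric model the endpoints differ exactly when the
    number of jumps is odd, so we reject draws with the wrong parity.
    """
    wanted_parity = 1 if differs else 0
    while True:
        n = sample_poisson(lam, rng)
        if n % 2 == wanted_parity:
            return n


def conjugate_gibbs(differs, t, a=1.0, b=1.0, n_iter=2000, burn_in=500, seed=0):
    """Alternate data augmentation and a conjugate Gibbs update of the rate."""
    rng = random.Random(seed)
    rate = 1.0  # arbitrary starting value (assumption)
    draws = []
    for it in range(n_iter):
        # Step 1: sample a substitution history (here, just jump counts).
        total_jumps = sum(sample_jump_count(rate * t, d, rng) for d in differs)
        # Step 2: the Gamma prior is conjugate to the Poisson counts, so
        # rate | history ~ Gamma(shape=a + total_jumps, rate=b + N * t).
        # random.gammavariate takes a scale parameter, hence the inverse.
        rate = rng.gammavariate(a + total_jumps, 1.0 / (b + len(differs) * t))
        if it >= burn_in:
            draws.append(rate)
    return draws


if __name__ == "__main__":
    # Hypothetical data: 30 of 100 sites differ between the two endpoints.
    observed = [True] * 30 + [False] * 70
    samples = conjugate_gibbs(observed, t=1.0)
    print("posterior mean rate:", sum(samples) / len(samples))
```

No Metropolis-Hastings accept/reject step is needed for the rate in this sketch, because its conditional distribution given the augmented history is available in closed form. A full phylogenetic model would augment with complete substitution histories over every branch of the tree rather than a single jump count per site.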