Modelling heterotachy in phylogenetic inference by reversible-jump Markov chain Monte Carlo
Open Access
- 7 October 2008
- journal article
- Published by The Royal Society in Philosophical Transactions of the Royal Society B: Biological Sciences
- Vol. 363 (1512) , 3955-3964
- https://doi.org/10.1098/rstb.2008.0178
Abstract
The rate at which a given site in a gene sequence alignment evolves over time may vary. This phenomenon, known as heterotachy, can bias or distort phylogenetic trees inferred from models of sequence evolution that assume rates of evolution are constant. Here, we describe a phylogenetic mixture model designed to accommodate heterotachy. The method sums the likelihood of the data at each site over more than one set of branch lengths on the same tree topology. A branch-length set that is best for one site may differ from the branch-length set that is best for some other site, thereby allowing different sites to have different rates of change throughout the tree. Because rate variation may not be present in all branches, we use a reversible-jump Markov chain Monte Carlo algorithm to identify those branches in which reliable amounts of heterotachy occur. We implement the method in combination with our ‘pattern-heterogeneity’ mixture model, applying it to simulated data and five published datasets. We find that complex evolutionary signals of heterotachy are routinely present over and above variation in the rate or pattern of evolution across sites, that the reversible-jump method requires far fewer parameters than conventional mixture models to describe these signals, and that it serves to identify the regions of the tree in which heterotachy is most pronounced. The reversible-jump procedure also removes the need for a posteriori tests of ‘significance’ such as the Akaike or Bayesian information criterion tests, or Bayes factors. Heterotachy has important consequences for the correct reconstruction of phylogenies as well as for tests of hypotheses that rely on accurate branch-length information. These include molecular clocks, analyses of tempo and mode of evolution, comparative studies and ancestral state reconstruction. The model is available from the authors' website, and can be used for the analysis of both nucleotide and morphological data.
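The abstract's central idea, summing each site's likelihood over several branch-length sets on one topology, can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes the per-site likelihoods under each branch-length set have already been computed elsewhere, and that the sets are combined with mixture weights that sum to 1, as is usual for mixture models; the abstract does not state the weighting. The names `mixture_log_likelihood` and `best_set_per_site` are hypothetical.

```python
import math


def mixture_log_likelihood(site_likelihoods, weights):
    """Total log-likelihood under a mixture of branch-length sets.

    site_likelihoods[i][k] is the likelihood of site i on the shared
    topology using branch-length set k (assumed precomputed elsewhere).
    weights[k] is the assumed mixture weight of set k; weights sum to 1.
    """
    if not math.isclose(sum(weights), 1.0, abs_tol=1e-9):
        raise ValueError("mixture weights must sum to 1")
    total = 0.0
    for per_set in site_likelihoods:
        if len(per_set) != len(weights):
            raise ValueError("each site needs one likelihood per branch-length set")
        # Sum the site's likelihood over all branch-length sets.
        site_likelihood = sum(w * lik for w, lik in zip(weights, per_set))
        # Sites are treated as independent, so log-likelihoods add.
        total += math.log(site_likelihood)
    return total


def best_set_per_site(site_likelihoods):
    """Index of the branch-length set with the highest likelihood at each site."""
    return [max(range(len(per_set)), key=per_set.__getitem__)
            for per_set in site_likelihoods]


# Illustrative numbers only: two sites, two branch-length sets.
example = [[0.02, 0.001], [0.0005, 0.03]]
log_lik = mixture_log_likelihood(example, [0.5, 0.5])
preferred = best_set_per_site(example)
```

In this toy input the first site is better explained by the first branch-length set and the second site by the second, which is the situation the mixture is meant to accommodate.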