A Phylogenetic Mixture Model for Detecting Pattern-Heterogeneity in Gene Sequence or Character-State Data
Top Cited Papers
Open Access
- 1 August 2004
- journal article
- Published by Oxford University Press (OUP) in Systematic Biology
- Vol. 53 (4) , 571-581
- https://doi.org/10.1080/10635150490468675
Abstract
We describe a general likelihood-based ‘mixture model’ for inferring phylogenetic trees from gene-sequence or other character-state data. The model accommodates cases in which different sites in the alignment evolve in qualitatively distinct ways, but does not require prior knowledge of these patterns or partitioning of the data. We call this qualitative variability in the pattern of evolution across sites “pattern-heterogeneity” to distinguish it from both a homogenous process of evolution and from one characterized principally by differences in rates of evolution. We present studies to show that the model correctly retrieves the signals of pattern-heterogeneity from simulated gene-sequence data, and we apply the method to protein-coding genes and to a ribosomal 12S data set. The mixture model outperforms conventional partitioning in both these data sets. We implement the mixture model such that it can simultaneously detect rate- and pattern-heterogeneity. The model simplifies to a homogeneous model or a rate-variability model as special cases, and therefore always performs at least as well as these two approaches, and often considerably improves upon them. We make the model available within a Bayesian Markov-chain Monte Carlo framework for phylogenetic inference, as an easy-to-use computer program.Keywords
This publication has 29 references indexed in Scilit:
- Bayesian Phylogenetics Using an RNA Substitution Model Applied to Early Mammalian EvolutionMolecular Biology and Evolution, 2002
- Bayesian Inference of Phylogeny and Its Impact on Evolutionary BiologyScience, 2001
- Variation in the Pattern of Nucleotide Substitution Across SitesJournal of Molecular Evolution, 1999
- Models of natural mutations including site heterogeneityProteins-Structure Function and Bioinformatics, 1998
- Compensatory neutral mutations and the evolution of RNA.Genetica, 1998
- Conserved sequence motifs, alignment, and secondary structure for the third domain of animal 12S rRNAMolecular Biology and Evolution, 1996
- Bayesian Data AnalysisPublished by Taylor & Francis ,1995
- Practical Markov Chain Monte CarloStatistical Science, 1992
- Ribosomal DNA: Molecular Evolution and Phylogenetic InferenceThe Quarterly Review of Biology, 1991
- Evolutionary trees from DNA sequences: A maximum likelihood approachJournal of Molecular Evolution, 1981