Evaluation of the models handling heterotachy in phylogenetic inference
Open Access
- 1 November 2007
- journal article
- research article
- Published by Springer Nature in BMC Ecology and Evolution
- Vol. 7 (1) , 206
- https://doi.org/10.1186/1471-2148-7-206
Abstract
Background: The evolutionary rate at a given homologous position varies across time. When sufficiently pronounced, this phenomenon – called heterotachy – may produce artefactual phylogenetic reconstructions under the commonly used models of sequence evolution. These observations have motivated the development of models that explicitly recognize heterotachy, with research directions proposed along two main axes: 1) thecovarionapproach, where sites switch from variable to invariable states; and 2) themixture of branch lengths(MBL) approach, where alignment patterns are assumed to arise from one of several sets of branch lengths, under a given phylogeny.Results: Here, we report the first statistical comparisons contrasting the performance of covarion and MBL modeling strategies. Using simulations under heterotachous conditions, we explore the properties of three model comparison methods: the Akaike information criterion, the Bayesian information criterion, and cross validation. Although more time consuming, cross validation appears more reliable than AIC and BIC as it directly measures the predictive power of a model on 'future' data. We also analyze three large datasets (nuclear proteins of animals, mitochondrial proteins of mammals, and plastid proteins of plants), and find the optimal number of components of the MBL model to be two for all datasets, indicating that this model is preferred over the standard homogeneous model. However, the covarion model is always favored over the optimal MBL model.Conclusion: We demonstrated, using three large datasets, that the covarion model is more efficient at handling heterotachy than the MBL model. This is probably due to the fact that the MBL model requires a serious increase in the number of parameters, as compared to two supplementary parameters of the covarion approach. Further improvements of the both the mixture and the covarion approaches might be obtained by modeling heterogeneous behavior both along time and across sites.Keywords
This publication has 60 references indexed in Scilit:
- Identifying dramatic selection shifts in phylogenetic treesBMC Ecology and Evolution, 2007
- Heterotachy Processes in Rhodophyte-Derived Secondhand Plastid Genes: Implications for Addressing the Origin and Evolution of Dinoflagellate PlastidsMolecular Biology and Evolution, 2006
- Heterotachy in Mammalian Promoter EvolutionPLoS Genetics, 2006
- A call for likelihood phylogenetics even when the process of sequence evolution is heterogeneousMolecular Phylogenetics and Evolution, 2005
- PhylogenomicsAnnual Review of Ecology, Evolution, and Systematics, 2005
- Performance of maximum parsimony and likelihood phylogenetics when evolution is heterogeneousNature, 2004
- Asymptotic Optimality of Likelihood-Based Cross-ValidationStatistical Applications in Genetics and Molecular Biology, 2004
- An entropy criterion for assessing the number of clusters in a mixture modelJournal of Classification, 1996
- Asymptotic Properties of Maximum Likelihood Estimators and Likelihood Ratio Tests under Nonstandard ConditionsJournal of the American Statistical Association, 1987
- Estimating the Dimension of a ModelThe Annals of Statistics, 1978