Non-homogeneous models of sequence evolution in the Bio++ suite of libraries and programs

Open Access

1 January 2008

journal article
Published by Springer Nature in BMC Ecology and Evolution

Vol. 8 (1) , 255
https://doi.org/10.1186/1471-2148-8-255

Abstract

Accurately modeling the sequence substitution process is required for the correct estimation of evolutionary parameters, be they phylogenetic relationships, substitution rates or ancestral states; it is also crucial to simulate realistic data sets. Such simulation procedures are needed to estimate the null-distribution of complex statistics, an approach referred to as parametric bootstrapping, and are also used to test the quality of phylogenetic reconstruction programs. It has often been observed that homologous sequences can vary widely in their nucleotide or amino-acid compositions, revealing that sequence evolution has changed importantly among lineages, and may therefore be most appropriately approached through non-homogeneous models. Several programs implementing such models have been developed, but they are limited in their possibilities: only a few particular models are available for likelihood optimization, and data sets cannot be easily generated using the resulting estimated parameters.

Keywords

This publication has 32 references indexed in Scilit:

A Site- and Time-Heterogeneous Model of Amino Acid Replacement
Molecular Biology and Evolution, 2008
Detecting and Overcoming Systematic Errors in Genome-Scale Phylogenies
Systematic Biology, 2007
PAML 4: Phylogenetic Analysis by Maximum Likelihood
Molecular Biology and Evolution, 2007
The Biasing Effect of Compositional Heterogeneity on Phylogenetic Estimates May be Underestimated
Systematic Biology, 2004
A Simple, Fast, and Accurate Algorithm to Estimate Large Phylogenies by Maximum Likelihood
Systematic Biology, 2003
Bayesian Model Adequacy and Choice in Phylogenetics
Molecular Biology and Evolution, 2002
Heterotachy, an Important Process of Protein Evolution
Molecular Biology and Evolution, 2002
Maximum-Likelihood Phylogenetic Analysis Under a Covarion-like Model
Molecular Biology and Evolution, 2001
Likelihood-Based Tests of Topologies in Phylogenetics
Systematic Biology, 2000
Seq-Gen: an application for the Monte Carlo simulation of DNA sequence evolution along phylogenetic trees
Bioinformatics, 1997