A Genetic Algorithm Approach to Detecting Lineage-Specific Variation in Selection Pressure
- 27 October 2004
- journal article
- Published by Oxford University Press (OUP) in Molecular Biology and Evolution
- Vol. 22 (3) , 478-485
- https://doi.org/10.1093/molbev/msi031
Abstract
The ratio of nonsynonymous (dN) to synonymous (dS) substitution rates, omega, provides a measure of selection at the protein level. Models have been developed that allow omega to vary among lineages. However, these models require the lineages in which differential selection has acted to be specified a priori. We propose a genetic algorithm approach to assign lineages in a phylogeny to a fixed number of different classes of omega, thus allowing variable selection pressure without a priori specification of particular lineages. This approach can identify models with a better fit than a single-ratio model, and with fits that are better than (in an information theoretic sense) a fully local model, in which all lineages are assumed to evolve under different values of omega, but with far fewer parameters. By averaging over models which explain the data reasonably well, we can assess the robustness of our conclusions to uncertainty in model estimation. Our approach can also be used to compare results from models in which branch classes are specified a priori with a wide range of credible models. We illustrate our methods on primate lysozyme sequences and compare them with previous methods applied to the same data sets.Keywords
This publication has 18 references indexed in Scilit:
- HyPhy: hypothesis testing using phylogeniesBioinformatics, 2004
- Model Selection and Multimodel InferencePublished by Springer Nature ,2004
- Optimizing the Order of Taxon Addition in Phylogenetic Tree Construction Using Genetic AlgorithmPublished by Springer Nature ,2003
- Genetic Algorithms and Parallel Processing in Maximum-Likelihood Phylogeny InferenceMolecular Biology and Evolution, 2002
- The metapopulation genetic algorithm: An efficient solution for the problem of large phylogeny estimationProceedings of the National Academy of Sciences, 2002
- Genetic Algorithm-Based Maximum-Likelihood Analysis for Molecular PhylogenyJournal of Molecular Evolution, 2001
- A genetic algorithm for maximum-likelihood phylogeny inference using nucleotide sequence dataMolecular Biology and Evolution, 1998
- Episodic adaptive evolution of primate lysozymesNature, 1997
- A new look at the statistical model identificationIEEE Transactions on Automatic Control, 1974
- On Information and SufficiencyThe Annals of Mathematical Statistics, 1951