A New Method for Estimating Nonsynonymous Substitutions and Its Applications to Detecting Positive Selection
Open Access
- 19 October 2005
- journal article
- research article
- Published by Oxford University Press (OUP) in Molecular Biology and Evolution
- Vol. 23 (2) , 372-379
- https://doi.org/10.1093/molbev/msj043
Abstract
The standard methods for computing the number of nonsynonymous substitutions (Ka) lump all amino acid changes into one single class, even though their rates of substitution vary by at least 10-fold (Tang et al., 2004). Classifying these changes by their physicochemical properties has not been suitably effective in isolating the fastest evolving classes of changes. We now propose to use the Universal index U of Tang et al. (2004) to classify the 75 elementary amino acid changes (codons differing by 1 bp) by their evolutionary exchangeability. Let Ki denote the Ka value of each class (i = 1, …, 75 from the most to the least exchangeable). The cumulative Ki for the top 10 classes, denoted Kh (for high-exchangeability types), has two important properties: (1) Kh usually accounts for 25%–30% of total amino acid changes and (2) when the observed number of amino acid substitutions is large, Kh is predictably twice the value of Ka. This shall be referred to as the twofold approximation. The new method for estimating Kh is applied to the comparisons between human and macaque and between mouse and rat. The twofold approximation holds well in these data sets, and the signature of positive selection can be more easily discerned using the Kh statistic than using Ka. Many genes with Ka/Ks > 0.5 can now be shown to have Kh/Ks > 1 and to have evolved adaptively, at least for the high-exchangeability group of amino acid changes.Keywords
This publication has 28 references indexed in Scilit:
- Codon volatility does not detect selectionNature, 2005
- Codon bias and selection on single genomesNature, 2005
- The Comparative Method Rules! Codon Volatility Cannot Detect Positive Darwinian Selection Using a Single Genome SequenceMolecular Biology and Evolution, 2004
- Frequent False Detection of Positive Selection by the Likelihood Method with Branch-Site ModelsMolecular Biology and Evolution, 2004
- A Universal Evolutionary Index for Amino Acid ChangesMolecular Biology and Evolution, 2004
- Patterns of Transitional Mutation Biases Within and Among Mammalian GenomesMolecular Biology and Evolution, 2003
- Testing the neutral theory of molecular evolution with genomic data from DrosophilaNature, 2002
- Adaptive protein evolution at the Adh locus in DrosophilaNature, 1991
- Pattern of nucleotide substitution at major histocompatibility complex class I loci reveals overdominant selectionNature, 1988
- Amino Acid Difference Formula to Help Explain Protein EvolutionScience, 1974