Estimating the Genomewide Rate of Adaptive Protein Evolution in Drosophila
- 1 June 2006
- journal article
- research article
- Published by Oxford University Press (OUP) in Genetics
- Vol. 173 (2), 821-837
- https://doi.org/10.1534/genetics.106.056911
Abstract
When polymorphism and divergence data are available for multiple loci, extended forms of the McDonald–Kreitman test can be used to estimate the average proportion of the amino acid divergence due to adaptive evolution, a statistic denoted $\bar{\alpha}$. But such tests are subject to many biases. Most serious is the possibility that high estimates of $\bar{\alpha}$ reflect demographic changes rather than adaptive substitution. Testing for between-locus variation in α is one possible way of distinguishing between demography and selection. However, such tests have yielded contradictory results, and their efficacy is unclear. Estimates of $\bar{\alpha}$ from the same model organisms have also varied widely. This study clarifies the reasons for these discrepancies, identifying several method-specific biases in widely used estimators and assessing the power of the methods. As part of this process, a new maximum-likelihood estimator is introduced. This estimator is applied to a newly compiled data set of 115 genes from Drosophila simulans, each with orthologs from D. melanogaster and D. yakuba.
In this way, it is estimated that $\bar{\alpha} \approx 0.4 \pm 0.1$, a value that does not vary substantially between different loci or over different periods of divergence. The implications of these results are discussed.
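For orientation, the simplest McDonald–Kreitman-style point estimate of the statistic can be sketched as below. This is only the familiar pooled-count form, 1 - (ΣDs · ΣPn) / (ΣDn · ΣPs), and is not the maximum-likelihood estimator introduced in the article, whose form the abstract does not give. The names `LocusCounts` and `pooled_alpha` and the example counts are hypothetical, chosen purely for illustration.

```python
from typing import Iterable, NamedTuple


class LocusCounts(NamedTuple):
    """Per-locus site counts (hypothetical container, not from the article)."""
    dn: int  # nonsynonymous (amino acid) fixed differences between species
    ds: int  # synonymous fixed differences between species
    pn: int  # nonsynonymous polymorphisms within species
    ps: int  # synonymous polymorphisms within species


def pooled_alpha(loci: Iterable[LocusCounts]) -> float:
    """Pooled multi-locus estimate: 1 - (sum Ds * sum Pn) / (sum Dn * sum Ps).

    Assumes synonymous sites are neutral and that polymorphism ratios
    reflect only neutral variation; the abstract warns that estimates of
    this kind can be biased, for example by demographic change.
    """
    loci = list(loci)
    dn = sum(locus.dn for locus in loci)
    ds = sum(locus.ds for locus in loci)
    pn = sum(locus.pn for locus in loci)
    ps = sum(locus.ps for locus in loci)
    if dn == 0 or ps == 0:
        raise ValueError("estimate undefined when pooled Dn or Ps is zero")
    return 1.0 - (ds * pn) / (dn * ps)


# Made-up counts for two loci, for illustration only (not data from the study).
example = [
    LocusCounts(dn=10, ds=20, pn=3, ps=12),
    LocusCounts(dn=6, ds=10, pn=2, ps=8),
]
alpha_hat = pooled_alpha(example)
print(f"pooled alpha estimate: {alpha_hat:.3f}")
```

Pooling counts before taking ratios avoids dividing by zero at individual loci with few polymorphisms, but it weights loci by their counts; whether that is appropriate depends on the data, which is part of what the article examines.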