Highly Expressed and Slowly Evolving Proteins Share Compositional Properties with Thermophilic Proteins
Open Access
- 12 November 2009
- journal article
- research article
- Published by Oxford University Press (OUP) in Molecular Biology and Evolution
- Vol. 27 (3) , 735-741
- https://doi.org/10.1093/molbev/msp270
Abstract
The sequences of proteins encoded by a genome evolve at different rates. A correlate of a protein's evolutionary rate is its expression level: highly expressed proteins tend to evolve slowly. Some explanations of rate variation and the correlation between rate and expression predict that more slowly evolving and more highly expressed proteins have more favorable equilibrium constants for folding. Proteins from thermophiles generally have more stable folds than proteins from mesophiles, and it is known that there are systematic differences in amino acid content between thermophilic and mesophilic proteins. I examined whether there are analogous correlations of amino acid frequencies with evolutionary rate and expression level within genomes. In most of the organisms analyzed, there is a striking tendency for more slowly evolving proteins to be more thermophile-like in their amino acid compositions when adjustments are made for variation in GC content. More highly expressed proteins also tend to be more thermophile-like by the same criteria. These results suggest that part of the evolutionary rate variation among proteins is due to variation in the strength of selection for stability of the folded state. They also suggest that increasing strength of this selective force with expression level plays a role in the correlation between evolutionary rate and expression level.Keywords
This publication has 30 references indexed in Scilit:
- The universal distribution of evolutionary rates of genes and distinct characteristics of eukaryotic genes of different apparent agesProceedings of the National Academy of Sciences, 2009
- Comparative Functional Analysis of the Caenorhabditis elegans and Drosophila melanogaster ProteomesPLoS Biology, 2009
- Mistranslation-Induced Protein Misfolding as a Dominant Constraint on Coding-Sequence EvolutionCell, 2008
- TreeFam: 2008 UpdateNucleic Acids Research, 2007
- Using FlyAtlas to identify better Drosophila melanogaster models of human diseaseNature Genetics, 2007
- Protein and DNA Sequence Determinants of Thermophilic AdaptationPLoS Computational Biology, 2007
- Intrinsically unstructured proteins and their functionsNature Reviews Molecular Cell Biology, 2005
- Integrating high-throughput and computational data elucidates bacterial networksNature, 2004
- MUSCLE: multiple sequence alignment with high accuracy and high throughputNucleic Acids Research, 2004
- Sequencing and comparison of yeast species to identify genes and regulatory elementsNature, 2003