Prediction of Site-Specific Amino Acid Distributions and Limits of Divergent Evolutionary Changes in Protein Sequences
Open Access
- 10 November 2004
- journal article
- research article
- Published by Oxford University Press (OUP) in Molecular Biology and Evolution
- Vol. 22 (3) , 630-638
- https://doi.org/10.1093/molbev/msi048
Abstract
We derive an analytic expression for site-specific stationary distributions of amino acids from the structurally constrained neutral (SCN) model of protein evolution with conservation of folding stability. The stationary distributions that we obtain have a Boltzmann-like shape, and their effective temperature parameter, measuring the limit of divergent evolutionary changes at a given site, can be predicted from a site-specific topological property, the principal eigenvector of the contact matrix of the native conformation of the protein. These analytic results, obtained without free parameters, are compared with simulations of the SCN model and with the site-specific amino acid distributions obtained from the Protein Data Bank. These results also provide new insights into how the topology of a protein fold influences its designability, i.e., the number of sequences compatible with that fold. The dependence of the effective temperature on the principal eigenvector decreases for longer proteins, as a possible consequence of the fact that selection for thermodynamic stability becomes weaker in this case.Keywords
All Related Versions
This publication has 50 references indexed in Scilit:
- Structural Determinant of Protein DesignabilityPhysical Review Letters, 2003
- Understanding hierarchical protein evolution from first principles 1 1Edited by J. ThorntonJournal of Molecular Biology, 2001
- Structurally constrained protein evolution: results from a lattice simulationZeitschrift für Physik B Condensed Matter, 2000
- Neutral Evolution of Model Proteins: Diffusion in Sequence Space and OverdispersionJournal of Theoretical Biology, 1999
- Non-functional conserved residues in globins and their possible role as a folding nucleus 1 1Edited by F. E. CohenJournal of Molecular Biology, 1999
- Enlarged representative set of protein structuresProtein Science, 1994
- Amino acid preferences of small proteinsJournal of Molecular Biology, 1992
- The rapid generation of mutation data matrices from protein sequencesBioinformatics, 1992
- Structure-derived hydrophobic potentialJournal of Molecular Biology, 1992
- Evolutionary trees from DNA sequences: A maximum likelihood approachJournal of Molecular Evolution, 1981