A Statistical Model for Predicting Protein Folding Rates from Amino Acid Sequence with Structural Class Information
- 26 January 2005
- journal article
- research article
- Published by American Chemical Society (ACS) in Journal of Chemical Information and Modeling
- Vol. 45 (2) , 494-501
- https://doi.org/10.1021/ci049757q
Abstract
Prediction of protein folding rates from amino acid sequences is one of the most important challenges in molecular biology. In this work, I have related the protein folding rates with physical-chemical, energetic and conformational properties of amino acid residues. I found that the classification of proteins into different structural classes shows an excellent correlation between amino acid properties and folding rates of two- and three-state proteins, indicating the importance of native state topology in determining the protein folding rates. I have formulated a simple linear regression model for predicting the protein folding rates from amino acid sequences along with structural class information and obtained an excellent agreement between predicted and experimentally observed folding rates of proteins; the correlation coefficients are 0.99, 0.96 and 0.95, respectively, for all-α, all-β and mixed class proteins. This is the first available method, which is capable of predicting the protein folding rates just from the amino acid sequence with the aid of generic amino acid properties and structural class information.Keywords
This publication has 53 references indexed in Scilit:
- Fast folding of Escherichia coli cyclophilin A: a hypothesis of a unique hydrophobic core with a phenylalanine clusterJournal of Molecular Biology, 2000
- The Protein Data BankNucleic Acids Research, 2000
- Submillisecond folding of the peripheral subunit-binding domainJournal of Molecular Biology, 1999
- Rapid folding with and without populated intermediates in the homologous four-helix proteins Im7 and Im9Journal of Molecular Biology, 1999
- Global analysis of the effects of temperature and denaturant on the folding and unfolding kinetics of the N-terminal domain of the protein L9 1 1Edited by P. E. WrightJournal of Molecular Biology, 1998
- Contact order, transition state placement and the refolding rates of single domain proteins 1 1Edited by P. E. WrightJournal of Molecular Biology, 1998
- Thermodynamic stability and folding of GroEL minichaperones 1 1Edited by P. E. WrightJournal of Molecular Biology, 1998
- A comparison of the folding kinetics and thermodynamics of two homologous fibronectin type III modulesJournal of Molecular Biology, 1997
- Fast and One-step Folding of Closely and Distantly Related Homologous Proteins of a Four-helix Bundle FamilyJournal of Molecular Biology, 1996
- Analysis of amino acid indices and mutation matrices for sequence comparison and structure prediction of proteinsProtein Engineering, Design and Selection, 1996