Determination of the conformation of folding initiation sites in proteins by computer simulation
- 1 October 1995
- journal article
- research article
- Published by Wiley in Proteins-Structure Function and Bioinformatics
- Vol. 23 (2) , 129-141
- https://doi.org/10.1002/prot.340230203
Abstract
Experimental evidence and theoretical models both suggest that protein folding begins by specific short regions of the polypeptide chain intermittently assuming conformations close to their final ones. The independent folding properties and small size of these folding initiation sites make them suitable subjects for computational methods aimed at deriving structure from sequence. We have used a torsion space Monte Carlo procedure together with an all-atom free energy function to investigate the folding of a set of such sites. The free energy function is derived by a potential of mean force analysis of experimental protein structures. The most important contributions to the total free energy are the local main chain electrostatics, main chain hydrogen bonds, and the burial of nonpolar area. Six proposed independent folding units and four control peptides 11–14 residues long have been investigated. Thirty Monte Carlo simulations were performed on each peptide, starting from different random conformations. Five of the six folding units adopted conformations close to the experimental ones in some of the runs. None of the controls did so, as expected. The generated conformations which are close to the experimental ones have among the lowest free energies encountered, although some less native like low free energy conformations were also found. The effectiveness of the method on these peptides, which have a wide variety of experimental conformations, is encouraging in two ways: First, it provides independent evidence that these regions of the sequences are able to adopt native like conformations early in folding, and therefore are most probably key components of the folding pathways. Second, it demonstrates that available simulation methods and free energy functions are able to produce reasonably accurate structures. Extensions of the methods to the folding of larger portions of proteins are suggested.Keywords
This publication has 65 references indexed in Scilit:
- Environment and exposure to solvent of protein atoms. Lysozyme and insulinPublished by Elsevier ,2004
- Folding of peptide fragments comprising the complete sequence of proteinsJournal of Molecular Biology, 1992
- The folding of an enzyme: VI. The folding pathway of barnase: Comparison with theoretical modelsJournal of Molecular Biology, 1992
- The folding of an enzyme: IV. Structure of an intermediate in the refolding of barnase analysed by a protein engineering procedureJournal of Molecular Biology, 1992
- Structure-derived hydrophobic potentialJournal of Molecular Biology, 1992
- Straight-chain non-polar amino acids are good helix-formers in waterJournal of Molecular Biology, 1991
- Structure of the haemagglutinin membrane glycoprotein of influenza virus at 3 Å resolutionNature, 1981
- Analysis of the accuracy and implications of simple methods for predicting the secondary structure of globular proteinsJournal of Molecular Biology, 1978
- The protein data bank: A computer-based archival file for macromolecular structuresJournal of Molecular Biology, 1977
- Nucleation, Rapid Folding, and Globular Intrachain Regions in ProteinsProceedings of the National Academy of Sciences, 1973