Fast Protein Folding in the Hydrophobic–Hydrophilic Model within Three-Eighths of Optimal
- 1 January 1996
- journal article
- research article
- Published by Mary Ann Liebert Inc in Journal of Computational Biology
- Vol. 3 (1) , 53-96
- https://doi.org/10.1089/cmb.1996.3.53
Abstract
We present performance-guaranteed approximation algorithms for the protein folding problem in the hydrophobic–hydrophilic model (Dill, 1985). Our algorithms are the first approximation algorithms in the literature with guaranteed performance for this model (Dill, 1994). The hydrophobic–hydrophilic model abstracts the dominant force of protein folding: the hydrophobic interaction. The protein is modeled as a chain of amino acids of length n that are of two types; H (hydrophobic, i.e., nonpolar) and P (hydrophilic, i.e., polar). Although this model is a simplification of more complex protein folding models, the protein folding structure prediction problem is notoriously difficult for this model. Our algorithms have linear (3n) or quadratic time and achieve a three-dimensional protein conformation that has a guaranteed free energy no worse than three-eighths of optimal. This result answers the open problem of Ngo et al. (1994) about the possible existence of an efficient approximation algorithm with guaranteed performance for protein structure prediction in any well-studied model of protein folding. By achieving speed and near-optimality simultaneously, our algorithms rigorously capture salient features of the recently proposed framework of protein folding by Sali et al. (1994). Equally important, the final conformations of our algorithms have significant secondary structure (antiparallel sheets, β-sheets, compact hydrophobic core). Furthermore, hypothetical folding pathways can be described for our algorithms that fit within the framework of diffusion-collision protein folding proposed by Karplus and Weaver (1979). Computational limitations of algorithms that compute the optimal conformation have restricted their applicability to short sequences (length ≤ 90). Because our algorithms trade computational accuracy for speed, they can construct near-optimal conformations in linear time for sequences of any size.Keywords
This publication has 26 references indexed in Scilit:
- Side‐chain entropy and packing in proteinsProtein Science, 1994
- Complexity of protein foldingBulletin of Mathematical Biology, 1993
- Cooperativity in protein-folding kinetics.Proceedings of the National Academy of Sciences, 1993
- Protein core assembly processesThe Journal of Chemical Physics, 1993
- Polymer Principles in Protein Structure and StabilityAnnual Review of Biophysics, 1991
- Origins of structure in globular proteins.Proceedings of the National Academy of Sciences, 1990
- Theory for the folding and stability of globular proteinsBiochemistry, 1985
- Circular and circularly permuted forms of bovine pancreatic trypsin inhibitorJournal of Molecular Biology, 1983
- Experimental and Theoretical Aspects of Protein FoldingAdvances in Protein Chemistry, 1975
- Helix-coil transition of the isolated amino terminus of ribonucleaseBiochemistry, 1971