Reducing the computational complexity of protein folding via fragment folding and assembly
- 1 June 2003
- journal article
- research article
- Published by Wiley in Protein Science
- Vol. 12 (6) , 1177-1187
- https://doi.org/10.1110/ps.0232903
Abstract
Understanding, and ultimately predicting, how a 1-D protein chain reaches its native 3-D fold has been one of the most challenging problems during the last few decades. Data increasingly indicate that protein folding is a hierarchical process. Hence, the question arises as to whether we can use the hierarchical concept to reduce the practically intractable computational times. For such a scheme to work, the first step is to cut the protein sequence into fragments that form local minima on the polypeptide chain. The conformations of such fragments in solution are likely to be similar to those when the fragments are embedded in the native fold, although alternate conformations may be favored during the mutual stabilization in the combinatorial assembly process. Two elements are needed for such cutting: (1) a library of (clustered) fragments derived from known protein structures and (2) an assignment algorithm that selects optimal combinations to "cover" the protein sequence. The next two steps in hierarchical folding schemes, not addressed here, are the combinatorial assembly of the fragments and finally, optimization of the obtained conformations. Here, we address the first step in a hierarchical protein-folding scheme. The input is a target protein sequence and a library of fragments created by clustering building blocks that were generated by cutting all protein structures. The output is a set of cutout fragments. We briefly outline a graph theoretic algorithm that automatically assigns building blocks to the target sequence, and we describe a sample of the results we have obtained.Keywords
This publication has 46 references indexed in Scilit:
- SCOP: A structural classification of proteins database for the investigation of sequences and structuresPublished by Elsevier ,2006
- The interpretation of protein structures: Estimation of static accessibilityPublished by Elsevier ,2004
- Environment and exposure to solvent of protein atoms. Lysozyme and insulinPublished by Elsevier ,2004
- Do aligned sequences share the same fold?Journal of Molecular Biology, 1997
- The foldon universe: a survey of structural similarity and self-recognition of independently folding units 1 1Edited by F. E. CohenJournal of Molecular Biology, 1997
- An automated classification of the structure of protein loopsJournal of Molecular Biology, 1997
- Basic local alignment search toolJournal of Molecular Biology, 1990
- Hierarchic organization of domains in globular proteinsJournal of Molecular Biology, 1979
- The tree structural organization of proteinsJournal of Molecular Biology, 1978
- The protein data bank: A computer-based archival file for macromolecular structuresJournal of Molecular Biology, 1977