Sampling protein conformations using segment libraries and a genetic algorithm
- 8 March 1997
- journal article
- research article
- Published by AIP Publishing in The Journal of Chemical Physics
- Vol. 106 (10) , 4270-4281
- https://doi.org/10.1063/1.473514
Abstract
We present a new simulation algorithm for minimizing empirical contact potentials for a simplified model of protein structure. The model consists of backbone atoms only (including Cβ) with the φ and ψ dihedral angles as the only degrees of freedom. In addition, φ and ψ are restricted to a finite set of 532 discrete pairs of values, and the secondary structural elements are held fixed in ideal geometries. The potential function consists of a look-up table based on discretized inter-residue atomic distances. The minimization consists of two principal elements: the use of preselected lists of trial moves and the use of a genetic algorithm. The trial moves consist of substitutions of one or two complete loop regions, and the lists are in turn built up using preselected lists of randomly-generated three-residue segments. The genetic algorithm consists of mutation steps (namely, the loop replacements), as well as a hybridization step in which new structures are created by combining parts of two “parents’’ and a selection step in which hybrid structures are introduced into the population. These methods are combined into a Monte Carlo simulated annealing algorithm which has the overall structure of a random walk on a restricted set of preselected conformations. The algorithm is tested using two types of simple model potential. The first uses global information derived from the radius of gyration and the rms deviation to drive the folding, whereas the second is based exclusively on distance-geometry constraints. The hierarchical algorithm significantly outperforms conventional Monte Carlo simulation for a set of test proteins in both cases, with the greatest advantage being for the largest molecule having 193 residues. When tested on a realistic potential function, the method consistently generates structures ranked lower than the crystal structure. The results also show that the improved efficiency of the hierarchical algorithm exceeds that which would be anticipated from tests on either of the two main elements used independently.Keywords
This publication has 33 references indexed in Scilit:
- A configurational bias Monte Carlo method for linear and cyclic peptidesMolecular Physics, 1996
- Protein dynamics with off-lattice Monte Carlo movesPhysical Review E, 1996
- Global Fold Determination from a Small Number of Distance RestraintsJournal of Molecular Biology, 1995
- Reduced representation approach to protein tertiary structure prediction: statistical potential and simulated annealingJournal of Theoretical Biology, 1995
- Prediction of peptide conformation by multicanonical algorithm: New approach to the multiple‐minima problemJournal of Computational Chemistry, 1993
- Genetic Algorithms for Protein Folding SimulationsJournal of Molecular Biology, 1993
- Discretized model of proteins. I. Monte Carlo study of cooperativity in homopolypeptidesThe Journal of Chemical Physics, 1992
- Structure-derived hydrophobic potentialJournal of Molecular Biology, 1992
- Prediction of protein backbone conformation based on seven structure assignmentsJournal of Molecular Biology, 1991
- Determination of three‐dimensional structures of proteins from interproton distance data by hybrid distance geometry‐dynamical simulated annealing calculationsFEBS Letters, 1988