Self‐consistently optimized statistical mechanical energy functions for sequence structure alignment
Open Access
- 1 June 1996
- journal article
- review article
- Published by Wiley in Protein Science
- Vol. 5 (6) , 1043-1059
- https://doi.org/10.1002/pro.5560050607
Abstract
A quantitative form of the principle of minimal frustration is used to obtain from a database analysis statistical mechanical energy functions and gap parameters for aligning sequences to three-dimensional structures. The analysis that partially takes into account correlations in the energy landscape improves upon the previous approximations of Goldstein et al. (1994, 1995) (Goldstein R, Luthey-Schulten Z, Wolynes P, 1994, Proceedings of the 27th Hawaii International Conference on System Sciences. Los Alamitos, California: IEEE Computer Society Press. pp 306–315; Goldstein R, Luthey-Schulten Z, Wolynes P, 1995, In: Elber R, ed. New developments in theoretical studies of proteins. Singapore: World Scientific). The energy function allows for ordering of alignments based on the compatibility of a sequence to be in a given structure (i.e., lowest energy) and therefore removes the necessity of using percent identity or similarity as scoring parameters. The alignments produced by the energy function on distant homologues with low percent identity (less than 21%) are generally better than those generated with evolutionary information. The lowest energy alignment generated with the energy function for sequences containing prosite signatures but unknown structures is a structure containing the same prosite signature, providing a check on the robustness of the algorithm. Finally, the energy function can make use of known experimental evidence as constraints within the alignment algorithm to aid in finding the correct structural alignment.Keywords
Funding Information
- National Institutes of Health ((PHS2 R02GM44557-06))
This publication has 55 references indexed in Scilit:
- SCOP: A structural classification of proteins database for the investigation of sequences and structuresPublished by Elsevier ,2006
- Topology fingerprint approach to the inverse protein folding problemJournal of Molecular Biology, 1992
- Suboptimal sequence alignment in molecular biologyJournal of Molecular Biology, 1991
- Crystal structure of thioredoxin from Escherichia coli at 1.68 Å resolutionJournal of Molecular Biology, 1990
- The MIDAS display systemJournal of Molecular Graphics, 1988
- Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical featuresBiopolymers, 1983
- Structure of the semiquinone form of flavodoxin from Clostridium MPJournal of Molecular Biology, 1977
- The protein data bank: A computer-based archival file for macromolecular structuresJournal of Molecular Biology, 1977
- Structure of myoglobin refined at 2·0 Å resolutionJournal of Molecular Biology, 1977
- A general method applicable to the search for similarities in the amino acid sequence of two proteinsJournal of Molecular Biology, 1970