Sequence-structure matching in globular proteins: application to supersecondary and tertiary structure determination.
- 15 December 1992
- journal article
- research article
- Published by Proceedings of the National Academy of Sciences in Proceedings of the National Academy of Sciences
- Vol. 89 (24) , 12098-12102
- https://doi.org/10.1073/pnas.89.24.12098
Abstract
A methodology designed to address the inverse globular protein-folding problem (the identification of which sequences are compatible with a given three-dimensional structure) is described. By using a library of protein finger-prints, defined by the side chain interaction pattern, it is possible to match each structure to its own sequence in an exhaustive data base search. It is shown that this is a permissive requirement for the validation of the methodology. To pass the more rigorous test of identifying proteins that are not close sequence homologs, but that have similar structure, the method has been extended to include insertions and deletions in the sequence, which is compared to the fingerprint. This allows for the identification of sequences having little or no sequence homology to the fingerprint. Examples include plastocyanin/azurin/pseudoazurin, the globin family, different families of proteases and cytochromes, including cytochromes c' and b-562, actinidin/papain, and lysozyme/alpha-lactalbumin. Turning to supersecondary structure prediction, we find that alpha/beta/alpha fragments possess sufficient specificity to identify their own and related sequences. By threading a beta-hairpin through a sequence, it is possible to predict the location of such hairpins and turns with remarkable fidelity. Thus, the method greatly extends existing techniques for the prediction of both global structural homology and local supersecondary structure.Keywords
This publication has 11 references indexed in Scilit:
- Tertiary templates for proteins: Use of packing criteria in the enumeration of allowed sequences for different structural classesPublished by Elsevier ,2005
- Topology fingerprint approach to the inverse protein folding problemJournal of Molecular Biology, 1992
- A lattice model for protein structure prediction at low resolution.Proceedings of the National Academy of Sciences, 1992
- Structure-derived hydrophobic potentialJournal of Molecular Biology, 1992
- Selection of representative protein data setsProtein Science, 1992
- A Method to Identify Protein Sequences That Fold into a Known Three-Dimensional StructureScience, 1991
- An 8-fold βα barrel protein with redundant folding possibilitiesProtein Engineering, Design and Selection, 1990
- The protein data bank: A computer-based archival file for macromolecular structuresJournal of Molecular Biology, 1977