Families and the structural relatedness among globular proteins
- 1 June 1993
- journal article
- research article
- Published by Wiley in Protein Science
- Vol. 2 (6) , 884-899
- https://doi.org/10.1002/pro.5560020603
Abstract
Protein structures come in families. Are families “closely knit” or “loosely knit” entities? We describe a measure of relatedness among polymer conformations. Based on weighted distance maps, this measure differs from existing measures mainly in two respects: (1) it is computationally fast, and (2) it can compare any two proteins, regardless of their relative chain lengths or degree of similarity. It does not require finding relative alignments. The measure is used here to determine the dissimilarities between all 12, 403 possible pairs of 158 diverse protein structures from the Brookhaven Protein Data Bank (PDB). Combined with minimal spanning trees and hierarchical clustering methods, this measure is used to define structural families. It is also useful for rapidly searching a dataset of protein structures for specific substructural motifs. By using an analogy to distributions of Euclidean distances, we find that protein families are not tightly knit entities.Keywords
This publication has 27 references indexed in Scilit:
- THE CLASSIFICATION AND ORIGINS OF PROTEIN FOLDING PATTERNSAnnual Review of Biochemistry, 1990
- Definition of general topological equivalence in protein structuresJournal of Molecular Biology, 1990
- Protein structure alignmentJournal of Molecular Biology, 1989
- Structure of the amino-terminal domain of phage 434 represser at 2.0 Å resolutionJournal of Molecular Biology, 1989
- The structure, function and evolution of cytochromesProgress in Biophysics and Molecular Biology, 1985
- The protein data bank: A computer-based archival file for macromolecular structuresJournal of Molecular Biology, 1977
- Exploring structural homology of proteinsJournal of Molecular Biology, 1976
- Structural patterns in globular proteinsNature, 1976
- Troponin and Parvalbumin Calcium Binding Regions Predicted in Myosin Light Chain and T4 LysozymeScience, 1975
- Recognition of structural domains in globular proteinsJournal of Molecular Biology, 1974