Three‐dimensional, sequence order‐independent structural comparison of a serine protease against the crystallographic database reveals active site similarities: Potential implications to evolution and to protein folding
- 1 May 1994
- journal article
- research article
- Published by Wiley in Protein Science
- Vol. 3 (5) , 769-778
- https://doi.org/10.1002/pro.5560030506
Abstract
We have recently developed a fast approach to comparisons of 3‐dimensional structures. Our method is unique, treating protein structures as collections of unconnected points (atoms) in space. It is completely independent of the amino acid sequence order. It is unconstrained by insertions, deletions, and chain directionality. It matches single, isolated amino acids between 2 different structures strictly by their spatial positioning regardless of their relative sequential position in the amino acid chain. It automatically detects a recurring 3D motif in protein molecules. No predefinition of the motif is required. The motif can be either in the interior of the proteins or on their surfaces. In this work, we describe an enhancement over our previously developed technique, which considerably reduces the complexity of the algorithm. This results in an extremely fast technique. A typical pairwise comparison of 2 protein molecules requires less than 3 s on a workstation. We have scanned the structural database with dozens of probes, successfully detecting structures that are similar to the probe. To illustrate the power of this method, we compare the structure of a trypsin‐like serine protease against the structural database. Besides detecting homologous trypsin‐like proteases, we automatically obtain 3D, sequence order‐independent, active‐site similarities with subtilisin‐like and sulfhydryl proteases. These similarities equivalence isolated residues, not conserving the linear order of the amino acids in the chains. The active‐site similarities are well known and have been detected by manually inspecting the structures in a time‐consuming, laborious procedure. This is the first time such equivalences are obtained automatically from the comparison of full structures. The far‐reaching advantages and the implications of our novel algorithm to studies of protein folding, to evolution, and to searches for pharmacophoric patterns are discussed.Keywords
This publication has 23 references indexed in Scilit:
- Refined Structure of Sindbis Virus Core Protein and Comparison with Other Chymotrypsin-like Serine Proteinase StructuresJournal of Molecular Biology, 1993
- Identification of Tertiary Structure Resemblance in Proteins Using a Maximal Common Subgraph Isomorphism AlgorithmJournal of Molecular Biology, 1993
- Common spatial arrangements of backbone fragments in homologous and non-homologous proteinsJournal of Molecular Biology, 1992
- Use of techniques derived from graph theory to compare secondary structure motifs in proteinsJournal of Molecular Biology, 1990
- Protein structure alignmentJournal of Molecular Biology, 1989
- Synchrotron X-ray data collection and restrained least-squares refinement of the crystal structure of proteinase K at 1.5 Å resolutionActa crystallographica Section B, Structural science, crystal engineering and materials, 1988
- Crystal and molecular structures of the complex of α-chymotrypsin with its inhibitor Turkey ovomucoid third domain at 1.8 Å resolutionJournal of Molecular Biology, 1987
- Structure of papain refined at 1.65 Å resolutionJournal of Molecular Biology, 1984
- The protein data bank: A computer-based archival file for macromolecular structuresJournal of Molecular Biology, 1977