Generation and analysis of a protein–protein interface data set with similar chemical and spatial patterns of interactions
- 3 August 2005
- journal article
- research article
- Published by Wiley in Proteins-Structure Function and Bioinformatics
- Vol. 61 (1) , 6-20
- https://doi.org/10.1002/prot.20580
Abstract
Protein–protein interfaces are regions between 2 polypeptide chains that are not covalently connected. Here, we have created a nonredundant interface data set generated from all 2-chain interfaces in the Protein Data Bank. This data set is unique, since it contains clusters of interfaces with similar shapes and spatial organization of chemical functional groups. The data set allows statistical investigation of similar interfaces, as well as the identification and analysis of the chemical forces that account for the protein–protein associations. Toward this goal, we have developed I2I-SiteEngine (Interface-to-Interface SiteEngine) [Data set available at http://bioinfo3d.cs.tau.ac.il/Interfaces; Web server: http://bioinfo3d.cs.tau.ac.il/I2I-SiteEngine]. The algorithm recognizes similarities between protein–protein binding surfaces. I2I-SiteEngine is independent of the sequence or the fold of the proteins that comprise the interfaces. In addition to geometry, the method takes into account both the backbone and the side-chain physicochemical properties of the interacting atom groups. Its high efficiency makes it suitable for large-scale database searches and classifications. Below, we briefly describe the I2I-SiteEngine method. We focus on the classification process and the obtained nonredundant protein–protein interface data set. In particular, we analyze the biological significance of the clusters and present examples which illustrate that given constellations of chemical groups in protein–protein binding sites may be preferred, and are observed in proteins with different structures and different functions. We expect that these would yield further information regarding the forces stabilizing protein–protein interactions. Proteins 2005. 2005 Wiley-Liss, Inc.Keywords
This publication has 118 references indexed in Scilit:
- SCOP: A structural classification of proteins database for the investigation of sequences and structuresPublished by Elsevier ,2006
- Recognition of Functional Sites in Protein StructuresJournal of Molecular Biology, 2004
- ProMate: A Structure Based Prediction Program to Identify the Location of Protein–Protein Binding SitesJournal of Molecular Biology, 2004
- Identification of protein biochemical functions by similarity search using the molecular surface database eF‐siteProtein Science, 2003
- Solution structure and dynamics of yeast elongin C in complex with a von hippel-lindau peptide 1 1Edited by M. F. SummersJournal of Molecular Biology, 2001
- The Protein Data BankNucleic Acids Research, 2000
- The atomic structure of protein-protein recognition sites 1 1Edited by A. R. FershtJournal of Molecular Biology, 1999
- Derivation of 3D coordinate templates for searching structural databases: Application to ser‐His‐Asp catalytic triads in the serine proteinases and lipasesProtein Science, 1996
- Basic local alignment search toolJournal of Molecular Biology, 1990
- Algorithm 457: finding all cliques of an undirected graphCommunications of the ACM, 1973