An algorithm for predicting protein–protein interaction sites: Abnormally exposed amino acid residues and secondary structure elements
Open Access
- 1 May 2006
- journal article
- Published by Wiley in Protein Science
- Vol. 15 (5) , 1017-1029
- https://doi.org/10.1110/ps.051589106
Abstract
Multiprotein systems mediate most regulatory processes in living organisms. Although the structures of the individual proteins are often defined, less is known of the structures of multiprotein systems. Computational methods for predicting interfaces, using evolutionary conservation and/or physicochemical data, have been developed. Here we consider the use of solvent accessibility, residue propensity, and hydrophobicity, in conjunction with secondary structure data, as prediction parameters. We analyze the influence of residue type and secondary structure on solvent accessibility and define a measure of “relative exposedness.” Clustering abnormally high scoring residues provides a basis for predicting interaction sites. The analysis is extended to investigate abnormally exposed secondary structure elements, particularly β‐sheet strands. We show that surface‐exposed β‐strands lacking protective features are more likely to be found at protein–protein interfaces, allowing us to create an algorithm with ∼68% and ∼75% accuracy in differentiating between interacting and edge strands in isolated β‐strands and β‐sheet strands, respectively. These methods of identifying abnormally exposed surface regions are combined in an algorithm, which, on a data set of 77 unbound and disjoint (single chain extracted from complex) structures, predicts 79% of the protein–protein interfaces correctly. If enzyme–inhibitor complexes, where the inhibitor mimics a nonprotein substrate, are excluded, the accuracy increases to 85%.Keywords
This publication has 49 references indexed in Scilit:
- Distinguishing Structural and Functional Restraints in Evolution in Order to Identify Interaction SitesJournal of Molecular Biology, 2004
- Twist and shear in β-sheets and β-ribbonsJournal of Molecular Biology, 2002
- Functional organization of the yeast proteome by systematic analysis of protein complexesNature, 2002
- The Protein Data BankNucleic Acids Research, 2000
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programsNucleic Acids Research, 1997
- Analysis of protein-protein interaction sites using surface patches 1 1Edited by G.Von HeijneJournal of Molecular Biology, 1997
- Prediction of protein-protein interaction sites using patch analysis 1 1Edited by G. von HeijneJournal of Molecular Biology, 1997
- AQUA and PROCHECK-NMR: Programs for checking the quality of protein structures solved by NMRJournal of Biomolecular NMR, 1996
- Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical featuresBiopolymers, 1983
- Prediction of protein antigenic determinants from amino acid sequences.Proceedings of the National Academy of Sciences, 1981