Protein binding site prediction using an empirical scoring function
Open Access
- 1 January 2006
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 34 (13) , 3698-3707
- https://doi.org/10.1093/nar/gkl454
Abstract
Most biological processes are mediated by interactions between proteins and their interacting partners including proteins, nucleic acids and small molecules. This work establishes a method called PINUP for binding site prediction of monomeric proteins. With only two weight parameters to optimize, PINUP produces not only 42.2% coverage of actual interfaces (percentage of correctly predicted interface residues in actual interface residues) but also 44.5% accuracy in predicted interfaces (percentage of correctly predicted interface residues in the predicted interface residues) in a cross validation using a 57-protein dataset. By comparison, the expected accuracy via random prediction (percentage of actual interface residues in surface residues) is only 15%. The binding sites of the 57-protein set are found to be easier to predict than that of an independent test set of 68 proteins. The average coverage and accuracy for this independent test set are 30.5 and 29.4%, respectively. The significant gain of PINUP over expected random prediction is attributed to (i) effective residue-energy score and accessible-surface-area-dependent interface-propensity, (ii) isolation of functional constraints contained in the conservation score from the structural constraints through the combination of residue-energy score (for structural constraints) and conservation score and (iii) a consensus region built on top-ranked initial patches.Keywords
This publication has 44 references indexed in Scilit:
- Distinguishing Structural and Functional Restraints in Evolution in Order to Identify Interaction SitesJournal of Molecular Biology, 2004
- Prediction of functional sites by analysis of sequence and structure conservationProtein Science, 2004
- ProMate: A Structure Based Prediction Program to Identify the Location of Protein–Protein Binding SitesJournal of Molecular Biology, 2004
- Prediction of Catalytic Residues in Enzymes Based on Known Tertiary Structure, Stability Profile, and Sequence ConservationJournal of Molecular Biology, 2003
- Prediction of functionally important residues based solely on the computed energetics of protein structure 1 1Edited by B. HonigJournal of Molecular Biology, 2001
- The Protein Data BankNucleic Acids Research, 2000
- Asparagine and glutamine: using hydrogen atom contacts in the choice of side-chain amide orientation 1 1Edited by J. ThorntonJournal of Molecular Biology, 1999
- Analysis of protein-protein interaction sites using surface patches 1 1Edited by G.Von HeijneJournal of Molecular Biology, 1997
- Prediction of protein-protein interaction sites using patch analysis 1 1Edited by G. von HeijneJournal of Molecular Biology, 1997
- An Evolutionary Trace Method Defines Binding Surfaces Common to Protein FamiliesJournal of Molecular Biology, 1996