Relating destabilizing regions to known functional sites in proteins

Open Access

30 April 2007

journal article
Published by Springer Nature in BMC Bioinformatics

Vol. 8 (1), 141
https://doi.org/10.1186/1471-2105-8-141

Abstract

Most methods for predicting functional sites in protein 3D structures rely on information on related proteins and cannot be applied to proteins with no known relatives. Another limitation of these methods is the lack of a well-annotated set of functional sites to use as a benchmark for validating their predictions. Experimental findings and theoretical considerations suggest that residues involved in function often contribute unfavorably to native-state stability. We examine the possibility of systematically exploiting this intrinsic property to identify functional sites, using an original procedure that detects destabilizing regions in protein structures. In addition, to relate destabilizing regions to known functional sites, a novel benchmark consisting of a diverse set of hand-curated protein functional sites is derived.

A procedure for detecting clusters of destabilizing residues in protein structures is presented. Individual residue contributions to protein stability are evaluated using detailed atomic models and a force field successfully applied in computational protein design. The most destabilizing residues, and some of their closest neighbours, are clustered into destabilizing regions following a rigorous protocol. Our procedure is applied to high-quality apo-structures of 63 unrelated proteins. The biologically relevant binding sites of these proteins were annotated using all available information, including structural data and literature curation, resulting in the largest hand-curated dataset of binding sites in proteins available to date. Comparing the destabilizing regions with the annotated binding sites in these proteins, we find that the overlap is on average limited, but significantly better than random. Results depend on the type of bound ligand. Significant overlap is obtained for most polysaccharide- and small ligand-binding sites, whereas no overlap is observed for most nucleic acid binding sites. These differences are rationalised in terms of the geometry and energetics of the binding site.

We find that although destabilizing regions as detected here cannot in general be used to predict binding sites in protein structures, they can provide useful information, particularly on the location of functional sites that bind polysaccharides and small ligands. This information can be exploited in methods for predicting function in protein structures with no known relatives. Our publicly available benchmark of hand-curated functional sites in proteins should help other workers derive and validate new prediction methods.
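
The two computational steps summarised in the abstract, clustering the most destabilizing residues and testing their overlap with an annotated binding site against a random expectation, can be illustrated with a minimal Python sketch. This is not the authors' protocol, which the abstract does not specify in detail. The function names, the `top_fraction` and `cutoff` parameters, the sign convention (a higher energy value means more destabilizing), the single-linkage clustering rule, and the residue-shuffling baseline are all assumptions made for illustration. The sketch also omits the step in which some close neighbours of the destabilizing residues are added to each region.

```python
import math
import random
from typing import Dict, List, Set, Tuple

Coord = Tuple[float, float, float]


def destabilizing_regions(
    energies: Dict[int, float],
    coords: Dict[int, Coord],
    top_fraction: float = 0.2,   # hypothetical: share of residues kept as seeds
    cutoff: float = 6.0,         # hypothetical: linking distance in angstroms
) -> List[Set[int]]:
    """Group the most destabilizing residues into spatially connected regions."""
    n_seed = max(1, round(len(energies) * top_fraction))
    # Assumed convention: larger per-residue energy = more destabilizing.
    seeds = sorted(energies, key=energies.get, reverse=True)[:n_seed]

    regions: List[Set[int]] = []
    unvisited = set(seeds)
    while unvisited:
        start = unvisited.pop()
        region = {start}
        frontier = [start]
        # Single-linkage growth: absorb any seed within `cutoff` of the region.
        while frontier:
            current = frontier.pop()
            near = {
                s for s in unvisited
                if math.dist(coords[current], coords[s]) <= cutoff
            }
            unvisited -= near
            region |= near
            frontier.extend(near)
        regions.append(region)
    return regions


def overlap_fraction(predicted: Set[int], site: Set[int]) -> float:
    """Fraction of annotated site residues covered by the predicted region."""
    return len(predicted & site) / len(site) if site else 0.0


def random_overlap_pvalue(
    predicted: Set[int],
    site: Set[int],
    all_residues: List[int],
    trials: int = 10000,
    seed: int = 0,
) -> float:
    """Empirical p-value: how often a random residue set of the same size
    overlaps the site at least as much as the predicted region does.
    Simplification: random sets ignore spatial contiguity."""
    rng = random.Random(seed)
    observed = len(predicted & site)
    hits = 0
    for _ in range(trials):
        sample = set(rng.sample(all_residues, len(predicted)))
        if len(sample & site) >= observed:
            hits += 1
    return (hits + 1) / (trials + 1)
```

In practice the per-residue energies would come from a force-field calculation on the apo-structure, and the coordinates from a representative atom per residue. A stricter random baseline would sample spatially contiguous patches of residues, since scattered random sets tend to overlap a compact site less than a contiguous region would.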
