Structure-based prediction of DNA-binding proteins by structural alignment and a volume-fraction corrected DFIRE-based energy function
Open Access
- 4 June 2010
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 26 (15) , 1857-1863
- https://doi.org/10.1093/bioinformatics/btq295
Abstract
Motivation: Template-based prediction of DNA binding proteins requires not only structural similarity between target and template structures but also prediction of binding affinity between the target and DNA to ensure binding. Here, we propose to predict protein–DNA binding affinity by introducing a new volume-fraction correction to a statistical energy function based on a distance-scaled, finite, ideal-gas reference (DFIRE) state. Results: We showed that this energy function together with the structural alignment program TM-align achieves the Matthews correlation coefficient (MCC) of 0.76 with an accuracy of 98%, a precision of 93% and a sensitivity of 64%, for predicting DNA binding proteins in a benchmark of 179 DNA binding proteins and 3797 non-binding proteins. The MCC value is substantially higher than the best MCC value of 0.69 given by previous methods. Application of this method to 2235 structural genomics targets uncovered 37 as DNA binding proteins, 27 (73%) of which are putatively DNA binding and only 1 protein whose annotated functions do not contain DNA binding, while the remaining proteins have unknown function. The method provides a highly accurate and sensitive technique for structure-based prediction of DNA binding proteins. Availability: The method is implemented as a part of the Structure-based function-Prediction On-line Tools (SPOT) package available at http://sparks.informatics.iupui.edu/spot Contact:yqzhou@iupui.eduKeywords
This publication has 30 references indexed in Scilit:
- Exploration of Uncharted Regions of the Protein UniversePLoS Biology, 2009
- An all‐atom knowledge‐based energy function for protein‐DNA threading, docking decoy discrimination, and prediction of transcription‐factor binding profilesProteins-Structure Function and Bioinformatics, 2009
- The Rough Guide to In Silico Function Prediction, or How To Use Sequence and Structure Information To Predict Protein FunctionPLoS Computational Biology, 2008
- Prediction of TF target sites based on atomistic models of protein-DNA complexesBMC Bioinformatics, 2008
- Ab initio folding of terminal segments with secondary structures reveals the fine difference between two closely related all‐atom statistical energy functionsProtein Science, 2008
- DBD-Hunter: a knowledge-based method for the prediction of DNA–protein interactionsNucleic Acids Research, 2008
- Predicting protein function from sequence and structureNature Reviews Molecular Cell Biology, 2007
- Learning to Translate Sequence and Structure to Function: Identifying DNA Binding and Membrane Binding ProteinsAnnals of Biomedical Engineering, 2007
- DISPLAR: an accurate method for predicting DNA-binding sites on protein surfacesNucleic Acids Research, 2007
- Distance‐scaled, finite ideal‐gas reference state improves structure‐derived potentials of mean force for structure selection and stability predictionProtein Science, 2002