DBD-Hunter: a knowledge-based method for the prediction of DNA–protein interactions
Open Access
- 31 May 2008
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 36 (12) , 3978-3992
- https://doi.org/10.1093/nar/gkn332
Abstract
The structures of DNA–protein complexes have illuminated the diversity of DNA–protein binding mechanisms shown by different protein families. This lack of generality could pose a great challenge for predicting DNA–protein interactions. To address this issue, we have developed a knowledge-based method, DNA-binding Domain Hunter (DBD-Hunter), for identifying DNA-binding proteins and associated binding sites. The method combines structural comparison and the evaluation of a statistical potential, which we derive to describe interactions between DNA base pairs and protein residues. We demonstrate that DBD-Hunter is an accurate method for predicting DNA-binding function of proteins, and that DNA-binding protein residues can be reliably inferred from the corresponding templates if identified. In benchmark tests on ∼4000 proteins, our method achieved an accuracy of 98% and a precision of 84%, which significantly outperforms three previous methods. We further validate the method on DNA-binding protein structures determined in DNA-free (apo) state. We show that the accuracy of our method is only slightly affected on apo-structures compared to the performance on holo-structures cocrystallized with DNA. Finally, we apply the method to ∼1700 structural genomics targets and predict that 37 targets with previously unknown function are likely to be DNA-binding proteins. DBD-Hunter is freely available at http://cssb.biology.gatech.edu/skolnick/webservice/DBD-Hunter/.Keywords
This publication has 67 references indexed in Scilit:
- Predicting protein function from sequence and structureNature Reviews Molecular Cell Biology, 2007
- Residue‐level prediction of DNA‐binding sites and its application on DNA‐binding protein predictionsFEBS Letters, 2007
- Energetics of protein–DNA interactionsNucleic Acids Research, 2007
- Efficient Prediction of Nucleic Acid Binding Function from Low-resolution Protein StructuresJournal of Molecular Biology, 2006
- The NMR Solution Structure of a Mutant of the Max b/HLH/LZ Free of DNA: Insights into the Specific and Reversible DNA Binding Mechanism of Dimeric Transcription FactorsJournal of Molecular Biology, 2004
- Plasmid transcriptional repressor CopG oligomerises to render helical superstructures unbound and in complexes with oligonucleotidesJournal of Molecular Biology, 2001
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programsNucleic Acids Research, 1997
- VMD: Visual molecular dynamicsJournal of Molecular Graphics, 1996
- Crystal Structure of a p53 Tumor Suppressor-DNA Complex: Understanding Tumorigenic MutationsScience, 1994
- Hidden Markov Models in Computational BiologyJournal of Molecular Biology, 1994