Comparing protein–ligand docking programs is difficult
Top Cited Papers
- 3 June 2005
- journal article
- review article
- Published by Wiley in Proteins-Structure Function and Bioinformatics
- Vol. 60 (3) , 325-332
- https://doi.org/10.1002/prot.20497
Abstract
There is currently great interest in comparing protein-ligand docking programs. A review of recent comparisons shows that it is difficult to draw conclusions of general applicability. Statistical hypothesis testing is required to ensure that differences in pose-prediction success rates and enrichment rates are significant. Numerical measures such as root-mean-square deviation need careful interpretation and may profitably be supplemented by interaction-based measures and visual inspection of dockings. Test sets must be of appropriate diversity and of good experimental reliability. The effects of crystal-packing interactions may be important. The method used for generating starting ligand geometries and positions may have an appreciable effect on docking results. For fair comparison, programs must be given search problems of equal complexity (e.g. binding-site regions of the same size) and approximately equal time in which to solve them. Comparisons based on rescoring require local optimization of the ligand in the space of the new objective function. Re-implementations of published scoring functions may give significantly different results from the originals. Ostensibly minor details in methodology may have a profound influence on headline success rates.Keywords
This publication has 42 references indexed in Scilit:
- Ph4Dock: Pharmacophore-Based Protein−Ligand DockingJournal of Medicinal Chemistry, 2004
- Comparative evaluation of eight docking tools for docking and virtual screening accuracyProteins-Structure Function and Bioinformatics, 2004
- Improved protein–ligand docking using GOLDProteins-Structure Function and Bioinformatics, 2003
- A new test set for validating predictions of protein–ligand interactionProteins-Structure Function and Bioinformatics, 2002
- Knowledge-based scoring function to predict protein-ligand interactionsJournal of Molecular Biology, 2000
- The Protein Data BankNucleic Acids Research, 2000
- Contact area difference (CAD): a robust measure to evaluate accuracy of protein modelsJournal of Molecular Biology, 1997
- Development and validation of a genetic algorithm for flexible docking 1 1Edited by F. E. CohenJournal of Molecular Biology, 1997
- A Fast Flexible Docking Method using an Incremental Construction AlgorithmJournal of Molecular Biology, 1996
- Molecular recognition of receptor sites using a genetic algorithm with a description of desolvationJournal of Molecular Biology, 1995