Evaluation of the performance of 3D virtual screening protocols: RMSD comparisons, enrichment assessments, and decoy selection—What can we learn from earlier mistakes?
Top Cited Papers
- 15 January 2008
- journal article
- research article
- Published by Springer Nature in Journal of Computer-Aided Molecular Design
- Vol. 22 (3-4) , 213-228
- https://doi.org/10.1007/s10822-007-9163-6
Abstract
Within the last few years a considerable amount of evaluative studies has been published that investigate the performance of 3D virtual screening approaches. Thereby, in particular assessments of protein–ligand docking are facing remarkable interest in the scientific community. However, comparing virtual screening approaches is a non-trivial task. Several publications, especially in the field of molecular docking, suffer from shortcomings that are likely to affect the significance of the results considerably. These quality issues often arise from poor study design, biasing, by using improper or inexpressive enrichment descriptors, and from errors in interpretation of the data output. In this review we analyze recent literature evaluating 3D virtual screening methods, with focus on molecular docking. We highlight problematic issues and provide guidelines on how to improve the quality of computational studies. Since 3D virtual screening protocols are in general assessed by their ability to discriminate between active and inactive compounds, we summarize the impact of the composition and preparation of test sets on the outcome of evaluations. Moreover, we investigate the significance of both classic enrichment parameters and advanced descriptors for the performance of 3D virtual screening methods. Furthermore, we review the significance and suitability of RMSD as a measure for the accuracy of protein–ligand docking algorithms and of conformational space sub sampling algorithms.Keywords
This publication has 80 references indexed in Scilit:
- Benchmarking Sets for Molecular DockingJournal of Medicinal Chemistry, 2006
- Current Status of Virtual Combinatorial Library DesignQSAR & Combinatorial Science, 2005
- Determining the Validity of a QSAR Model − A Classification ApproachJournal of Chemical Information and Modeling, 2004
- FlexE: efficient molecular docking considering protein structure variationsJournal of Molecular Biology, 2001
- The Protein Data BankNucleic Acids Research, 2000
- Development and validation of a genetic algorithm for flexible docking 1 1Edited by F. E. CohenJournal of Molecular Biology, 1997
- Structure-based identification of a ricin inhibitorJournal of Molecular Biology, 1997
- A Fast Flexible Docking Method using an Incremental Construction AlgorithmJournal of Molecular Biology, 1996
- Molecular recognition of receptor sites using a genetic algorithm with a description of desolvationJournal of Molecular Biology, 1995
- A new model for the agonistic binding site on the histamine H2-receptor: The catalytic triad in serine proteases as a model for the binding site ofJournal of Molecular Graphics, 1994