How Does Consensus Scoring Work for Virtual Library Screening? An Idealized Computer Experiment
- 17 July 2001
- journal article
- research article
- Published by American Chemical Society (ACS) in Journal of Chemical Information and Computer Sciences
- Vol. 41 (5) , 1422-1426
- https://doi.org/10.1021/ci010025x
Abstract
It has been reported recently that consensus scoring, which combines multiple scoring functions in binding affinity estimation, leads to higher hit-rates in virtual library screening studies. This method seems largely independent of the target receptor, the docking program, and even the scoring functions under investigation. Here we present an idealized computer experiment to explore how consensus scoring works. A hypothetical set of 5000 compounds is used to represent a chemical library under screening. The binding affinities of all its member compounds are assigned by mimicking a real situation. Based on the assumption that the error of a scoring function is a random number drawn from a normal distribution, the predicted binding affinities were generated by adding such a random number to the “observed” binding affinities. The relationship between the hit-rates and the number of scoring functions employed in scoring was then investigated. The performance of several typical ranking strategies for a consensus scoring procedure was also explored. Our results demonstrate that consensus scoring outperforms any single scoring function for a simple statistical reason: the mean value of repeated samplings tends to be closer to the true value. Our results also suggest that a moderate number of scoring functions, three or four, is sufficient for the purpose of consensus scoring. As for the ranking strategy, both the rank-by-number and the rank-by-rank strategies work more effectively than the rank-by-vote strategy.
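The experiment described in the abstract can be sketched in a few lines of Python. This is a minimal illustration, not the authors' code: the function name `simulate_hit_rates`, the distribution of the "observed" affinities, the error standard deviation, the top-2% hit definition, and the exact definitions of the three ranking strategies (average of scores, average of ranks, count of top-fraction votes) are all assumptions made here for the sake of a runnable example.

```python
import numpy as np

def simulate_hit_rates(n_compounds=5000, n_functions=3, error_sd=1.0,
                       top_fraction=0.02, seed=0):
    """Idealized consensus-scoring experiment (all parameters are assumed)."""
    rng = np.random.default_rng(seed)

    # Hypothetical "observed" affinities; the normal shape is an assumption.
    true_affinity = rng.normal(loc=6.0, scale=1.5, size=n_compounds)

    # Each scoring function = observed value + independent normal error.
    errors = rng.normal(0.0, error_sd, size=(n_functions, n_compounds))
    predicted = true_affinity + errors

    n_top = int(round(top_fraction * n_compounds))
    true_hits = np.argsort(-true_affinity)[:n_top]

    def hit_rate(consensus, higher_is_better=True):
        # Fraction of the selected top-ranked compounds that are true hits.
        key = -consensus if higher_is_better else consensus
        selected = np.argsort(key, kind="stable")[:n_top]
        return float(np.isin(selected, true_hits).mean())

    # Rank of every compound under every scoring function (0 = best).
    ranks = np.argsort(np.argsort(-predicted, axis=1), axis=1)

    # One vote per scoring function that places the compound in its top
    # fraction. Ties in vote count are broken by compound index here.
    votes = (ranks < n_top).sum(axis=0)

    return {
        "single": hit_rate(predicted[0]),
        "rank_by_number": hit_rate(predicted.mean(axis=0)),
        "rank_by_rank": hit_rate(ranks.mean(axis=0), higher_is_better=False),
        "rank_by_vote": hit_rate(votes),
    }

if __name__ == "__main__":
    for n in (1, 2, 3, 4, 8):
        print(n, simulate_hit_rates(n_functions=n))
```

Averaging `n_functions` independent errors shrinks the standard deviation of the consensus error by a factor of the square root of `n_functions`, which is the statistical effect the abstract refers to. The numbers printed by this sketch depend entirely on the assumed parameters and should not be read as the paper's results.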