Hit-Directed Nearest-Neighbor Searching
- 16 December 2004
- journal article
- Published by American Chemical Society (ACS) in Journal of Medicinal Chemistry
- Vol. 48 (1) , 240-248
- https://doi.org/10.1021/jm0493515
Abstract
This work describes a practical strategy used at Pharmacia for identifying compounds for follow-up screening following an initial HTS campaign against targets where no 3-D structural information is available and preliminary SAR models do not exist. The approach explicitly takes into account different representations of chemistry space and identifies compounds for follow-up screening that are likely to provide the best overall coverage of the chemistry spaces considered. Specifically, the work employs hit-directed nearest-neighbor (HDNN) searching of compound databases based upon a set of “probe compounds” obtained as hits in the preliminary high-throughput screens. Four different molecular representations that generate nearly unique chemistry spaces are used. The representations include 3-D, 2-D, 2-D topological BCUTs (2-DT) and molecular fingerprints derived from substructural fragments. In the case of the BCUT representations the NN searching is distance based, while in the case of molecular fingerprints a similarity-based measure is used. Generally, the results obtained differ significantly among all four methods, that is, the sets of NN compounds have surprisingly little overlap. Moreover, in all of the four chemistry space representations, a minimum of 3- to 4-fold enrichment in actives over random screening is observed even though the actives identified in each of the sets of NNs are in large measure unique. These results suggest that use of multiple searches based upon a variety of molecular representations provides an effective way of identifying more hits in HDNN searches of chemistry spaces than can be realized with single searches.Keywords
This publication has 12 references indexed in Scilit:
- Conditional Probability: A New Fusion Method for Merging Disparate Virtual Screening ResultsJournal of Chemical Information and Computer Sciences, 2004
- Combination of Fingerprint-Based Similarity Coefficients Using Data FusionJournal of Chemical Information and Computer Sciences, 2002
- Structure-based virtual screening: an overviewPublished by Elsevier ,2002
- Why do we need so many chemical similarity search methods?Drug Discovery Today, 2002
- The Elements of Statistical LearningPublished by Springer Nature ,2001
- Classification of Kinase Inhibitors Using BCUT DescriptorsJournal of Chemical Information and Computer Sciences, 2000
- Design and Diversity Analysis of Large Combinatorial Libraries Using Cell-Based MethodsJournal of Chemical Information and Computer Sciences, 1999
- Metric Validation and the Receptor-Relevant Subspace ConceptJournal of Chemical Information and Computer Sciences, 1999
- Virtual screening—an overviewDrug Discovery Today, 1998
- Molecular identification number for substructure searchesJournal of Chemical Information and Computer Sciences, 1989