New Methods for Ligand-Based Virtual Screening: Use of Data Fusion and Machine Learning to Enhance the Effectiveness of Similarity Searching
- 4 January 2006
- journal article
- Published by American Chemical Society (ACS) in Journal of Chemical Information and Modeling
- Vol. 46 (2) , 462-470
- https://doi.org/10.1021/ci050348j
Abstract
Similarity searching using a single bioactive reference structure is a well-established technique for accessing chemical structure databases. This paper describes two extensions of the basic approach. First, we discuss the use of group fusion to combine the results of similarity searches when multiple reference structures are available. We demonstrate that this technique is notably more effective than conventional similarity searching in scaffold-hopping searches for structurally diverse sets of active molecules; conversely, the technique will do little to improve the search performance if the actives are structurally homogeneous. Second, we make the assumption that the nearest neighbors resulting from a similarity search, using a single bioactive reference structure, are also active and use this assumption to implement approximate forms of group fusion, substructural analysis, and binary kernel discrimination. This approach, called turbo similarity searching, is notably more effective than conventional similarity searching.Keywords
This publication has 26 references indexed in Scilit:
- Enhancing the Effectiveness of Similarity-Based Virtual Screening Using Nearest-Neighbor InformationJournal of Medicinal Chemistry, 2005
- “Lead Hopping”. Validation of Topomer Similarity as a Superior Predictor of Similar Biological ActivitiesJournal of Medicinal Chemistry, 2004
- Molecular similarity: a key technique in molecular informaticsOrganic & Biomolecular Chemistry, 2004
- Comparison of topological descriptors for similarity-based virtual screening using multiple bioactive reference structuresOrganic & Biomolecular Chemistry, 2004
- Enhancing the Effectiveness of Virtual Screening by Fusing Nearest Neighbor Lists: A Comparison of Similarity CoefficientsJournal of Chemical Information and Computer Sciences, 2004
- Development of a Method for Evaluating Drug-Likeness and Ease of Synthesis Using a Data Set in Which Compounds Are Assigned Scores Based on Chemists' IntuitionJournal of Chemical Information and Computer Sciences, 2003
- Comparison of Ranking Methods for Virtual Screening in Lead-Discovery ProgramsJournal of Chemical Information and Computer Sciences, 2002
- Scaffold Searching: Automated Identification of Similar Ring Systems for the Design of Combinatorial LibrariesQuantitative Structure-Activity Relationships, 2002
- Neighborhood Behavior: A Useful Concept for Validation of “Molecular Diversity” DescriptorsJournal of Medicinal Chemistry, 1996
- Atom pairs as molecular features in structure-activity studies: definition and applicationsJournal of Chemical Information and Computer Sciences, 1985