Virtual Screening of Molecular Databases Using a Support Vector Machine
- 16 April 2005
- journal article
- research article
- Published by American Chemical Society (ACS) in Journal of Chemical Information and Modeling
- Vol. 45 (3) , 549-561
- https://doi.org/10.1021/ci049641u
Abstract
The Support Vector Machine (SVM) is an algorithm that derives a model used for the classification of data into two categories and which has good generalization properties. This study applies the SVM algorithm to the problem of virtual screening for molecules with a desired activity. In contrast to typical applications of the SVM, we emphasize not classification but enrichment of actives by using a modified version of the standard SVM function to rank molecules. The method employs a simple and novel criterion for picking molecular descriptors and uses cross-validation to select SVM parameters. The resulting method is more effective at enriching for active compounds with novel chemistries than binary fingerprint-based methods such as binary kernel discrimination.Keywords
This publication has 22 references indexed in Scilit:
- Feature Selection in MLPs and SVMs Based on Maximum Output InformationIEEE Transactions on Neural Networks, 2004
- Glide: A New Approach for Rapid, Accurate Docking and Scoring. 2. Enrichment Factors in Database ScreeningJournal of Medicinal Chemistry, 2004
- Drug Discovery Using Support Vector Machines. The Case Studies of Drug-likeness, Agrochemical-likeness, and Enzyme Inhibition PredictionsJournal of Chemical Information and Computer Sciences, 2003
- Comparison of Linear and Nonlinear Classification Algorithms for the Prediction of Drug and Chemical Metabolism by Human UDP-Glucuronosyltransferase IsoformsJournal of Chemical Information and Computer Sciences, 2003
- Active Learning with Support Vector Machines in the Drug Discovery ProcessJournal of Chemical Information and Computer Sciences, 2003
- Comparison of Ranking Methods for Virtual Screening in Lead-Discovery ProgramsJournal of Chemical Information and Computer Sciences, 2002
- Improvements to Platt's SMO Algorithm for SVM Classifier DesignNeural Computation, 2001
- Chemical Similarity SearchingJournal of Chemical Information and Computer Sciences, 1998
- From atoms and bonds to three-dimensional atomic coordinates: automatic model buildersChemical Reviews, 1993
- Description of several chemical structure file formats used by computer programs developed at Molecular Design LimitedJournal of Chemical Information and Computer Sciences, 1992