Improving the Odds in Discriminating “Drug-like” from “Non Drug-like” Compounds
- 4 October 2000
- journal article
- Published by American Chemical Society (ACS) in Journal of Chemical Information and Computer Sciences
- Vol. 40 (6) , 1315-1324
- https://doi.org/10.1021/ci0003810
Abstract
We have used a feed-forward neural network technique to classify chemical compounds into potentially "drug-like" and "non drug-like" candidates. The neural network was trained to distinguish between a set of "drug-like" and "non drug-like" chemical compounds taken from the MACCS-II Drug Data Report (MDDR) and the Available Chemicals Directory (ACD). The 2D atom types (of the full atomic representation) were assigned and applied as descriptors to encode numerically each compound. There are four main conclusions: First the method performs well, correctly assigning 88% of the compounds in both MDDR and ACD. Improved discrimination was achieved by a more critical selection of training sets. Second, the method gives much better prediction performance than the widely used "Rule of Five", which accepts as many as 74% of the ACD compounds but only 66% of those in MDDR, resulting in a correlation coefficient which is effectively zero, compared to a value of 0.63 for the neural network prediction. Third, based on a standard Tanimoto similarity search the selection of drug-like compounds in the evaluation set is not biased toward compounds similar to those in the training set. Fourth, the trained neural network was applied to evaluate the drug-likeness of 136 GABA uptake inhibitors with impressive results. The implications of applying a neural network to characterize chemical compounds are discussed.Keywords
This publication has 11 references indexed in Scilit:
- A Scoring Scheme for Discriminating between Drugs and NondrugsJournal of Medicinal Chemistry, 1998
- Can We Learn To Distinguish between “Drug-like” and “Nondrug-like” Molecules?Journal of Medicinal Chemistry, 1998
- Identification and control of anaerobic digesters using adaptive, on-line trained neural networksComputers & Chemical Engineering, 1997
- Automated analysis of nuclear magnetic resonance assignments for proteinsCurrent Opinion in Structural Biology, 1995
- Applications of Combinatorial Technologies to Drug Discovery. 2. Combinatorial Organic Synthesis, Library Screening Strategies, and Future DirectionsJournal of Medicinal Chemistry, 1994
- Applications of Combinatorial Technologies to Drug Discovery. 1. Background and Peptide Combinatorial LibrariesJournal of Medicinal Chemistry, 1994
- Prediction of Protein Secondary Structure at Better than 70% AccuracyJournal of Molecular Biology, 1993
- Applications of neural networks in quantitative structure-activity relationships of dihydrofolate reductase inhibitorsJournal of Medicinal Chemistry, 1991
- Prediction of human mRNA donor and acceptor sites from the DNA sequenceJournal of Molecular Biology, 1991
- Predicting the secondary structure of globular proteins using neural network modelsJournal of Molecular Biology, 1988