Development and Validation of k-Nearest-Neighbor QSPR Models of Metabolic Stability of Drug Candidates
- 30 May 2003
- journal article
- research article
- Published by American Chemical Society (ACS) in Journal of Medicinal Chemistry
- Vol. 46 (14) , 3013-3020
- https://doi.org/10.1021/jm020491t
Abstract
Computational ADME (absorption, distribution, metabolism, and excretion) models may be used early in the drug discovery process in order to flag drug candidates with potentially problematic ADME profiles. We report the development, validation, and application of quantitative structure−property relationship (QSPR) models of metabolic turnover rate for compounds in human S9 homogenate. Biological data were obtained from uniform bioassays of 631 diverse chemicals proprietary to GlaxoSmithKline (GSK). The models were built with topological molecular descriptors such as molecular connectivity indices or atom pairs using the k-nearest neighbor variable selection optimization method developed at the University of North Carolina (Zheng, W.; Tropsha, A. A novel variable selection QSAR approach based on the k-nearest neighbor principle. J. Chem. Inf. Comput. Sci., 2000, 40, 185−194.). For the purpose of validation, the whole data set was divided into training and test sets. The training set QSPR models were characterized by high internal accuracy with leave-one-out cross-validated R2 (q2) values ranging between 0.5 and 0.6. The test set compounds were correctly classified as stable or unstable in S9 assay with an accuracy above 85%. These models were additionally validated by in silico metabolic stability screening of 107 new chemicals under development in several drug discovery programs at GSK. One representative model generated with MolConnZ descriptors predicted 40 compounds to be metabolically stable (turnover rate less than 25%), and 33 of them were indeed found to be stable experimentally. This success (83% concordance) in correctly picking chemicals that are metabolically stable in the human S9 homogenate spells a rapid, computational screen for generating components of the ADME profile in a drug discovery process.Keywords
This publication has 17 references indexed in Scilit:
- Novel Variable Selection Quantitative Structure−Property Relationship Approach Based on thek-Nearest-Neighbor PrincipleJournal of Chemical Information and Computer Sciences, 1999
- Recognizing molecules with drug-like propertiesCurrent Opinion in Chemical Biology, 1999
- Managing the drug discovery/development interfaceDrug Discovery Today, 1997
- Applications of the radius-diameter diagram to the classification of topological and geometrical shapes of chemical compoundsJournal of Chemical Information and Computer Sciences, 1992
- A comment on nomenclature and the unsaturated bondJournal of Chemical Information and Computer Sciences, 1991
- Hydrophobic Properties of Chromones and Flavones. Relationships Between Octanol/Water Partition Coefficients and RP-HPLC Capacity FactorsQuantitative Structure-Activity Relationships, 1987
- Atom pairs as molecular features in structure-activity studies: definition and applicationsJournal of Chemical Information and Computer Sciences, 1985
- Isomer discrimination by topological information approachJournal of Computational Chemistry, 1981
- Characterization of molecular branchingJournal of the American Chemical Society, 1975
- Structural Determination of Paraffin Boiling PointsJournal of the American Chemical Society, 1947