Prediction of P-Glycoprotein Substrates by a Support Vector Machine Approach
- 8 May 2004
- journal article
- research article
- Published by American Chemical Society (ACS) in Journal of Chemical Information and Computer Sciences
- Vol. 44 (4) , 1497-1505
- https://doi.org/10.1021/ci049971e
Abstract
P-glycoproteins (P-gp) actively transport a wide variety of chemicals out of cells and function as drug efflux pumps that mediate multidrug resistance and limit the efficacy of many drugs. Methods for facilitating early elimination of potential P-gp substrates are useful for facilitating new drug discovery. A computational ensemble pharmacophore model has recently been used for the prediction of P-gp substrates with a promising accuracy of 63%. It is desirable to extend the prediction range beyond compounds covered by the known pharmacophore models. For such a purpose, a machine learning method, support vector machine (SVM), was explored for the prediction of P-gp substrates. A set of 201 chemical compounds, including 116 substrates and 85 nonsubstrates of P-gp, was used to train and test a SVM classification system. This SVM system gave a prediction accuracy of at least 81.2% for P-gp substrates based on two different evaluation methods, which is substantially improved against that obtained from the multiple-pharmacophore model. The prediction accuracy for nonsubstrates of P-gp is 79.2% using 5-fold cross-validation. These accuracies are slightly better than those obtained from other statistical classification methods, including k-nearest neighbor (k-NN), probabilistic neural networks (PNN), and C4.5 decision tree, that use the same sets of data and molecular descriptors. Our study indicates the potential of SVM in facilitating the prediction of P-gp substrates.Keywords
This publication has 27 references indexed in Scilit:
- Structure and mechanism of ABC transportersCurrent Opinion in Structural Biology, 2002
- A Computational Ensemble Pharmacophore Model for Identifying Substrates of P-GlycoproteinJournal of Medicinal Chemistry, 2002
- Classification of Multidrug-Resistance Reversal Agents Using Structure-Based Descriptors and Linear Discriminant AnalysisJournal of Medicinal Chemistry, 2000
- Handbook of Molecular DescriptorsPublished by Wiley ,2000
- BIOCHEMICAL, CELLULAR, AND PHARMACOLOGICAL ASPECTS OF THE MULTIDRUG TRANSPORTERAnnual Review of Pharmacology and Toxicology, 1999
- A general pattern for substrate recognition by P‐glycoproteinEuropean Journal of Biochemistry, 1998
- Quantitative Structure-Activity Relationship of Multidrug Resistance Reversal AgentsMolecular Pharmacology, 1997
- P-glycoprotein and multidrug resistanceCurrent Opinion in Genetics & Development, 1996
- The Nature of Statistical Learning TheoryPublished by Springer Nature ,1995
- Traditional topological indexes vs electronic, geometrical, and combined molecular descriptors in QSAR/QSPR researchJournal of Chemical Information and Computer Sciences, 1993