GA Strategy for Variable Selection in QSAR Studies: Application of GA-Based Region Selection to a 3D-QSAR Study of Acetylcholinesterase Inhibitors
- 14 November 1998
- journal article
- research article
- Published by American Chemical Society (ACS) in Journal of Chemical Information and Computer Sciences
- Vol. 39 (1), 112-120
- https://doi.org/10.1021/ci980088o
Abstract
Comparative molecular field analysis (CoMFA) with partial least squares (PLS) is one of the most frequently used tools in three-dimensional quantitative structure−activity relationship (3D-QSAR) studies. Although many successful CoMFA applications have proved the value of this approach, there are some problems in its proper application. In particular, the inability of PLS to handle the low signal-to-noise ratio (sample-to-variable ratio) has attracted much attention from QSAR researchers as an exciting research target, and several variable selection methods have been proposed. More recently, we have developed a novel variable selection method for CoMFA modeling (GARGS: genetic algorithm-based region selection), and its utility was demonstrated in a previous paper (Kimura, T., et al. J. Chem. Inf. Comput. Sci. 1998, 38, 276−282). The purpose of this study is to evaluate whether GARGS can pinpoint known molecular interactions in 3D space. We used a published set of acetylcholinesterase (AChE) inhibitors as a test example. Applying GARGS to this data set yielded several improved models with high internal predictivity and a low number of field variables. External validation was performed to select a final model among them. The coefficient contour maps of the final GARGS model were compared with the properties of the active site in AChE, and the consistency between them was evaluated.
This publication has 13 references indexed in Scilit:
- GA Strategy for Variable Selection in QSAR Studies: GA-Based Region Selection for CoMFA Modeling. Journal of Chemical Information and Computer Sciences, 1998
- GA Strategy for Variable Selection in QSAR Studies: GA-Based PLS Analysis of Calcium Channel Antagonists. Journal of Chemical Information and Computer Sciences, 1997
- 3D-QSAR: a current perspective. Trends in Pharmacological Sciences, 1995
- Structure-based drug design: progress, results and challenges. Structure, 1994
- Variable Selection in QSAR Studies. II. A Highly Efficient Combination of Systematic Search and Evolution. Quantitative Structure-Activity Relationships, 1994
- Application of a genetic algorithm to feature selection under full validation conditions and to outlier detection. Journal of Chemometrics, 1994
- On the prediction of binding properties of drug molecules by comparative molecular field analysis. Journal of Medicinal Chemistry, 1993
- The Probability of Chance Correlation Using Partial Least Squares (PLS). Quantitative Structure-Activity Relationships, 1993
- Genetic algorithms as a strategy for feature selection. Journal of Chemometrics, 1992
- A computational procedure for determining energetically favorable binding sites on biologically important macromolecules. Journal of Medicinal Chemistry, 1985