Supervised Self-Organizing Maps in Drug Discovery. 1. Robust Behavior with Overdetermined Data Sets
- 4 October 2005
- journal article
- research article
- Published by American Chemical Society (ACS) in Journal of Chemical Information and Modeling
- Vol. 45 (6) , 1749-1758
- https://doi.org/10.1021/ci0500839
Abstract
The utility of the supervised Kohonen self-organizing map was assessed and compared to several statistical methods used in QSAR analysis. The self-organizing map (SOM) describes a family of nonlinear, topology preserving mapping methods with attributes of both vector quantization and clustering that provides visualization options unavailable with other nonlinear methods. In contrast to most chemometric methods, the supervised SOM (sSOM) is shown to be relatively insensitive to noise and feature redundancy. Additionally, sSOMs can make use of descriptors having only nominal linear correlation with the target property. Results herein are contrasted to partial least squares, stepwise multiple linear regression, the genetic functional algorithm, and genetic partial least squares, collectively referred to throughout as the “standard methods”. The k-nearest neighbor (kNN) classification method was also performed to provide a direct comparison with a different classification method. The widely studied dihydrofolate reductase (DHFR) inhibition data set of Hansch and Silipo is used to evaluate the ability of sSOMs to classify unknowns as a function of increasing class resolution. The contribution of the sSOM neighborhood kernel to its predictive ability is assessed in two experiments: (1) training with the k-means clustering limit, where the neighborhood radius is zero throughout the training regimen, and (2) training the sSOM until the neighborhood radius is reduced to zero. Results demonstrate that sSOMs provide more accurate predictions than standard linear QSAR methods.Keywords
This publication has 10 references indexed in Scilit:
- The Use of Self-organizing Neural Networks in Drug DesignPublished by Springer Nature ,2005
- An Integrated SOM-Fuzzy ARTMAP Neural System for the Evaluation of ToxicityJournal of Chemical Information and Computer Sciences, 2002
- Self-Organizing MapsPublished by Springer Nature ,2001
- Neural Networks in ChemistryAngewandte Chemie International Edition in English, 1993
- Genetic algorithms as a strategy for feature selectionJournal of Chemometrics, 1992
- Partial least-squares method for spectrofluorimetric analysis of mixtures of humic acid and lignin sulfonateAnalytical Chemistry, 1983
- Correlation analysis. Its application to the structure-activity relation of triazines inhibiting dihydrofolate reductaseJournal of the American Chemical Society, 1975
- Quantitative approach to biochemical structure-activity relationshipsAccounts of Chemical Research, 1969
- Nearest neighbor pattern classificationIEEE Transactions on Information Theory, 1967
- Adaptive Control ProcessesPublished by Walter de Gruyter GmbH ,1961