Generating Optimal Linear PLS Estimations (GOLPE): An Advanced Chemometric Tool for Handling 3D‐QSAR Problems
- 1 January 1993
- journal article
- research article
- Published by Wiley in Quantitative Structure-Activity Relationships
- Vol. 12 (1) , 9-20
- https://doi.org/10.1002/qsar.19930120103
Abstract
An advanced variable selection procedure, called GOLPE, aimed at obtaining PLS regression models with the highest prediction ability is presented and illustrated with an application in 3D‐QSAR. Key steps in the procedure are a preliminary variable selection by means of a D‐optimal design in the loading space, and an iterative evaluation of the effects of individual variables on the model predictivity based on the validation of a number of reduced models, on variables combinations selected according to a FFD strategy.The procedure is successfully applied to a real 3D‐QSAR case study: the results obtained by GOLPE are compared with those obtained by CoMFA and found to be in good agreement in terms of variable importance, but with a much higher prediction ability. Accordingly, the results encourage to think that it might be used within the CoMFA framework in the place of the present PLS version there, or in CoMFA‐like studies on the structures generated by GRID probes.Keywords
This publication has 24 references indexed in Scilit:
- PLS regression methodsJournal of Chemometrics, 1988
- Crossvalidation, Bootstrapping, and Partial Least Squares Compared with Multiple Regression in Conventional QSAR StudiesQuantitative Structure-Activity Relationships, 1988
- Polychlorinated dibenzofurans (PCDFs): Correlation between in vivo and in vitro structure-activity relationshipsToxicology, 1985
- Estimating Optimal Transformations for Multiple Regression and CorrelationJournal of the American Statistical Association, 1985
- A computational procedure for determining energetically favorable binding sites on biologically important macromoleculesJournal of Medicinal Chemistry, 1985
- Cross-Validatory Choice of the Number of Components From a Principal Component AnalysisTechnometrics, 1982
- Cross-Validatory Estimation of the Number of Components in Factor and Principal Components ModelsTechnometrics, 1978
- SIMCA: A Method for Analyzing Chemical Data in Terms of Similarity and AnalogyPublished by American Chemical Society (ACS) ,1977
- The Predictive Sample Reuse Method with ApplicationsJournal of the American Statistical Association, 1975
- A predictive approach to the random effect modelBiometrika, 1974