Genetic Algorithm-Based Protocol for Coupling Digital Filtering and Partial Least-Squares Regression: Application to the Near-Infrared Analysis of Glucose in Biological Matrices
- 1 August 1996
- journal article
- research article
- Published by American Chemical Society (ACS) in Analytical Chemistry
- Vol. 68 (15) , 2663-2675
- https://doi.org/10.1021/ac960049g
Abstract
A multivariate calibration procedure is described that is based on the use of a genetic algorithm (GA) to guide the coupling of bandpass digital filtering and partial least-squares (PLS) regression. The measurement of glucose in three different biological matrices with near-infrared spectroscopy is employed to develop this protocol. The GA is employed to optimize the position and width of the bandpass digital filter, the spectral range for PLS regression, and the number of PLS factors used in building the calibration model. The optimization of these variables is difficult because the values of the variables employ different units, resulting in a tendency for local optima to occur on the response surface of the optimization. Two issues are found to be critical to the success of the optimization: the configuration of the GA and the development of an appropriate fitness function. An integer representation for the GA is employed to overcome the difficulty in optimizing variables that are dissimilar, and the optimal GA configuration is found through experimental design methods. Three fitness function calculations are compared for their ability to lead the GA to better calibration models. A fitness function based on the combination of the mean-squared error in the calibration set data, the mean-squared error in the monitoring set data, and the number of PLS factors raised to a weighting factor is found to perform best. Multiple random drawings of the calibration and monitoring sets are also found to improve the optimization performance. Using this fitness function and three random drawings of the calibration and monitoring sets, the GA found calibration models that required fewer PLS factors yet had similar or better prediction abilities compared to calibration models found through an optimization protocol based on a grid search method.Keywords
This publication has 12 references indexed in Scilit:
- Robust regression and outlier detection for non-linear models using genetic algorithmsChemometrics and Intelligent Laboratory Systems, 1995
- Understanding and using genetic algorithms Part 2. Representation, configuration and hybridizationChemometrics and Intelligent Laboratory Systems, 1994
- Temperature-Insensitive Near-Infrared Spectroscopic Measurement of Glucose in Aqueous SolutionsApplied Spectroscopy, 1994
- Genetic algorithms in wavelength selection: a comparative studyAnalytica Chimica Acta, 1994
- Near-infrared spectroscopic measurement of glucose in a protein matrixAnalytical Chemistry, 1993
- Strategies for coupling digital filtering with partial least-squares regression: Application to the determination of glucose in plasma by Fourier-transform near-infrared spectroscopyAnalytical Chemistry, 1993
- The parsimony principle applied to multivariate calibrationAnalytica Chimica Acta, 1993
- Determination of physiological levels of glucose in an aqueous matrix with digitally filtered Fourier transform near-infrared spectraAnalytical Chemistry, 1990
- Analytical applications of Monte Carlo techniquesAnalytical Chemistry, 1990
- Global optimization by simulated annealing with wavelength selection for ultraviolet-visible spectrophotometryAnalytical Chemistry, 1989