Onion design and its application to a pharmaceutical QSAR problem
- 1 March 2004
- journal article
- research article
- Published by Wiley in Journal of Chemometrics
- Vol. 18 (3-4) , 188-202
- https://doi.org/10.1002/cem.854
Abstract
Statistical molecular design (SMD) is an efficient tool for selecting informative, representative and diverse sets of molecular structures to be used in conjunction with QSAR, combinatorial technologies and other areas of research depending on optimization of molecular properties. Onion design represents a recent addition to the plethora of designs encountered in the SMD toolbox. It is a flexible design approach relying on a combination of the best properties of other design families, notably the model support property of D‐optimal design and the uniform coverage ability of space‐filling design. The onion design splits the candidate set into a number of subsets (‘shells’ or ‘layers’), and a D‐optimal selection is made from each shell. This makes it possible to select representative sets of molecular structures throughout any property space with reasonable design sizes. The number of selected molecules is easily controlled by varying (i) the number of shells and (ii) the model on which the design is based. The applicability of onion design to a pharmaceutical QSAR problem is reported. The example data set contains 967 drug‐like molecules. The biological activity under investigation is the inhibition of the major human drug‐metabolizing enzyme cytochrome P450 3A4. Onion design is used to select an informative training set. QSAR modeling is accomplished by means of multivariate data analysis tools. Copyright © 2004 John Wiley & Sons, Ltd.
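The shell-wise selection described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation, and several choices are assumptions made only for illustration: shells are formed by ranking candidates on Euclidean distance from the centroid of the property space and cutting the ranking into equal-sized groups, the design model is linear with an intercept, and the D‐optimal step is replaced by a simple greedy heuristic that adds one candidate at a time to maximize the log-determinant of the information matrix. The function names `greedy_d_optimal` and `onion_design` and the parameters `n_shells`, `n_per_shell` and `ridge` are hypothetical.

```python
import numpy as np


def greedy_d_optimal(X, n_select, ridge=1e-6):
    """Greedy stand-in for a D-optimal selection (assumption: linear model).

    Adds one row at a time, each time choosing the row that maximizes
    log det(M_s^T M_s + ridge * I), where M is the model matrix
    (intercept plus linear terms). Returns indices into X.
    """
    M = np.hstack([np.ones((X.shape[0], 1)), X])
    info = ridge * np.eye(M.shape[1])  # small ridge keeps the determinant finite
    chosen = []
    for _ in range(min(n_select, X.shape[0])):
        best_idx, best_val = -1, -np.inf
        for i in range(M.shape[0]):
            if i in chosen:
                continue
            _, logdet = np.linalg.slogdet(info + np.outer(M[i], M[i]))
            if logdet > best_val:
                best_idx, best_val = i, logdet
        chosen.append(best_idx)
        info += np.outer(M[best_idx], M[best_idx])
    return chosen


def onion_design(X, n_shells=3, n_per_shell=4):
    """Split candidates into distance-based shells and select from each.

    Assumption: shells are equal-sized groups of candidates ranked by
    Euclidean distance from the centroid. Returns indices into X,
    ordered from the innermost shell to the outermost.
    """
    X = np.asarray(X, dtype=float)
    Xc = X - X.mean(axis=0)                 # center the property space
    dist = np.linalg.norm(Xc, axis=1)       # distance of each candidate from the center
    shells = np.array_split(np.argsort(dist), n_shells)
    selected = []
    for shell in shells:
        local = greedy_d_optimal(Xc[shell], n_per_shell)
        selected.extend(shell[local].tolist())
    return selected


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    candidates = rng.normal(size=(60, 3))   # synthetic stand-in for molecular descriptors
    print(onion_design(candidates, n_shells=3, n_per_shell=4))
```

In this sketch the design size is `n_shells * n_per_shell`, which mirrors the abstract's point that the number of selected molecules is controlled by the number of shells and by the model: a richer model would add columns to the model matrix and typically call for more points per shell.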