Selection of data sets for QSARs: Analyses of Tetrahymena toxicity from aromatic compounds
- 1 January 2003
- journal article
- research article
- Published by Taylor & Francis in SAR and QSAR in Environmental Research
- Vol. 14 (1) , 59-81
- https://doi.org/10.1080/1062936021000058782
Abstract
The aim of this investigation was to develop a strategy for the formulation of a valid ecotoxicologically based QSAR while, at the same time, minimizing the required number of toxicological data points. Two chemical selection approaches, distance-based optimality and K-nearest neighbor (KNN), were used to examine the impact of the number of compounds used in the training and testing phases of QSAR development (i.e., diversity and representivity, respectively) on the predictivity (i.e., external validation) of the QSAR. Regression-based QSARs for the ecotoxic potency for population growth impairment of aromatic compounds (benzenes) to the aquatic ciliate Tetrahymena pyriformis were developed based on descriptors for chemical hydrophobicity and electrophilicity. A ratio of one compound in the training set to three in the test set was applied. The results indicate that from a known chemical universe, in this case 385 derivatives, robust QSARs of equal quality may be developed from a small number of diverse compounds, validated by a representative test set. As a conservative recommendation, it is suggested that there should be a minimum of 10 observations for each variable in a QSAR.
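The selection idea in the abstract can be illustrated with a minimal sketch. It is not the authors' procedure: the abstract does not specify the distance-based optimality algorithm, so a simple greedy max-min (farthest-point) rule stands in for it here. The function names `maxmin_select` and `min_observations` are hypothetical, the two descriptor columns are random placeholders rather than data from the study, and the descriptors are assumed to be on comparable scales already.

```python
import math
import random


def maxmin_select(points, n_train, start=0):
    """Greedy max-min selection (an assumed stand-in for distance-based
    optimality): repeatedly add the point farthest from those already chosen."""
    chosen = [start]
    # Distance from every point to its nearest already-chosen point.
    nearest = [math.dist(p, points[start]) for p in points]
    while len(chosen) < n_train:
        nxt = max(range(len(points)), key=lambda i: nearest[i])
        chosen.append(nxt)
        for i, p in enumerate(points):
            nearest[i] = min(nearest[i], math.dist(p, points[nxt]))
    return chosen


def min_observations(n_descriptors, per_variable=10):
    """Rule of thumb from the abstract: at least 10 observations per variable."""
    return n_descriptors * per_variable


random.seed(0)
# Synthetic placeholder descriptors (NOT values from the paper):
# 385 compounds, one hydrophobicity-like and one electrophilicity-like column.
universe = [(random.uniform(0, 6), random.uniform(-2, 1)) for _ in range(385)]

# One training compound for every three test compounds (about a quarter in training).
n_train = len(universe) // 4
train_idx = maxmin_select(universe, n_train)
train_set = set(train_idx)
test_idx = [i for i in range(len(universe)) if i not in train_set]

# A two-descriptor regression needs at least 20 training observations by the rule of thumb.
enough = n_train >= min_observations(2)
```

With 385 compounds this gives 96 training and 289 test compounds, comfortably above the 20 observations that the 10-per-variable guideline would require for a two-descriptor model. In practice the descriptors would be standardized before computing distances so that neither dominates the selection.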