Prediction of Aqueous Solubility of Heteroatom-Containing Organic Compounds from Molecular Structure
- 17 July 2001
- journal article
- research article
- Published by American Chemical Society (ACS) in Journal of Chemical Information and Computer Sciences
- Vol. 41 (5) , 1237-1247
- https://doi.org/10.1021/ci010035y
Abstract
The use of quantitative structure−property relationships (QSPRs) to predict aqueous solubilities (log S) of heteroatom-containing organic compounds from their molecular structure is presented. Three data sets are examined. Data set 1 contains 176 compounds having one or more nitrogen atoms with some oxygen (log S[mol/L] range is −7.41 to 0.96). Data set 2 contains 223 compounds having one or more oxygen atoms, with no nitrogen (log S[mol/L] range is −8.77 to 1.57). Data set 3 contains all 399 compounds from sets 1 and 2 (log S[mol/L] range is −8.77 to 1.57). After descriptor generation and feature selection, multiple linear regression (MLR) and computational neural network (CNN) models are developed for aqueous solubility prediction. The best results were obtained with nonlinear CNN models. Root-mean-square (rms) errors for training with the three data sets ranged from 0.3 to 0.6 log units. All models were validated with external prediction sets, with the rms errors ranging from 0.6 log units to 1.5 log units.
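As an illustration of the workflow the abstract outlines, the following is a minimal sketch of fitting an MLR model to a descriptor matrix and reporting rms errors for a training set and an external prediction set. It is not the authors' implementation: the descriptor values and log S values are synthetic, the number of descriptors and the size of the held-out set are assumptions, and the helper names (`rms_error`, `fit_mlr`, `predict_mlr`) are hypothetical. Only the total of 399 compounds is taken from the abstract.

```python
import numpy as np


def rms_error(y_true, y_pred):
    """Root-mean-square error, in the units of y (here, log S units)."""
    y_true = np.asarray(y_true, dtype=float)
    y_pred = np.asarray(y_pred, dtype=float)
    return float(np.sqrt(np.mean((y_true - y_pred) ** 2)))


def fit_mlr(X, y):
    """Ordinary least-squares fit of y = b0 + X @ b; returns [b0, b1, ...]."""
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)
    # Prepend a column of ones so the first coefficient is the intercept.
    A = np.column_stack([np.ones(len(X)), X])
    coef, *_ = np.linalg.lstsq(A, y, rcond=None)
    return coef


def predict_mlr(coef, X):
    """Apply fitted MLR coefficients to a descriptor matrix."""
    return coef[0] + np.asarray(X, dtype=float) @ coef[1:]


# Synthetic stand-in data (assumption): 399 compounds, 5 descriptors.
rng = np.random.default_rng(0)
n_compounds, n_descriptors = 399, 5
X = rng.normal(size=(n_compounds, n_descriptors))
true_coef = rng.normal(size=n_descriptors)
y = -2.0 + X @ true_coef + rng.normal(scale=0.5, size=n_compounds)

# Hold out an external prediction set that is never used for fitting.
# The size of 40 compounds is an assumption made for this sketch.
idx = rng.permutation(n_compounds)
pred_idx, train_idx = idx[:40], idx[40:]

coef = fit_mlr(X[train_idx], y[train_idx])
train_rms = rms_error(y[train_idx], predict_mlr(coef, X[train_idx]))
pred_rms = rms_error(y[pred_idx], predict_mlr(coef, X[pred_idx]))

print(f"training rms error:   {train_rms:.2f} log units")
print(f"prediction rms error: {pred_rms:.2f} log units")
```

Reporting the error on compounds excluded from fitting, as the abstract does, gives a more honest estimate of predictive quality than the training error alone. A nonlinear model could be substituted for `fit_mlr` and `predict_mlr` while keeping the same evaluation.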