Multivariate Association of Graph-Theoretic Variables and Physicochemical Properties

Abstract
We used canonical correlation analysis to examine the multivariate association between two distinct data sets commonly measured or calculated for approximately 600 chemicals: (1) measured or calculated values of select physieochemical properties (i.e., K ow, boiling point, heat of vaporization, molecular weight, water solubility, molecular volume, hydrogen bonding potential, and vapor pressure) and (2) calculated algorithmically-derived variables (i.e., topological and neighborhood indices derived from graph theory). Canonical correlation analysis identified eight highly significant associations between linear combinations of graph-theoretic variables and linear combinations of physicochemical properties. The set of graph theoretic variables was significantly related to all physieochemical properties, explaining 55% to 99% of the variation in these properties.