Abstract
As a consequence of recent advances in the field of High Throughput Screening, the systematic testing ("in vitro profiling") of compounds against a panel of targets covering different therapeutic areas is nowadays used to generate relevant information with respect to the in vivo behavior of drug candidates. However, the development of chemoinformatics tools required for the exploitation of such data is yet in an incipient phase. In this paper, a formalism for the analysis of activity profile vectors (describing the experimental responses of compounds in each of the considered activity tests) is introduced and applied at the study of Neighborhood Behavior (NB; the hypothesis that structurally similar compounds display similar biological properties) of molecular similarity metrics. The experimental activity profiles define an Activity Space in which more than 500 drugs and reference compounds are positioned, their coordinates being inhibitory propensities in the included tests and unambiguously characterizing a molecule in terms of its receptor binding properties. While previous studies of Neighborhood Behavior had to rely on a loose classification of compounds in terms of the therapeutic areas they were designed for, here the NB of a calculated "in silico" similarity metric has been redefined as a relationships between intermolecular dissimilarity scores in the "structural" and "activity" spaces, respectively, and expressed in terms of two quantitative criteria: "consistency" (the propensity of the metric to selectively rank activity-related compound pairs among the structurally most similar pairs) and "completeness" (monitoring the retrieval rate of activity-related compound pairs among the best ranked pairs of structural neighbors). These criteria were used to calibrate and validate a similarity metric based on Fuzzy Bipolar Pharmacophore Fingerprints.