New molecular shape descriptors: Application in database screening

Abstract
Summary Geometric descriptors are becoming popular tools for encoding molecular shape, for use in database screening and clustering calculations. They provide condensed representations of complex objects and, as a consequence, can usually be compared quite rapidly. Here we present a number of new descriptors and methods for the quantification of molecular shape similarity. The techniques are tested using two different biological systems, with particular emphasis on their potential utility as methods for prescreening shape-based database searches. Results are compared with data sets produced using the DOCK program. We find that such similarity evaluations are useful for finding molecules with complementary shape, and that they contain an enriched number of potential DOCK hits when compared to the original databases. Significant limitations in the utility of such DOCK prescreens are discussed, and potential solutions are considered.

This publication has 22 references indexed in Scilit: