Advanced Exact Structure Searching in Large Databases of Chemical Compounds
- 22 February 2003
- journal article
- Published by American Chemical Society (ACS) in Journal of Chemical Information and Computer Sciences
- Vol. 43 (3) , 852-860
- https://doi.org/10.1021/ci025582d
Abstract
Efficient recognition of tautomeric compound forms in large corporate or commercially available compound databases is a difficult and labor intensive task. Our data indicate that up to 0.5% of commercially available compound collections for bioscreening contain tautomers. Though in the large registry databases, such as Beilstein and CAS, the tautomers are found in an automated fashion using high-performance computational technologies, their real-time recognition in the nonregistry corporate databases, as a rule, remains problematic. We have developed an effective algorithm for tautomer searching based on the proprietary chemoinformatics platform. This algorithm reduces the compound to a canonical structure. This feature enables rapid, automated computer searching of most of the known tautomeric transformations that occur in databases of organic compounds. Another useful extension of this methodology is related to the ability to effectively search for different forms of compounds that contain ionic and semipolar bonds. The computations are performed in the Windows environment on a standard personal computer, a very useful feature. The practical application of the proposed methodology is illustrated by several examples of successful recovery of tautomers and different forms of ionic compounds from real commercially available nonregistry databases.Keywords
This publication has 5 references indexed in Scilit:
- New Diversity Calculations Algorithms Used for Compound SelectionJournal of Chemical Information and Computer Sciences, 2002
- Solvent Effects on Relative Stability of Meridine and Its Tautomer: MO CalculationsHETEROCYCLES, 2001
- CheD: Chemical Database Compilation Tool, Internet Server, and Client for SQL ServersJournal of Chemical Information and Computer Sciences, 2000
- Observation of thermal tautomerism of thermochromic salicylideneaniline derivatives in the solid state by 15N CPMAS NMR down to cryogenic temperaturesBerichte der Bunsengesellschaft für physikalische Chemie, 1998
- The Chemical Abstracts Service Chemical Registry System. VII. Tautomerism and Alternating BondsJournal of Chemical Information and Computer Sciences, 1980