Retrieval of Crystallographically-Derived Molecular Geometry Information
Top Cited Papers
- 13 October 2004
- journal article
- research article
- Published by American Chemical Society (ACS) in Journal of Chemical Information and Computer Sciences
- Vol. 44 (6) , 2133-2144
- https://doi.org/10.1021/ci049780b
Abstract
The crystallographically determined bond length, valence angle, and torsion angle information in the Cambridge Structural Database (CSD) has many uses. However, accessing it by means of conventional substructure searching requires nontrivial user intervention. In consequence, these valuable data have been underutilized and have not been directly accessible to client applications. The situation has been remedied by development of a new program (Mogul) for automated retrieval of molecular geometry data from the CSD. The program uses a system of keys to encode the chemical environments of fragments (bonds, valence angles, and acyclic torsions) from CSD structures. Fragments with identical keys are deemed to be chemically identical and are grouped together, and the distribution of the appropriate geometrical parameter (bond length, valence angle, or torsion angle) is computed and stored. Use of a search tree indexed on key values, together with a novel similarity calculation, then enables the distribution matching any given query fragment (or the distributions most closely matching, if an adequate exact match is unavailable) to be found easily and with no user intervention. Validation experiments indicate that, with rare exceptions, search results afford precise and unbiased estimates of molecular geometrical preferences. Such estimates may be used, for example, to validate the geometries of libraries of modeled molecules or of newly determined crystal structures or to assist structure solution from low-resolution (e.g. powder diffraction) X-ray data.Keywords
This publication has 11 references indexed in Scilit:
- CRYSTALSversion 12: software for guided crystal structure analysisJournal of Applied Crystallography, 2003
- New software for searching the Cambridge Structural Database and visualizing crystal structuresActa crystallographica Section B, Structural science, crystal engineering and materials, 2002
- A citation analysis of the Cambridge Crystallographic Data CentreJournal of Applied Crystallography, 2001
- Comparison of Knowledge-Based and Distance Geometry Approaches for Generation of Molecular ConformationsJournal of Chemical Information and Computer Sciences, 2001
- BALI: Automatic Assignment of Bond and Atom Types for Protein Ligands in the Brookhaven Protein DatabankJournal of Chemical Information and Computer Sciences, 1997
- Development and validation of a genetic algorithm for flexible docking 1 1Edited by F. E. CohenJournal of Molecular Biology, 1997
- Time-efficient flexible superposition of medium-sized moleculesJournal of Computer-Aided Molecular Design, 1997
- A Fast Flexible Docking Method using an Incremental Construction AlgorithmJournal of Molecular Biology, 1996
- Automatic assignment of chemical connectivity to organic molecules in the Cambridge Structural DatabaseJournal of Chemical Information and Computer Sciences, 1992
- Structure and crystal chemistry of mixed-valence ternary platinum oxides: MnPt3O6, CoPt3O6, ZnPt3O6, MgPt3O6, and NiPt3O6: erratumActa crystallographica Section B, Structural science, crystal engineering and materials, 1983