Chemical Substructure Search in SQL
- 15 December 2008
- journal article
- research article
- Published by American Chemical Society (ACS) in Journal of Chemical Information and Modeling
- Vol. 49 (1) , 22-27
- https://doi.org/10.1021/ci8003013
Abstract
We present a novel technique for a fast chemical substructure search on a relational database by use of a standard SQL query. The symmetry of a query graph is analyzed to give additional constraints. Our method is based on breadth-first search (BFS) algorithms implementation using Relational Database Management Systems (RDBMS). In addition to the chemical search we apply our technique to the field of intermolecular interactions which involves nonplanar graphs and describe how to achieve linear time performance along with the suggestion on how to sufficiently reduce the linear coefficient. From the algorithms theory perspective these results mean that subgraph isomorphism is a polynomial time problem, hence equal problems have the same complexity. The application to subgraph isomorphism in chemical search is available at http://www.ebi.ac.uk/msd-srv/chemsearch and http://www.ebi.ac.uk/msd-srv/msdmotif/chem. The application to the network of molecule interactions is available at http://www.ebi.ac.uk/msd-srv/msdmotif.Keywords
This publication has 19 references indexed in Scilit:
- MSDmotif: exploring protein sites and motifsBMC Bioinformatics, 2008
- Understanding topological symmetry: A heuristic approach to its determinationJournal of Computational Chemistry, 2007
- Query Chem: a Google-powered web search combining text and chemical structuresBioinformatics, 2006
- Crocker, Not Armit and Robinson, Begat the Six Aromatic ElectronsChemical Reviews, 2005
- MSDsite: A database search and retrieval system for the analysis and viewing of bound ligands and active sitesProteins-Structure Function and Bioinformatics, 2004
- The Protein Data BankNucleic Acids Research, 2000
- Messenger and S4: A Comparison of Structure Search SystemsJournal of Chemical Information and Computer Sciences, 1994
- Decomposing constraint satisfaction problems using database techniquesArtificial Intelligence, 1994
- Substructure searching methods: Old and newJournal of Chemical Information and Computer Sciences, 1993
- Kuratowski's theoremJournal of Graph Theory, 1981