CREDO: A Protein–Ligand Interaction Database for Drug Discovery
Open Access
- 22 January 2009
- journal article
- Published by Wiley in Chemical Biology & Drug Design
- Vol. 73 (2) , 157-167
- https://doi.org/10.1111/j.1747-0285.2008.00762.x
Abstract
Harnessing data from the growing number of protein–ligand complexes in the Protein Data Bank is an important task in drug discovery. In order to benefit from the abundance of three‐dimensional structures, structural data must be integrated with sequence as well as chemical data and the protein–small molecule interactions characterized structurally at the inter‐atomic level. In this study, we present CREDO, a new publicly available database of protein–ligand interactions, which represents contacts as structural interaction fingerprints, implements novel features and is completely scriptable through its application programming interface. Features of CREDO include implementation of molecular shape descriptors with ultrafast shape recognition, fragmentation of ligands in the Protein Data Bank, sequence‐to‐structure mapping and the identification of approved drugs. Selected analyses of these key features are presented to highlight a range of potential applications of CREDO. The CREDO dataset has been released into the public domain together with the application programming interface under a Creative Commons license at http://www-cryst.bioc.cam.ac.uk/credo. We believe that the free availability and numerous features of the CREDO database will be useful not only for commercial but also for academia‐driven drug discovery programmes.