PIBASE: a comprehensive database of structurally defined protein interfaces
Open Access
- 18 January 2005
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 21 (9) , 1901-1907
- https://doi.org/10.1093/bioinformatics/bti277
Abstract
Motivation: In recent years, the Protein Data Bank (PDB) has experienced rapid growth. To maximize the utility of the high resolution protein–protein interaction data stored in the PDB, we have developed PIBASE, a comprehensive relational database of structurally defined interfaces between pairs of protein domains. It is composed of binary interfaces extracted from structures in the PDB and the Probable Quaternary Structure server using domain assignments from the Structural Classification of Proteins and CATH fold classification systems. Results: PIBASE currently contains 158 915 interacting domain pairs between 105 061 domains from 2125 SCOP families. A diverse set of geometric, physiochemical and topologic properties are calculated for each complex, its domains, interfaces and binding sites. A subset of the interface properties are used to remove interface redundancy within PDB entries, resulting in 20 912 distinct domain–domain interfaces. The complexes are grouped into 989 topological classes based on their patterns of domain–domain contacts. The binary interfaces and their corresponding binding sites are categorized into 18 755 and 30 975 topological classes, respectively, based on the topology of secondary structure elements. The utility of the database is illustrated by outlining several current applications. Availability: The database is accessible via the world wide web at http://salilab.org/pibase Contact:sali@salilab.org Supplementary information:http://salilab.org/pibase/suppinfo.htmlKeywords
This publication has 47 references indexed in Scilit:
- BIND: the Biomolecular Interaction Network DatabaseNucleic Acids Research, 2003
- Systematic identification of protein complexes in Saccharomyces cerevisiae by mass spectrometryNature, 2002
- Functional organization of the yeast proteome by systematic analysis of protein complexesNature, 2002
- A comprehensive two-hybrid analysis to explore the yeast protein interactomeProceedings of the National Academy of Sciences, 2001
- The Protein Data BankNucleic Acids Research, 2000
- CATH – a hierarchic classification of protein domain structuresPublished by Elsevier ,1997
- Comparative Protein Modelling by Satisfaction of Spatial RestraintsJournal of Molecular Biology, 1993
- A novel genetic system to detect protein–protein interactionsNature, 1989
- Surface, subunit interfaces and interior of oligomeric proteinsJournal of Molecular Biology, 1988
- Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical featuresBiopolymers, 1983