An evaluation of human protein-protein interaction data in the public domain
Open Access
- 18 December 2006
- journal article
- research article
- Published by Springer Nature in BMC Bioinformatics
- Vol. 7 (S5) , 1-14
- https://doi.org/10.1186/1471-2105-7-s5-s19
Abstract
Protein-protein interaction (PPI) databases have become a major resource for investigating biological networks and pathways in cells. A number of public repositories for human PPIs are currently available. Each of these databases has its own unique features, with large variation in the type and depth of annotation. We analyzed the major publicly available primary databases that contain literature-curated PPI information for human proteins: BIND, DIP, HPRD, IntAct, MINT, MIPS, PDZBase and Reactome. The number of binary non-redundant human PPIs ranged from 101 in PDZBase and 346 in MIPS to 11,367 in MINT and 36,617 in HPRD. The number of genes annotated with at least one interactor was 9,427 in HPRD, 4,975 in MINT, 4,614 in IntAct, 3,887 in BIND and <1,000 in each of the remaining databases. The number of literature citations for the PPIs included in the databases was 43,634 in HPRD, 11,480 in MINT, 10,331 in IntAct, 8,020 in BIND and <2,100 in each of the remaining databases. Given the importance of PPIs, we suggest that scientific journals make submission of PPIs to repositories mandatory at the time of manuscript submission, as this will minimize annotation errors, promote standardization and help keep the information up to date. We hope that our analysis will help guide biomedical scientists in selecting the most appropriate database for their needs, especially in light of the dramatic differences in content among the databases.
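The two headline counts in the abstract (binary non-redundant PPIs, and genes with at least one interactor) can be illustrated with a minimal sketch. This is not the authors' pipeline. It assumes each database record has already been reduced to a pair of gene symbols, that A-B and B-A count as the same interaction, and that self-interactions are kept. The function names and the toy records are hypothetical.

```python
from typing import FrozenSet, Iterable, Set, Tuple


def nonredundant_pairs(records: Iterable[Tuple[str, str]]) -> Set[FrozenSet[str]]:
    """Collapse duplicate and reversed records into unordered pairs.

    Assumption: direction is ignored, so ("A", "B") and ("B", "A")
    are the same binary interaction. A self-interaction ("A", "A")
    becomes a one-element frozenset and is kept.
    """
    return {frozenset((a, b)) for a, b in records}


def genes_with_interactor(pairs: Iterable[FrozenSet[str]]) -> Set[str]:
    """Return every gene that appears in at least one interaction."""
    return {gene for pair in pairs for gene in pair}


# Toy records for illustration only; not taken from any of the databases.
records = [
    ("TP53", "MDM2"),
    ("MDM2", "TP53"),   # reversed duplicate of the first record
    ("TP53", "MDM2"),   # exact duplicate
    ("EGFR", "GRB2"),
    ("EGFR", "EGFR"),   # self-interaction, kept under the assumption above
]

pairs = nonredundant_pairs(records)
genes = genes_with_interactor(pairs)

print(len(pairs))  # 3 non-redundant binary interactions
print(len(genes))  # 4 genes with at least one interactor
```

Using unordered pairs as set members makes the redundancy rule explicit. A comparison across real databases would also need identifier mapping to a common gene namespace first, which this sketch leaves out.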