Characterization and prediction of protein–protein interactions within and between complexes
- 3 October 2006
- journal article
- Published by Proceedings of the National Academy of Sciences in Proceedings of the National Academy of Sciences
- Vol. 103 (40) , 14718-14723
- https://doi.org/10.1073/pnas.0603352103
Abstract
Databases of experimentally determined protein interactions provide information on binary interactions and on involvement in multiprotein complexes. These data are valuable for understanding the general properties of the interaction between proteins as well as for the development of prediction schemes for unknown interactions. Here we analyze experimentally determined protein interactions by measuring various sequence, genomic, transcriptomic, and proteomic attributes of each interacting pair in the yeast Saccharomyces cerevisiae . We find that dividing the data into two groups, one that includes binary interactions within protein complexes (stable) and another that includes binary interactions that are not within complexes (transient), enables better characterization of the interactions by the different attributes and improves the prediction of new interactions. This analysis revealed that most attributes were more indicative in the set of intracomplex interactions. Using this data set for training, we integrated the different attributes by logistic regression and developed a predictive scheme that distinguishes between interacting and noninteracting protein pairs. Analysis of the logistic-regression model showed that one of the strongest contributors to the discrimination between interacting and noninteracting pairs is the presence of distinct pairs of domain signatures that were suggested previously to characterize interacting proteins. The predictive algorithm succeeds in identifying both intracomplex and other interactions (possibly the more stable ones), and its correct identification rate is 2-fold higher than that of large-scale yeast two-hybrid experiments.Keywords
This publication has 57 references indexed in Scilit:
- Conserved patterns of protein interaction in multiple speciesProceedings of the National Academy of Sciences, 2005
- A Map of the Interactome Network of the Metazoan C. elegansScience, 2004
- A Protein Interaction Map of Drosophila melanogasterScience, 2003
- Global analysis of protein localization in budding yeastNature, 2003
- BIND: the Biomolecular Interaction Network DatabaseNucleic Acids Research, 2003
- Transcriptional Regulatory Networks in Saccharomyces cerevisiaeScience, 2002
- Comparative assessment of large-scale data sets of protein–protein interactionsNature, 2002
- Systematic identification of protein complexes in Saccharomyces cerevisiae by mass spectrometryNature, 2002
- Functional organization of the yeast proteome by systematic analysis of protein complexesNature, 2002
- A comprehensive two-hybrid analysis to explore the yeast protein interactomeProceedings of the National Academy of Sciences, 2001