Predicting protein functions from redundancies in large-scale protein interaction networks
- 17 October 2003
- journal article
- research article
- Published in Proceedings of the National Academy of Sciences
- Vol. 100 (22) , 12579-12583
- https://doi.org/10.1073/pnas.2132527100
Abstract
Interpreting data from large-scale protein interaction experiments has been a challenging task because of the widespread presence of random false positives. Here, we present a network-based statistical algorithm that overcomes this difficulty and allows us to derive functions of unannotated proteins from large-scale interaction data. Our algorithm uses the insight that if two proteins share a significantly larger number of common interaction partners than expected at random, they have close functional associations. Analysis of publicly available data from Saccharomyces cerevisiae reveals >2,800 reliable functional associations, 29% of which involve at least one unannotated protein. By further analyzing these associations, we derive tentative functions for 81 unannotated proteins with high certainty. Our method is not overly sensitive to the false positives present in the data. Even after adding 50% randomly generated interactions to the measured data set, we are able to recover almost all (≈89%) of the original associations.
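The shared-partner idea in the abstract can be illustrated with a minimal sketch. It assumes a hypergeometric null model for the number of partners two proteins would share by chance; the abstract does not spell out the exact statistic, so this choice, along with the function names `shared_partner_pvalue` and `rank_associations` and the toy protein labels, is an assumption made for illustration rather than the authors' implementation.

```python
from itertools import combinations
from math import comb


def shared_partner_pvalue(n_total, n1, n2, m):
    """Chance of sharing at least m partners under a hypergeometric null.

    Assumption: the n2 partners of one protein are drawn uniformly at random
    from n_total candidates, n1 of which are partners of the other protein.
    """
    upper = min(n1, n2)
    tail = sum(comb(n1, k) * comb(n_total - n1, n2 - k) for k in range(m, upper + 1))
    return tail / comb(n_total, n2)


def rank_associations(interactions):
    """Return (p_value, protein_a, protein_b, shared_count), smallest p first."""
    partners = {}
    for a, b in interactions:
        if a == b:
            continue  # ignore self-interactions
        partners.setdefault(a, set()).add(b)
        partners.setdefault(b, set()).add(a)

    # Candidate partners for a pair exclude the two proteins themselves.
    n_candidates = len(partners) - 2
    results = []
    for a, b in combinations(sorted(partners), 2):
        pa = partners[a] - {b}
        pb = partners[b] - {a}
        shared = pa & pb
        if not shared:
            continue
        p = shared_partner_pvalue(n_candidates, len(pa), len(pb), len(shared))
        results.append((p, a, b, len(shared)))
    results.sort()
    return results


# Hypothetical toy network: A and B share all three of their partners.
toy_edges = [
    ("A", "P1"), ("A", "P2"), ("A", "P3"),
    ("B", "P1"), ("B", "P2"), ("B", "P3"),
    ("C", "P4"), ("D", "P4"), ("C", "P5"),
    ("E", "P6"), ("F", "P6"),
]
ranked = rank_associations(toy_edges)
for p, a, b, shared in ranked[:3]:
    print(f"{a}-{b}: {shared} shared partners, p = {p:.4f}")
```

Pairs with the smallest p-values are the candidates for functional association. In a real analysis the p-value cutoff would need to account for the very large number of protein pairs tested, and the sketch makes no attempt to reproduce the paper's reported counts.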