Accurate extraction of functional associations between proteins based on common interaction partners and common domains
Open Access
- 4 February 2005
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 21 (9) , 2043-2048
- https://doi.org/10.1093/bioinformatics/bti305
Abstract
Motivation: Genomic and proteomic approaches have accumulated a huge amount of data which provide clues to protein function. However, interpreting single omic data for predicting uncharacterized protein functions has been a challenging task, because the data contain a lot of false positives. To overcome this problem, methods for integrating data from various omic approaches are needed for more accurate function prediction. Result: In this paper, we have developed a method which extracts functionally similar proteins with high confidence by integrating protein–protein interaction data and domain information. We used this method to analyze publicly available data from Saccharomyces cerevisiae. We identified 1042 functional associations, involving 765 proteins of which 98 (12.8%) had no previously ascribed function. Our method extracts functionally similar protein pairs more accurately than conventional methods, and predicting function for previously uncharacterized proteins can be achieved. Our method can of course be applied to protein–protein interaction data for any species. Contact:okada-k@cb.k.u-tokyo.ac.jpKeywords
This publication has 19 references indexed in Scilit:
- A Map of the Interactome Network of the Metazoan C. elegansScience, 2004
- A Protein Interaction Map of Drosophila melanogasterScience, 2003
- Integrating ‘omic’ information: a bridge between genomics and systems biologyTrends in Genetics, 2003
- InterDom: a database of putative interacting protein domains for validating predicted protein interactions and complexesNucleic Acids Research, 2003
- Analyzing yeast protein–protein interaction data obtained from different sourcesNature Biotechnology, 2002
- The Natural History of Protein DomainsAnnual Review of Biophysics, 2002
- Protein InteractionsMolecular & Cellular Proteomics, 2002
- MIPS: a database for genomes and protein sequencesNucleic Acids Research, 2002
- An information-based sequence distance and its application to whole mitochondrial genome phylogenyBioinformatics, 2001
- The protein kinases of budding yeast: six score and moreTrends in Biochemical Sciences, 1997