Global protein function annotation through mining genome-scale data in yeast Saccharomyces cerevisiae
Open Access
- 1 January 2004
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 32 (21) , 6414-6424
- https://doi.org/10.1093/nar/gkh978
Abstract
As we are moving into the post genome-sequencing era, various high-throughput experimental techniques have been developed to characterize biological systems on the genomic scale. Discovering new biological knowledge from the high-throughput biological data is a major challenge to bioinformatics today. To address this challenge, we developed a Bayesian statistical method together with Boltzmann machine and simulated annealing for protein functional annotation in the yeast Saccharomyces cerevisiae through integrating various high-throughput biological data, including yeast two-hybrid data, protein complexes and microarray gene expression profiles. In our approach, we quantified the relationship between functional similarity and high-throughput data, and coded the relationship into ‘functional linkage graph’, where each node represents one protein and the weight of each edge is characterized by the Bayesian probability of function similarity between two proteins. We also integrated the evolution information and protein subcellular localization information into the prediction. Based on our method, 1802 out of 2280 unannotated proteins in yeast were assigned functions systematically.Keywords
This publication has 32 references indexed in Scilit:
- New Nanostructured Carbon Coating Inhibits Bacterial Growth, but Does Not Influence on Animal CellsNanomaterials, 2020
- Predicting Subcellular Localization via Protein Motif Co-OccurrenceGenome Research, 2004
- Predicting Protein Complex Membership Using Probabilistic Network ReliabilityGenome Research, 2004
- Global analysis of protein localization in budding yeastNature, 2003
- Global protein function prediction from protein-protein interaction networksNature Biotechnology, 2003
- The Constraints Protein–Protein Interactions Place on Sequence DivergenceJournal of Molecular Biology, 2002
- Systematic identification of protein complexes in Saccharomyces cerevisiae by mass spectrometryNature, 2002
- Functional organization of the yeast proteome by systematic analysis of protein complexesNature, 2002
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programsNucleic Acids Research, 1997
- Life with 6000 GenesScience, 1996