VIRGO: computational prediction of gene functions
Open Access
- 1 July 2006
- journal article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 34 (Web Server) , W340-W344
- https://doi.org/10.1093/nar/gkl225
Abstract
Dramatic advances in sequencing technology and sophisticated experimental assays that interrogate the cell, combined with the public availability of the resulting data, herald the era of systems biology. However, the biological functions of more than 40% of the genes in sequenced genomes are unknown, posing a fundamental barrier to progress in systems biology. The large scale and diversity of available data requires the development of techniques that can automatically utilize these datasets to make quantified and robust predictions of gene function that can be experimentally verified. We present a service called the VIRtual Gene Ontology (VIRGO) that (i) constructs a functional linkage network (FLN) from gene expression and molecular interaction data, (ii) labels genes in the FLN with their functional annotations in the Gene Ontology and (iii) systematically propagates these labels across the FLN in order to precisely predict the functions of unlabelled genes. VIRGO assigns confidence estimates to predicted functions so that a biologist can prioritize predictions for further experimental study. For each prediction, VIRGO also provides an informative ‘propagation diagram’ that traces the flow of information in the FLN that led to the prediction. VIRGO is available at .Keywords
This publication has 39 references indexed in Scilit:
- Mapping gene ontology to proteins based on protein–protein interaction dataBioinformatics, 2004
- Bioverse: functional, structural and contextual annotation of proteins and proteomesNucleic Acids Research, 2003
- Building with a scaffold: emerging strategies for high- to low-level cellular modelingTrends in Biotechnology, 2003
- A Bayesian framework for combining heterogeneous data sources for gene function prediction (inSaccharomyces cerevisiae)Proceedings of the National Academy of Sciences, 2003
- Global protein function prediction from protein-protein interaction networksNature Biotechnology, 2003
- The GRID: The General Repository for Interaction DatasetsGenome Biology, 2003
- Systematic Learning of Gene Functional Classes From DNA Array Expression Data by Using Multilayer PerceptronsGenome Research, 2002
- Functional Discovery via a Compendium of Expression ProfilesCell, 2000
- Gene Ontology: tool for the unification of biologyNature Genetics, 2000
- A combined algorithm for genome-wide prediction of protein functionNature, 1999