A probabilistic view of gene function
Open Access
- 27 May 2004
- journal article
- research article
- Published by Springer Nature in Nature Genetics
- Vol. 36 (6) , 559-564
- https://doi.org/10.1038/ng1370
Abstract
Cells are controlled by the complex and dynamic actions of thousands of genes. With the sequencing of many genomes, the key problem has shifted from identifying genes to knowing what the genes do; we need a framework for expressing that knowledge. Even the most rigorous attempts to construct ontological frameworks describing gene function (e.g., the Gene Ontology project) ultimately rely on manual curation and are thus labor-intensive and subjective. But an alternative exists: the field of functional genomics is piecing together networks of gene interactions, and although these data are currently incomplete and error-prone, they provide a glimpse of a new, probabilistic view of gene function. We outline such a framework, which revolves around a statistical description of gene interactions derived from large, systematically compiled data sets. In this probabilistic view, pleiotropy is implicit, all data have errors and the definition of gene function is an iterative process that ultimately converges on the correct functions. The relationships between the genes are defined by the data, not by hand. Even this comprehensive view fails to capture key aspects of gene function, not least their dynamics in time and space, showing that there are limitations to the model that must ultimately be addressed.Keywords
This publication has 58 references indexed in Scilit:
- The InterPro Database, 2003 brings increased coverage and new featuresNucleic Acids Research, 2003
- The MetaCyc DatabaseNucleic Acids Research, 2002
- The EcoCyc DatabaseNucleic Acids Research, 2002
- The KEGG databases at GenomeNetNucleic Acids Research, 2002
- MIPS: a database for genomes and protein sequencesNucleic Acids Research, 2002
- The Pfam Protein Families DatabaseNucleic Acids Research, 2002
- Human STAGA Complex Is a Chromatin-Acetylating Transcription Coactivator That Interacts with Pre-mRNA Splicing and DNA Damage-Binding Factors In VivoMolecular and Cellular Biology, 2001
- Creating the Gene Ontology Resource: Design and ImplementationGenome Research, 2001
- UV-damaged DNA-binding protein in the TFTC complex links DNA damage recognition to nucleosome acetylationThe EMBO Journal, 2001
- Gene Ontology: tool for the unification of biologyNature Genetics, 2000