Systematic survey reveals general applicability of "guilt-by-association" within gene coexpression networks
Top Cited Papers
Open Access
- 14 September 2005
- journal article
- research article
- Published by Springer Nature in BMC Bioinformatics
- Vol. 6 (1) , 227
- https://doi.org/10.1186/1471-2105-6-227
Abstract
Background: Biological processes are carried out by coordinated modules of interacting molecules. As clustering methods demonstrate that genes with similar expression display increased likelihood of being associated with a common functional module, networks of coexpressed genes provide one framework for assigning gene function. This has informed the guilt-by-association (GBA) heuristic, widely invoked in functional genomics. Yet although the idea of GBA is accepted, the breadth of GBA applicability is uncertain. Results: We developed methods to systematically explore the breadth of GBA across a large and varied corpus of expression data to answer the following question: To what extent is the GBA heuristic broadly applicable to the transcriptome and conversely how broadly is GBA captured by a priori knowledge represented in the Gene Ontology (GO)? Our study provides an investigation of the functional organization of five coexpression networks using data from three mammalian organisms. Our method calculates a probabilistic score between each gene and each Gene Ontology category that reflects coexpression enrichment of a GO module. For each GO category we use Receiver Operating Curves to assess whether these probabilistic scores reflect GBA. This methodology applied to five different coexpression networks demonstrates that the signature of guilt-by-association is ubiquitous and reproducible and that the GBA heuristic is broadly applicable across the population of nine hundred Gene Ontology categories. We also demonstrate the existence of highly reproducible patterns of coexpression between some pairs of GO categories. Conclusion: We conclude that GBA has universal value and that transcriptional control may be more modular than previously realized. Our analyses also suggest that methodologies combining coexpression measurements across multiple genes in a biologically-defined module can aid in characterizing gene function or in characterizing whether pairs of functions operate together.Keywords
This publication has 25 references indexed in Scilit:
- Standardizing global gene expression analysis between laboratories and across platformsNature Methods, 2005
- An oncogenic KRAS2 expression signature identified by cross-species gene-expression analysisNature Genetics, 2004
- A Gene-Coexpression Network for Global Discovery of Conserved Genetic ModulesScience, 2003
- PGC-1α-responsive genes involved in oxidative phosphorylation are coordinately downregulated in human diabetesNature Genetics, 2003
- Global functional profiling of gene expression☆☆This work was funded in part by a Sun Microsystems grant awarded to S.D., NIH Grant HD36512 to S.A.K., a Wayne State University SOM Dean’s Post-Doctoral Fellowship, and an NICHD Contraception and Infertility Loan to G.C.O. Support from the WSU MCBI mode is gratefully appreciated.Genomics, 2003
- Revealing modular organization in the yeast transcriptional networkNature Genetics, 2002
- Large-scale prediction of Saccharomyces cerevisiae gene function using overlapping transcriptional clustersNature Genetics, 2002
- Computational analysis of microarray dataNature Reviews Genetics, 2001
- Functional Discovery via a Compendium of Expression ProfilesCell, 2000
- Exploring the new world of the genome with DNA microarraysNature Genetics, 1999