From sets to graphs: towards a realistic enrichment analysis of transcriptomic systems
Open Access
- 14 June 2011
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 27 (13) , i366-i373
- https://doi.org/10.1093/bioinformatics/btr228
Abstract
Motivation: Current gene set enrichment approaches do not take interactions and associations between set members into account. Mutual activation and inhibition causing positive and negative correlation among set members are thus neglected. As a consequence, inconsistent regulations and contextless expression changes are reported and, thus, the biological interpretation of the result is impeded. Results: We analyzed established gene set enrichment methods and their result sets in a large-scale investigation of 1000 expression datasets. The reported statistically significant gene sets exhibit only average consistency between the observed patterns of differential expression and known regulatory interactions. We present Gene Graph Enrichment Analysis (GGEA) to detect consistently and coherently enriched gene sets, based on prior knowledge derived from directed gene regulatory networks. Firstly, GGEA improves the concordance of pairwise regulation with individual expression changes in respective pairs of regulating and regulated genes, compared with set enrichment methods. Secondly, GGEA yields result sets where a large fraction of relevant expression changes can be explained by nearby regulators, such as transcription factors, again improving on set-based methods. Thirdly, we demonstrate in additional case studies that GGEA can be applied to human regulatory pathways, where it sensitively detects very specific regulation processes, which are altered in tumors of the central nervous system. GGEA significantly increases the detection of gene sets where measured positively or negatively correlated expression patterns coincide with directed inducing or repressing relationships, thus facilitating further interpretation of gene expression data. Availability: The method and accompanying visualization capabilities have been bundled into an R package and tied to a grahical user interface, the Galaxy workflow environment, that is running as a web server. Contact:Ludwig.Geistlinger@bio.ifi.lmu.de; Ralf.Zimmer@bio.ifi.lmu.deKeywords
This publication has 35 references indexed in Scilit:
- Heading Down the Wrong Pathway: on the Influence of Correlation within Gene SetsBMC Genomics, 2010
- Petri Nets with Fuzzy Logic (PNFL): Reverse Engineering and ParametrizationPLOS ONE, 2010
- Galaxy: a comprehensive approach for supporting accessible, reproducible, and transparent computational research in the life sciencesGenome Biology, 2010
- A novel algorithm for detecting differentially regulated paths based on gene set enrichment analysisBioinformatics, 2009
- Gene-set analysis and reductionBriefings in Bioinformatics, 2008
- RegulonDB (version 6.0): gene regulation model of Escherichia coli K-12 beyond transcription, active (experimental) annotated promoters and Textpresso navigationNucleic Acids Research, 2007
- Many Microbe Microarrays Database: uniformly normalized Affymetrix compendia with structured experimental metadataNucleic Acids Research, 2007
- The Chemokine Receptor CXCR4 Strongly Promotes Neuroblastoma Primary Tumour and Metastatic Growth, but not InvasionPLOS ONE, 2007
- Network-Based Analysis of Affected Biological Processes in Type 2 Diabetes ModelsPLoS Genetics, 2007
- Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profilesProceedings of the National Academy of Sciences, 2005