Along signal paths: an empirical gene set approach exploiting pathway topology
Open Access
- 21 September 2012
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 41 (1) , e19
- https://doi.org/10.1093/nar/gks866
Abstract
Gene set analysis using biological pathways has become a widely used statistical approach for gene expression analysis. A biological pathway can be represented through a graph where genes and their interactions are, respectively, nodes and edges of the graph. From a biological point of view only some portions of a pathway are expected to be altered; however, few methods using pathway topology have been proposed and none of them tries to identify the signal paths, within a pathway, mostly involved in the biological problem. Here, we present a novel algorithm for pathway analysis clipper, that tries to fill in this gap. clipper implements a two-step empirical approach based on the exploitation of graph decomposition into a junction tree to reconstruct the most relevant signal path. In the first step clipper selects significant pathways according to statistical tests on the means and the concentration matrices of the graphs derived from pathway topologies. Then, it identifies within these pathways the signal paths having the greatest association with a specific phenotype. We test our approach on simulated and two real expression datasets. Our results demonstrate the efficacy of clipper in the identification of signal transduction paths totally coherent with the biological problem.Keywords
This publication has 32 references indexed in Scilit:
- Network-enabled gene expression analysisBMC Bioinformatics, 2012
- graphite - a Bioconductor package to convert pathway topology to gene networkBMC Bioinformatics, 2012
- Reactome knowledgebase of human biological pathways and processesNucleic Acids Research, 2008
- PID: the Pathway Interaction DatabaseNucleic Acids Research, 2008
- Calpain 3 is a modulator of the dysferlin protein complex in skeletal muscleHuman Molecular Genetics, 2008
- Analyzing gene expression data in terms of gene sets: methodological issuesBioinformatics, 2007
- Accurate Ranking of Differentially Expressed Genes by a Distribution-Free Shrinkage ApproachStatistical Applications in Genetics and Molecular Biology, 2007
- Nuclear envelope dystrophies show a transcriptional fingerprint suggesting disruption of Rb–MyoD pathways in muscle regenerationBrain, 2006
- Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profilesProceedings of the National Academy of Sciences, 2005
- A Shrinkage Approach to Large-Scale Covariance Matrix Estimation and Implications for Functional GenomicsStatistical Applications in Genetics and Molecular Biology, 2005