Integrated Weighted Gene Co-expression Network Analysis with an Application to Chronic Fatigue Syndrome
Open Access
- 6 November 2008
- journal article
- research article
- Published by Springer Nature in BMC Systems Biology
- Vol. 2 (1) , 95
- https://doi.org/10.1186/1752-0509-2-95
Abstract
Background: Systems biologic approaches such as Weighted Gene Co-expression Network Analysis (WGCNA) can effectively integrate gene expression and trait data to identify pathways and candidate biomarkers. Here we show that the additional inclusion of genetic marker data allows one to characterize network relationships as causal or reactive in a chronic fatigue syndrome (CFS) data set. Results: We combine WGCNA with genetic marker data to identify a disease-related pathway and its causal drivers, an analysis which we refer to as "Integrated WGCNA" or IWGCNA. Specifically, we present the following IWGCNA approach: 1) construct a co-expression network, 2) identify trait-related modules within the network, 3) use a trait-related genetic marker to prioritize genes within the module, 4) apply an integrated gene screening strategy to identify candidate genes and 5) carry out causality testing to verify and/or prioritize results. By applying this strategy to a CFS data set consisting of microarray, SNP and clinical trait data, we identify a module of 299 highly correlated genes that is associated with CFS severity. Our integrated gene screening strategy results in 20 candidate genes. We show that our approach yields biologically interesting genes that function in the same pathway and are causal drivers for their parent module. We use a separate data set to replicate findings and use Ingenuity Pathways Analysis software to functionally annotate the candidate gene pathways. Conclusion: We show how WGCNA can be combined with genetic marker data to identify disease-related pathways and the causal drivers within them. The systems genetics approach described here can easily be used to generate testable genetic hypotheses in other complex disease studies.This publication has 73 references indexed in Scilit:
- Genetics of gene expression and its effect on diseaseNature, 2008
- Variations in DNA elucidate molecular networks that cause diseaseNature, 2008
- Weighted gene coexpression network analysis strategies applied to mouse weightMammalian Genome, 2007
- Conservation and evolution of gene coexpression networks in human and chimpanzee brainsProceedings of the National Academy of Sciences, 2006
- Analysis of oncogenic signaling networks in glioblastoma identifies ASPM as a molecular targetProceedings of the National Academy of Sciences, 2006
- Identification of inflammatory gene modules based on variations of human endothelial cell responses to oxidized lipidsProceedings of the National Academy of Sciences, 2006
- Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profilesProceedings of the National Academy of Sciences, 2005
- An integrative genomics approach to infer causal associations between gene expression and diseaseNature Genetics, 2005
- Immune Modulation of the Hypothalamic-Pituitary-Adrenal (HPA) Axis during Viral InfectionViral Immunology, 2005
- Statistical significance for genomewide studiesProceedings of the National Academy of Sciences, 2003