A Markov random field model for network-based analysis of genomic data
Open Access
- 5 May 2007
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 23 (12) , 1537-1544
- https://doi.org/10.1093/bioinformatics/btm129
Abstract
Motivation: A central problem in genomic research is the identification of genes and pathways involved in diseases and other biological processes. The genes identified or the univariate test statistics are often linked to known biological pathways through gene set enrichment analysis in order to identify the pathways involved. However, most of the procedures for identifying differentially expressed (DE) genes do not utilize the known pathway information in the phase of identifying such genes. In this article, we develop a Markov random field (MRF)-based method for identifying genes and subnetworks that are related to diseases. Such a procedure models the dependency of the DE patterns of genes on the networks using a local discrete MRF model.
Results: Simulation studies indicated that the method is quite effective in identifying genes and subnetworks that are related to disease and has higher sensitivity and lower false discovery rates than the commonly used procedures that do not use the pathway structure information. Applications to two breast cancer microarray gene expression datasets identified several subnetworks on several of the KEGG transcriptional pathways that are related to breast cancer recurrence or survival due to breast cancer.
Conclusions: The proposed MRF-based model efficiently utilizes the known pathway structures in identifying the DE genes and the subnetworks that might be related to phenotype. As more biological networks are identified and documented in databases, the proposed method should find more applications in identifying the subnetworks that are related to diseases and other biological processes.
Contact: hongzhe@mail.med.upenn.edu or hli@cceb.upenn.edu