Genome-wide discovery of transcriptional modules from DNA sequence and gene expression
Open Access
- 3 July 2003
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 19 (suppl_1) , i273-i282
- https://doi.org/10.1093/bioinformatics/btg1038
Abstract
In this paper, we describe an approach for understanding transcriptional regulation from both gene expression and promoter sequence data. We aim to identify transcriptional modules—sets of genes that are co-regulated in a set of experiments, through a common motif profile. Using the EM algorithm, our approach refines both the module assignment and the motif profile so as to best explain the expression data as a function of transcriptional motifs. It also dynamically adds and deletes motifs, as required to provide a genome-wide explanation of the expression data. We evaluate the method on two Saccharomyces cerevisiae gene expression data sets, showing that our approach is better than a standard one at recovering known motifs and at generating biologically coherent modules. We also combine our results with binding localization data to obtain regulatory relationships with known transcription factors, and show that many of the inferred relationships have support in the literature. Contact: eran@cs.stanford.edu Keywords: probabilistic models, gene expression, transcriptional regulation.Keywords
This publication has 0 references indexed in Scilit: