Defining transcriptional networks through integrative modeling of mRNA expression and transcription factor binding data
Open Access
- 18 March 2004
- journal article
- Published by Springer Nature in BMC Bioinformatics
- Vol. 5 (1) , 31
- https://doi.org/10.1186/1471-2105-5-31
Abstract
Functional genomics studies are yielding information about regulatory processes in the cell at an unprecedented scale. In the yeast S. cerevisiae, DNA microarrays have not only been used to measure the mRNA abundance for all genes under a variety of conditions but also to determine the occupancy of all promoter regions by a large number of transcription factors. The challenge is to extract useful information about the global regulatory network from these data. We present MA-Networker, an algorithm that combines microarray data for mRNA expression and transcription factor occupancy to define the regulatory network of the cell. Multivariate regression analysis is used to infer the activity of each transcription factor, and the correlation across different conditions between this activity and the mRNA expression of a gene is interpreted as regulatory coupling strength. Applying our method to S. cerevisiae, we find that, on average, 58% of the genes whose promoter region is bound by a transcription factor are true regulatory targets. These results are validated by an analysis of enrichment for functional annotation, response for transcription factor deletion, and over-representation of cis-regulatory motifs. We are able to assign directionality to transcription factors that control divergently transcribed genes sharing the same promoter region. Finally, we identify an intrinsic limitation of transcription factor deletion experiments related to the combinatorial nature of transcriptional control, to which our approach provides an alternative. Our reliable classification of ChIP positives into functional and non-functional TF targets based on their expression pattern across a wide range of conditions provides a starting point for identifying the unknown sequence features in non-coding DNA that directly or indirectly determine the context dependence of transcription factor action. Complete analysis results are available for browsing or download at http://bussemaker.bio.columbia.edu/papers/MA-Networker/.Keywords
This publication has 28 references indexed in Scilit:
- Protein–DNA interaction mapping using genomic tiling path microarrays in DrosophilaProceedings of the National Academy of Sciences, 2003
- Genomewide analysis of Drosophila GAGA factor target genes reveals context-dependent DNA bindingProceedings of the National Academy of Sciences, 2003
- Transcriptional Regulatory Networks in Saccharomyces cerevisiaeScience, 2002
- Chromatin profiling using targeted DNA adenine methyltransferaseNature Genetics, 2001
- Genomic binding sites of the yeast cell-cycle transcription factors SBF and MBFNature, 2001
- Genome-Wide Location and Function of DNA Binding ProteinsScience, 2000
- Chromosomal landscape of nucleosome-dependent gene expression and silencing in yeastNature, 1999
- The Transcriptional Program in the Response of Human Fibroblasts to SerumScience, 1999
- Exploring the new world of the genome with DNA microarraysNature Genetics, 1999
- Expression monitoring by hybridization to high-density oligonucleotide arraysNature Biotechnology, 1996