Integrated Assessment and Prediction of Transcription Factor Binding

Open Access

16 June 2006

journal article
research article
Published by Public Library of Science (PLoS) in PLoS Computational Biology

Vol. 2 (6) , e70
https://doi.org/10.1371/journal.pcbi.0020070

Abstract

Systematic chromatin immunoprecipitation (chIP-chip) experiments have become a central technique for mapping transcriptional interactions in model organisms and humans. However, measurement of chromatin binding does not necessarily imply regulation, and binding may be difficult to detect if it is condition or cofactor dependent. To address these challenges, we present an approach for reliably assigning transcription factors (TFs) to target genes that integrates many lines of direct and indirect evidence into a single probabilistic model. Using this approach, we analyze publicly available chIP-chip binding profiles measured for yeast TFs in standard conditions, showing that our model interprets these data with significantly higher accuracy than previous methods. Pooling the high-confidence interactions reveals a large network containing 363 significant sets of factors (TF modules) that cooperate to regulate common target genes. In addition, the method predicts 980 novel binding interactions with high confidence that are likely to occur in so-far untested conditions. Indeed, using new chIP-chip experiments we show that predicted interactions for the factors Rpn4p and Pdr1p are observed only after treatment of cells with methyl-methanesulfonate, a DNA-damaging agent. We outline the first approach for consistently integrating all available evidences for TF–target interactions and we comprehensively identify the resulting TF module hierarchy. Prioritizing experimental conditions for each factor will be especially important as increasing numbers of chIP-chip assays are performed in complex organisms such as humans, for which “standard conditions” are ill defined. Transcription factors (TFs) bind close to their target genes for regulating transcript levels depending on cellular conditions. Each gene may be regulated differently from others through the binding of specific groups of TFs (TF modules). Recently, a wide variety of large-scale measurements about transcriptional networks has become available. Here the authors present a framework for consistently integrating all of this evidence to systematically determine the precise set of genes directly regulated by each TF (i.e., TF–target interactions). The framework is applied to the yeast Saccharomyces cerevisiae using seven distinct sources of evidences to score all possible TF–target interactions in this organism. Subsequently, the authors employ another newly developed algorithm to reveal TF modules based on the top 5,000 TF–target interactions, yielding more than 300 TF modules. The new scoring scheme for TF–target interactions allows predicting the binding of TFs under so-far untested conditions, which is demonstrated by experimentally verifying interactions for two TFs (Pdr1p, Rpn4p). Importantly, the new methods (scoring of TF–target interactions and TF module identification) are scalable to much larger datasets, making them applicable to future studies in humans, which are thought to have substantially larger numbers of TF–target interactions.

Keywords

This publication has 46 references indexed in Scilit:

Inference of combinatorial regulation in yeast transcriptional networks: A case study of sporulation
Proceedings of the National Academy of Sciences, 2005
Transcription regulation and animal diversity
Nature, 2003
Genome-wide discovery of transcriptional modules from DNA sequence and gene expression
Bioinformatics, 2003
Module networks: identifying regulatory modules and their condition-specific regulators from gene expression data
Nature Genetics, 2003
The yeast zinc finger regulators Pdr1p and Pdr3p control pleiotropic drug resistance (PDR) as homo‐ and heterodimers in vivo
Molecular Microbiology, 2002
Transcriptional Regulatory Networks in Saccharomyces cerevisiae
Science, 2002
DNA mismatch repair and acquired cisplatin resistance in E. coli and human ovarian carcinoma cells
DNA Repair, 2002
Exploring the conditional coregulation of yeast gene expression through fuzzy k-means clustering
Genome Biology, 2002
Revealing modular organization in the yeast transcriptional network
Nature Genetics, 2002
Genomic Expression Responses to DNA-damaging Agents and the Regulatory Role of the Yeast ATR Homolog Mec1p
Molecular Biology of the Cell, 2001