High-throughput chromatin information enables accurate tissue-specific prediction of transcription factor binding sites
Open Access
- 6 November 2008
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 37 (1), 14-25
- https://doi.org/10.1093/nar/gkn866
Abstract
In silico prediction of transcription factor binding sites (TFBSs) is central to the task of gene regulatory network elucidation. Genomic DNA sequence information provides a basis for these predictions, due to the sequence specificity of TF-binding events. However, DNA sequence alone is an impoverished source of information for the task of TFBS prediction in eukaryotes, as additional factors, such as chromatin structure, regulate binding events. We show that incorporating high-throughput chromatin modification estimates can greatly improve the accuracy of in silico prediction of in vivo binding for a wide range of TFs in human and mouse. This improvement is superior to the improvement gained by equivalent use of either transcription start site proximity or phylogenetic conservation information. Importantly, predictions made with the use of chromatin structure information are tissue specific. This result supports the biological hypothesis that chromatin modulates TF binding to produce tissue-specific binding profiles in higher eukaryotes, and suggests that the use of chromatin modification information can lead to accurate tissue-specific transcriptional regulatory network elucidation.
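The abstract does not spell out how chromatin information enters the prediction. One generic way to combine a sequence score with position-specific evidence is to add a prior log-odds term to a position weight matrix (PWM) log-odds score. The sketch below illustrates that idea only and is not necessarily the authors' scheme. The matrix values, the prior probabilities, and the names `PWM`, `pwm_log_odds` and `posterior_log_odds` are all hypothetical, chosen for illustration.

```python
import math

# Hypothetical log-odds PWM (natural log) for a 4-bp motif "ACGT".
# One dict per motif position; values are illustrative, not from the paper.
PWM = [
    {"A": 1.0, "C": -1.0, "G": -1.0, "T": -1.0},
    {"A": -1.0, "C": 1.0, "G": -1.0, "T": -1.0},
    {"A": -1.0, "C": -1.0, "G": 1.0, "T": -1.0},
    {"A": -1.0, "C": -1.0, "G": -1.0, "T": 1.0},
]


def pwm_log_odds(window):
    """Sequence-only score: sum of per-position log-odds for one window."""
    return sum(column[base] for column, base in zip(PWM, window))


def posterior_log_odds(seq, prior):
    """Combine the PWM score with a position-specific prior.

    prior[i] is an assumed probability (strictly between 0 and 1) that a
    binding site starts at position i, e.g. derived from a chromatin
    modification signal measured in one tissue.
    """
    width = len(PWM)
    n_windows = len(seq) - width + 1
    if len(prior) != n_windows:
        raise ValueError("need one prior value per candidate start position")
    scores = []
    for i in range(n_windows):
        prior_term = math.log(prior[i] / (1.0 - prior[i]))
        scores.append(pwm_log_odds(seq[i:i + width]) + prior_term)
    return scores


# Two identical motif matches (starts 0 and 6); only the second lies in a
# region given a high prior, standing in for a chromatin-marked region.
sequence = "ACGTTTACGT"
tissue_prior = [0.01] * 6 + [0.2]
scores = posterior_log_odds(sequence, tissue_prior)
best_start = max(range(len(scores)), key=scores.__getitem__)
```

In this toy case both matches have the same sequence score, so the ranking is decided entirely by the prior. Swapping in a prior from a different tissue would change which match ranks first, which is the sense in which such predictions become tissue specific.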