SCORE: A computational approach to the identification of cis-regulatory modules and target genes in whole-genome sequence data
- 9 July 2002
- journal article
- Published by Proceedings of the National Academy of Sciences in Proceedings of the National Academy of Sciences
- Vol. 99 (15) , 9888-9893
- https://doi.org/10.1073/pnas.152320899
Abstract
A large fraction of the information content of metazoan genomes resides in the transcriptional and posttranscriptional cis-regulatory elements that collectively provide the blueprint for using the protein-coding capacity of the DNA, thus guiding the development and physiology of the entire organism. As successive whole-genome sequencing projects--including those of mice and humans--are completed, we have full access to the regulatory genome of yet another species. But our ability to decipher the cis-regulatory code, and hence to link genes into regulatory networks on a global scale, is currently very limited. Here we describe SCORE (Site Clustering Over Random Expectation), a computational method for identifying transcriptional cis-regulatory modules based on the fact that they often contain, in statistically improbable concentrations, multiple binding sites for the same transcription factor. We have carried out a Drosophila genomewide inventory of predicted binding sites for the Notch-regulated transcription factor Suppressor of Hairless [Su(H)] and found that the fly genome contains highly nonrandom clusterings of Su(H) sites over a broad range of sequence intervals. We found that the most statistically significant clusters are very heavily enriched in both known and logical targets of Su(H) binding and regulation. The utility of the SCORE approach was validated by in vivo experiments showing that proper expression of the novel gene Him in adult muscle precursor cells depends both on Su(H) gene activity and sequences that include a previously unstudied cluster of four Su(H) sites, indicating that Him is a likely direct target of Su(H).Keywords
This publication has 37 references indexed in Scilit:
- Exploiting transcription factor binding site clustering to identify cis-regulatory modules involved in pattern formation in the Drosophila genomeProceedings of the National Academy of Sciences, 2002
- Comparative Analysis of the Human and Mouse Hey1 Promoter: Hey Genes Are New Notch Target GenesBiochemical and Biophysical Research Communications, 2000
- The Genome Sequence of Drosophila melanogasterScience, 2000
- Notch Signaling: Cell Fate Control and Signal Integration in DevelopmentScience, 1999
- Suppressor of hairless directly activates transcription of enhancer of split complex genes in response to Notch receptor activity.Genes & Development, 1995
- The neurogenic suppressor of hairless DNA-binding protein mediates the transcriptional activation of the enhancer of split complex genes triggered by Notch signaling.Genes & Development, 1995
- The suppressor of hairless protein participates in notch receptor signalingCell, 1994
- Recognition sequence of a highly conserved DNA binding protein RBP-JxNucleic Acids Research, 1994
- The Drosophila rhomboid gene mediates the localized formation of wing veins and interacts genetically with components of the EGF-R signaling pathway.Genes & Development, 1993
- Suppressor of Hairless, the Drosophila homolog of the mouse recombination signal-binding protein gene, controls sensory organ cell fatesCell, 1992