Large-scale analysis of transcriptional cis-regulatory modules reveals both common features and distinct subclasses
Open Access
- 5 June 2007
- journal article
- Published by Springer Nature in Genome Biology
- Vol. 8 (6), R101
- https://doi.org/10.1186/gb-2007-8-6-r101
Abstract
Background: Transcriptional cis-regulatory modules (for example, enhancers) play a critical role in regulating gene expression. While many individual regulatory elements have been characterized, they have never been analyzed as a class.
Results: We have performed the first such large-scale study of cis-regulatory modules in order to determine whether they have common properties that might aid in their identification and contribute to our understanding of the mechanisms by which they function. A total of 280 individual, experimentally verified cis-regulatory modules from Drosophila were analyzed for a range of sequence-level and functional properties. We report here that regulatory modules do indeed share common properties, among them an elevated GC content, an increased level of interspecific sequence conservation, and a tendency to be transcribed into RNA. However, we find that dense clustering of transcription factor binding sites, especially homotypic clustering, which is commonly believed to be a general characteristic of regulatory modules, is rather a feature that belongs chiefly to a specific subclass. This has important implications for current computational approaches, many of which are biased toward this subset. We explore two new strategies to assess binding site clustering and gauge their performances with respect to their ability to detect all 280 modules and various functionally coherent subsets.
Conclusion: Our findings demonstrate that cis-regulatory modules share common features that help to define them as a class and that may lead to new insights into mechanisms of gene regulation. However, these properties alone may not be sufficient to reliably distinguish regulatory from non-regulatory sequences. We also demonstrate that there are distinct subclasses of cis-regulatory modules that are more amenable to in silico detection than others and that these differences must be taken into account when attempting genome-wide regulatory element discovery.
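Two of the sequence-level properties named in the abstract, GC content and homotypic binding site clustering, can be illustrated with a minimal Python sketch. This is not the authors' method: the function names, the exact-match forward-strand motif search, and the sliding-window definition of a cluster are all assumptions made here for illustration, and the paper's own clustering strategies are not described in the abstract.

```python
def gc_content(seq: str) -> float:
    """Fraction of G or C among unambiguous bases (A, C, G, T).

    Ambiguous characters such as N are ignored (an assumption of this sketch).
    """
    seq = seq.upper()
    acgt = sum(seq.count(base) for base in "ACGT")
    if acgt == 0:
        return 0.0
    return (seq.count("G") + seq.count("C")) / acgt


def max_sites_in_window(seq: str, motif: str, window: int) -> int:
    """Largest number of exact motif matches fully contained in any window.

    Hypothetical stand-in for a homotypic clustering score: it counts
    exact forward-strand matches of a single motif string only.
    """
    seq, motif = seq.upper(), motif.upper()
    if window < len(motif):
        raise ValueError("window must be at least as long as the motif")

    # Start positions of every exact match, in increasing order.
    starts = [
        i
        for i in range(len(seq) - len(motif) + 1)
        if seq[i:i + len(motif)] == motif
    ]

    best = 0
    left = 0
    for right, pos in enumerate(starts):
        # Shrink from the left until the span from the first to the
        # last match fits inside one window.
        while pos - starts[left] + len(motif) > window:
            left += 1
        best = max(best, right - left + 1)
    return best
```

As a usage example, `gc_content("GGCCAATT")` evaluates to 0.5, and for a sequence built as `"TAATCC" + "AA" + "TAATCC" + "G" * 16 + "TAATCC"`, `max_sites_in_window(seq, "TAATCC", 20)` evaluates to 2. The motif string is only an illustrative placeholder. A real analysis would more likely score position weight matrix matches on both strands and compare the counts against a background model.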