A code for transcription initiation in mammalian genomes
Open Access
- 21 November 2007
- journal article
- Published by Cold Spring Harbor Laboratory in Genome Research
- Vol. 18 (1) , 1-12
- https://doi.org/10.1101/gr.6831208
Abstract
Genome-wide detection of transcription start sites (TSSs) has revealed that RNA Polymerase II transcription initiates at millions of positions in mammalian genomes. Most core promoters do not have a single TSS, but an array of closely located TSSs with different rates of initiation. As a rule, genes have more than one such core promoter; however, defining the boundaries between core promoters is not trivial. These discoveries prompt a re-evaluation of our models for transcription initiation. We describe a new framework for understanding the organization of transcription initiation. We show that initiation events are clustered on the chromosomes at multiple scales (clusters within clusters), indicating multiple regulatory processes. Within the smallest of such clusters, which can be interpreted as core promoters, the local DNA sequence predicts the relative transcription start usage of each nucleotide with a remarkable 91% accuracy, implying the existence of a DNA code that determines TSS selection. Conversely, the total expression strength of such clusters is only partially determined by the local DNA sequence. Thus, the overall control of transcription can be understood as a combination of large- and small-scale effects; the selection of transcription start sites is largely governed by the local DNA sequence, whereas the transcriptional activity of a locus is regulated at a different level; it is affected by distal features or events such as enhancers and chromatin remodeling.
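The "clusters within clusters" idea in the abstract can be illustrated with a toy example. The sketch below is not the paper's method: it assumes a simple gap-based grouping, in which sorted positions are split wherever consecutive sites are further apart than a chosen threshold, and it uses invented coordinates. The function name `cluster_by_gap`, the `max_gap` parameter, and the threshold values are all hypothetical.

```python
from typing import List


def cluster_by_gap(positions: List[int], max_gap: int) -> List[List[int]]:
    """Group positions so that a new cluster starts whenever the distance
    to the previous position exceeds max_gap (assumed rule, for illustration)."""
    clusters: List[List[int]] = []
    for pos in sorted(positions):
        if clusters and pos - clusters[-1][-1] <= max_gap:
            clusters[-1].append(pos)   # close enough: extend current cluster
        else:
            clusters.append([pos])     # too far: open a new cluster
    return clusters


# Invented TSS coordinates on one strand of one chromosome (not real data).
tss_positions = [100, 103, 105, 140, 142, 900, 904, 950]

# A small threshold yields fine-grained clusters; a larger one merges them.
fine = cluster_by_gap(tss_positions, max_gap=10)
coarse = cluster_by_gap(tss_positions, max_gap=100)

print(fine)    # [[100, 103, 105], [140, 142], [900, 904], [950]]
print(coarse)  # [[100, 103, 105, 140, 142], [900, 904, 950]]
```

With this rule, every cluster found at the smaller threshold lies entirely inside one cluster found at the larger threshold, which is the nesting the abstract describes. Which scale, if any, corresponds to a core promoter is a question the paper addresses with real data, not something this toy example settles.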