The complex eukaryotic transcriptome: unexpected pervasive transcription and novel small RNAs
Top Cited Papers
- 1 December 2009
- journal article
- review article
- Published by Springer Nature in Nature Reviews Genetics
- Vol. 10 (12) , 833-844
- https://doi.org/10.1038/nrg2683
Abstract
Initial transcriptome analyses were limited to the quantification of the transcripts that correspond to known features, such as mRNA genes or stable non-coding RNAs. Projects aimed at the exhaustive identification of RNAs, unbiased by genome annotations, have uncovered unexpected complexity of the transcriptome in eukaryotes. Several transcripts are generated from genomic regions that were previously thought to be silent or antisense to genes. This widespread genomic transcription is often called 'pervasive transcription'. Advances in understanding the complexity of eukaryotic transcriptomes have been driven by technological breakthroughs. Several distinct approaches have recently evolved to allow rapid, unbiased, genome-wide analyses of transcriptomes. First, DNA microarrays have been developed to the point at which tiled oligonucleotides span whole genomes (genomic tiling microarrays). Second, the development of next-generation sequencing techniques has allowed the efficient and quantitative analysis of sequence tags that either cover whole transcripts or are enriched for the 5′ or 3′ extremities of the RNAs. The new genomic approaches for transcriptome analysis have uncovered a variety of non-coding transcripts. Recently, several classes of small non-coding RNAs have been found to be associated with gene promoters in animals. Although these different classes of RNA differ in their characteristics — such as their modal length — their distribution with respect to gene transcription start sites (TSSs) is remarkably similar. Some of these small RNAs are transcribed in the same orientation as the mRNAs and are usually located a short distance downstream of the gene TSS, and others are transcribed in the opposite direction to the mRNA from upstream of the gene TSS. The origin of these promoter-associated RNAs is unknown. However, several observations point to a possible relationship with the so-called 'paused' polymerases, that is, polymerases that engage in transcription but that pause a few dozens of nucleotides downstream of the TSS. In particular, the distribution of the promoter-associated small RNAs is similar to that of 'engaged' RNA polymerase II, as determined by a novel genome-wide run-on technique. However, how the small RNAs and paused polymerases might be related remains puzzling and is far from established. In yeast, an important part of pervasive transcription gives rise to highly unstable transcripts called cryptic unstable transcripts (CUTs). These transcripts are heterogeneous at their 3′ ends and range in size from ∼200 to ∼600 nucleotides. Another class of more stable transcripts has been distinguished and named stable unannotated transcripts (SUTs), although there is not a clear demarcation between the two classes. Like the promoter-associated small RNAs found in animals, these RNAs are mostly transcribed from nucleosome-free regions, in particular those associated with gene promoters. In addition, they show a divergent distribution profile. However, this profile is not completely equivalent to that observed in animals, as their TSSs are almost exclusively located upstream of the gene TSSs, whether or not they are transcribed in the sense orientation or in divergent orientation relative to the gene. The majority of CUTs and SUTs are divergent from their associated mRNAs. One model for their origin, which is supported by mutational analysis of one example, is that the assembly of pre-initiation complexes (PICs) during transcription initiation is poorly polarized, and so cryptic PICs are often assembled in the wrong orientation relative to the gene. The transcripts they generate are efficiently degraded by an efficient quality control mechanism. The nature of the quality control mechanism that targets CUTs for rapid degradation is well understood. This mechanism is coupled to the peculiar mode of termination of transcription for these RNAs, which resembles the transcription termination of small nucleolar RNAs. This mode of termination is coupled to exonucleolytic degradation by the exosome, assisted by a novel poly(A) polymerase-containing complex called the Trf4–Air2–Mtr4p polyadenylation (TRAMP) complex. Although the role of divergent CUTs in regulation is unknown, several different specific regulation mechanisms have been described that use antisense SUTs or generate sense CUTs. How widespread the use of these unconventional regulation mechanisms is remains to be determined. Likewise, in animals, the general role of promoter-associated transcription remains enigmatic, although several different mechanisms have been described that make use of such transcripts as effectors of gene regulation. Importantly, whatever the precise mechanism is that generates promoter-associated small non-coding RNAs in yeast and animals, these studies indicate that transcription initiation is a poorly polarized process and many, if not most, promoter regions therefore seem to be intrinsically bidirectional.Keywords
This publication has 80 references indexed in Scilit:
- Pervasive transcription constitutes a new level of eukaryotic genome regulationEMBO Reports, 2009
- Evolution and Functions of Long Noncoding RNAsPublished by Elsevier ,2009
- Origins and Mechanisms of miRNAs and siRNAsCell, 2009
- The Many Pathways of RNA DegradationCell, 2009
- Mutations of RNA polymerase II activate key genes of the nucleoside triphosphate biosynthetic pathwaysThe EMBO Journal, 2008
- A Chromatin Landmark and Transcription Initiation at Most Promoters in Human CellsCell, 2007
- High-Resolution Profiling of Histone Methylations in the Human GenomePublished by Elsevier ,2007
- Cryptic Pol II Transcripts Are Degraded by a Nuclear Quality Control Pathway Involving a New Poly(A) PolymeraseCell, 2005
- Intergenic transcription is required to repress the Saccharomyces cerevisiae SER3 geneNature, 2004
- Analysis of the mouse transcriptome based on functional annotation of 60,770 full-length cDNAsNature, 2002