Comparative Genomics and Evolution of Proteins Associated with RNA Polymerase II C-Terminal Domain
Open Access
- 13 July 2005
- journal article
- research article
- Published by Oxford University Press (OUP) in Molecular Biology and Evolution
- Vol. 22 (11) , 2166-2178
- https://doi.org/10.1093/molbev/msi215
Abstract
The C-terminal domain (CTD) of the largest subunit of RNA polymerase II provides an anchoring point for a wide variety of proteins involved in mRNA synthesis and processing. Most of what is known about CTD-protein interactions comes from animal and yeast models. The consensus sequence and repetitive structure of the CTD is conserved strongly across a wide range of organisms, implying that the same is true of many of its known functions. In some eukaryotic groups, however, the CTD has been allowed to degenerate, suggesting a comparable lack of essential protein interactions. To date, there has been no comprehensive examination of CTD-related proteins across the eukaryotic domain to determine which of its identified functions are correlated with strong stabilizing selection on CTD structure. Here we report a comparative investigation of genes encoding 50 CTD-associated proteins, identifying putative homologs from 12 completed or nearly completed eukaryotic genomes. The presence of a canonical CTD generally is correlated with the apparent presence and conservation of its known protein partners; however, no clear set of interactions emerges that is invariably linked to conservation of the CTD. General rates of evolution, phylogenetic patterns, and the conservation of modeled tertiary structure of capping enzyme guanylyltransferase (Cgt1) indicate a pattern of coevolution of components of a transcription factory organized around the CTD, presumably driven by common functional constraints. These constraints complicate efforts to determine orthologous gene relationships and can mislead phylogenetic and informatic algorithms.Keywords
This publication has 70 references indexed in Scilit:
- Functional Unit of the RNA Polymerase II C-Terminal Domain Lies within Heptapeptide PairsEukaryotic Cell, 2004
- SWISS-MODEL: an automated protein homology-modeling serverNucleic Acids Research, 2003
- Naf1p, an Essential Nucleoplasmic Factor Specifically Required for Accumulation of Box H/ACA Small Nucleolar RNPsMolecular and Cellular Biology, 2002
- An extensive network of coupling among gene expression machinesNature, 2002
- A Novel WD40 Repeat Protein, WDC146, Highly Expressed during Spermatogenesis in a Stage-Specific MannerBiochemical and Biophysical Research Communications, 2001
- SEQUENCES OF THE LARGEST SUBUNIT OF RNA POLYMERASE II FROM TWO RED ALGAE AND THEIR IMPLICATIONS FOR RHODOPHYTE EVOLUTIONJournal of Phycology, 1998
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programsNucleic Acids Research, 1997
- Quartet Puzzling: A Quartet Maximum-Likelihood Method for Reconstructing Tree TopologiesMolecular Biology and Evolution, 1996
- CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choiceNucleic Acids Research, 1994
- NoticesCladistics, 1989