The natural history of the WRKY–GCM1 zinc fingers and the relationship between transcription factors and transposons
Open Access
- 27 November 2006
- journal article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 34 (22) , 6505-6520
- https://doi.org/10.1093/nar/gkl888
Abstract
WRKY and GCM1 are metal chelating DNA-binding domains (DBD) which share a four stranded fold. Using sensitive sequence searches, we show that this WRKY-GCM1 fold is also shared by the FLYWCH Zn-finger domain and the DBDs of two classes of Mutator-like element (MULE) transposases. We present evidence that they share a stabilizing core, which suggests a possible origin from a BED finger-like intermediate that was in turn ultimately derived from a C2H2 Zn-finger domain. Through a systematic study of the phyletic pattern, we show that this WRKY-GCM1 superfamily is a widespread eukaryote-specific group of transcription factors (TFs). We identified several new members across diverse eukaryotic lineages, including potential TFs in animals, fungi and Entamoeba. By integrating sequence, structure, gene expression and transcriptional network data, we present evidence that at least two major global regulators belonging to this superfamily in Saccharomyces cerevisiae (Rcs1p and Aft2p) have evolved from transposons, and attained the status of transcription regulatory hubs in recent course of ascomycete yeast evolution. In plants, we show that the lineage-specific expansion of WRKY-GCM1 domain proteins acquired functional diversity mainly through expression divergence rather than by protein sequence divergence. We also use the WRKY-GCM1 superfamily as an example to illustrate the importance of transposons in the emergence of new TFs in different lineages.Keywords
This publication has 92 references indexed in Scilit:
- Comparative genomics and structural biology of the molecular innovations of eukaryotesCurrent Opinion in Structural Biology, 2006
- The genome of the social amoeba Dictyostelium discoideumNature, 2005
- A gene expression map of Arabidopsis thaliana developmentNature Genetics, 2005
- Secondary-structure matching (SSM), a new tool for fast protein structure alignment in three dimensionsActa Crystallographica Section D-Biological Crystallography, 2004
- Gene regulatory network growth by duplicationNature Genetics, 2004
- MUSCLE: multiple sequence alignment with high accuracy and high throughputNucleic Acids Research, 2004
- Members of the Arabidopsis WRKY Group III Transcription Factors Are Part of Different Plant Defense Signaling PathwaysMolecular Plant-Microbe Interactions®, 2003
- T-coffee: a novel method for fast and accurate multiple sequence alignment 1 1Edited by J. ThorntonJournal of Molecular Biology, 2000
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programsNucleic Acids Research, 1997
- SWISS‐MODEL and the Swiss‐Pdb Viewer: An environment for comparative protein modelingElectrophoresis, 1997