Recurrent evolution of vertebrate transcription factors by transposase capture
Preprint
- 7 May 2020
- preprint
- Published by Cold Spring Harbor Laboratory in bioRxiv
Abstract
How genes with novel cellular functions evolve is a central biological question. Exon shuffling is one mechanism to assemble new protein architectures. Here we show that DNA transposons, which are mobile and pervasive in genomes, have provided a recurrent supply of exons and splice sites to assemble protein-coding genes in vertebrates via exon-shuffling. We find that transposase domains have been captured, primarily via alternative splicing, to form new fusion proteins at least 94 times independently over ∼350 million years of tetrapod evolution. Evolution favors fusion of transposase DNA-binding domains to host regulatory domains, especially the Krüppel-associated Box (KRAB), suggesting transposase capture frequently yields new transcriptional repressors. We show that four independently evolved KRAB-transposase fusion proteins repress gene expression in a sequence-specific fashion. Genetic knockout and rescue of the bat-specificKRABINERfusion gene in cells demonstrates that it binds its cognate transposons genome-wide and controls a vast network of genes andcis-regulatory elements. These results illustrate a powerful mechanism by which a transcription factor and its dispersed binding sites emerge at once from a transposon family.One Sentence Summary: Host-transposase fusion generates novel cellular genes, including deeply conserved and lineage specific transcription factors.Keywords
All Related Versions
- Published version: Science, 371 (6531), 797.
This publication has 84 references indexed in Scilit:
- The conserved Cockayne syndrome B-piggyBac fusion protein (CSB-PGBD3) affects DNA repair and induces both interferon-like and innate antiviral responses in CSB-null cellsDNA Repair, 2012
- Fast gapped-read alignment with Bowtie 2Nature Methods, 2012
- The genome of the green anole lizard and a comparative analysis with birds and mammalsNature, 2011
- The catalytic domain of all eukaryotic cut-and-paste transposase superfamiliesProceedings of the National Academy of Sciences, 2011
- Evolution of an antifreeze protein by neofunctionalization under escape from adaptive conflictProceedings of the National Academy of Sciences, 2010
- Simple Combinations of Lineage-Determining Transcription Factors Prime cis-Regulatory Elements Required for Macrophage and B Cell IdentitiesMolecular Cell, 2010
- Molecular Architecture of the Mos1 Paired-End Complex: The Structural Basis of DNA Transposition in a EukaryoteCell, 2009
- Transposable elements and the evolution of regulatory networksNature Reviews Genetics, 2008
- Massive amplification of rolling-circle transposons in the lineage of the bat Myotis lucifugusProceedings of the National Academy of Sciences, 2007
- Birth of a chimeric primate gene by capture of the transposase gene from a mobile elementProceedings of the National Academy of Sciences, 2006