The catalytic domain of all eukaryotic cut-and-paste transposase superfamilies
- 25 April 2011
- journal article
- research article
- Published by Proceedings of the National Academy of Sciences in Proceedings of the National Academy of Sciences
- Vol. 108 (19) , 7884-7889
- https://doi.org/10.1073/pnas.1104208108
Abstract
Cut-and-paste DNA transposable elements are major components of eukaryotic genomes and are grouped into superfamilies (e.g., hAT, P) based on sequence similarity of the element-encoded transposase. The transposases from several superfamilies possess a protein domain containing an acidic amino acid triad (DDE or DDD) that catalyzes the “cut and paste” transposition reaction. However, it was unclear whether this domain was shared by the transposases from all superfamilies. Through multiple-alignment of transposase sequences from a diverse collection of previously identified and recently annotated elements from a wide range of organisms, we identified the putative DDE/D triad for all superfamilies. Furthermore, we identified additional highly conserved amino acid residues or motifs within the DDE/D domain that together form a “signature string” that is specific to each superfamily. These conserved residues or motifs were exploited as phylogenetic characters to infer evolutionary relationships among all superfamilies. The phylogenetic analysis revealed three major groups that were not previously discerned and led us to revise the classification of several currently recognized superfamilies. Taking the data together, this study suggests that all eukaryotic cut-and-paste transposable element superfamilies have a common evolutionary origin and establishes a phylogenetic framework for all future cut-and-paste transposase comparisons.Keywords
This publication has 33 references indexed in Scilit:
- Genome sequence and analysis of the Irish potato famine pathogen Phytophthora infestansNature, 2009
- TARGeT: a web-based pipeline for retrieving and characterizing gene and transposable element families from genomic sequencesNucleic Acids Research, 2009
- Phylogenomic analyses support the monophyly of Excavata and resolve relationships among eukaryotic “supergroups”Proceedings of the National Academy of Sciences, 2009
- New Superfamilies of Eukaryotic DNA Transposons and Their Internal DivisionsMolecular Biology and Evolution, 2009
- piggyBac can bypass DNA synthesis during cut and paste transpositionThe EMBO Journal, 2008
- Genome Sequence of Aedes aegypti , a Major Arbovirus VectorScience, 2007
- The evolutionary history of human DNA transposons: Evidence for intense activity in the primate lineageGenome Research, 2007
- The map-based sequence of the rice genomeNature, 2005
- Protein structure prediction servers at University College LondonNucleic Acids Research, 2005
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programsNucleic Acids Research, 1997