Structural and evolutionary classification of Type II restriction enzymes based on theoretical and experimental analyses
Open Access
- 2 May 2008
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 36 (11) , 3552-3569
- https://doi.org/10.1093/nar/gkn175
Abstract
For a very long time, Type II restriction enzymes (REases) have been a paradigm of ORFans: proteins with no detectable similarity to each other or to any other protein in the database, despite a common cellular and biochemical function. Crystallographic analyses published until January 2008 provided high-resolution structures for only 28 of the 1637 Type II REase sequences available in the Restriction Enzyme database (REBASE). Among these structures, all but two possess catalytic domains with the common PD-(D/E)XK nuclease fold. Two structures are unrelated to the others: R.BfiI exhibits the phospholipase D (PLD) fold, while R.PabI has a new fold termed ‘half-pipe’. Thus far, bioinformatic studies supported by site-directed mutagenesis have extended the number of tentatively assigned REase folds to five (now also including the GIY-YIG and HNH folds identified earlier in homing endonucleases) and provided structural predictions for dozens of REase sequences without experimentally solved structures. Here, we present a comprehensive study of all Type II REase sequences available in REBASE together with their homologs detectable in the nonredundant and environmental samples databases at the NCBI. We present a summary and critical evaluation of structural assignments and predictions reported earlier, a new classification of all REase sequences into families, a domain architecture analysis and new predictions of three-dimensional folds. Among 289 experimentally characterized (not putative) Type II REases whose apparently full-length sequences are available in REBASE, we assign the PD-(D/E)XK domain to 199 (69%). The HNH domain is the second most common, with 24 (8%) members. When putative REases are taken into account, the fractions of PD-(D/E)XK and HNH folds change to 48% and 30%, respectively. Fifty-six characterized (and 521 predicted) REases remain unassigned to any of the five REase folds identified so far, and may exhibit new architectures. These enzymes are proposed as the most interesting targets for structure determination by high-resolution experimental methods. Our analysis provides the first comprehensive map of sequence-structure relationships among Type II REases and will help to focus the efforts of structural and functional genomics of this large and biotechnologically important class of enzymes.
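The percentages quoted in the abstract can be rechecked from the counts it gives. The following is only a minimal arithmetic sketch: the counts are taken from the abstract, while the variable names, the dictionary layout and the helper `percent_share` are illustrative assumptions and not part of the published analysis.

```python
# Counts quoted in the abstract; names and layout here are illustrative only.
CHARACTERIZED_TOTAL = 289   # experimentally characterized Type II REases
REBASE_SEQUENCES = 1637     # Type II REase sequences in REBASE
SOLVED_STRUCTURES = 28      # crystal structures published until January 2008

characterized_counts = {
    "PD-(D/E)XK": 199,
    "HNH": 24,
    "unassigned": 56,
}


def percent_share(count: int, total: int) -> int:
    """Return count/total as a percentage rounded to the nearest integer."""
    return round(100 * count / total)


# Share of each group among the characterized enzymes.
shares = {
    fold: percent_share(count, CHARACTERIZED_TOTAL)
    for fold, count in characterized_counts.items()
}

# Fraction of REBASE sequences with a solved structure, to one decimal place.
structural_coverage = round(100 * SOLVED_STRUCTURES / REBASE_SEQUENCES, 1)

print(shares)
print(structural_coverage)
```

The rounded shares for PD-(D/E)XK and HNH agree with the 69% and 8% stated in the abstract. The share of unassigned characterized enzymes and the structural coverage are not stated there; they are derived here from the quoted counts.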