Evolution of prokaryotic subtilases: Genome‐wide analysis reveals novel subfamilies with different catalytic residues
- 8 March 2007
- journal article
- research article
- Published by Wiley in Proteins-Structure Function and Bioinformatics
- Vol. 67 (3) , 681-694
- https://doi.org/10.1002/prot.21290
Abstract
Subtilisin-like serine proteases (subtilases) are a very diverse family of serine proteases with low sequence homology, often limited to regions surrounding the three catalytic residues. Starting with different Hidden Markov Models (HMM), based on sequence alignments around the catalytic residues of the S8 family (subtilisins) and S53 family (sedolisins), we iteratively searched all ORFs in the complete genomes of 313 eubacteria and archaea. In 164 genomes we identified a total of 567 ORFs with one or more of the conserved regions with a catalytic residue. The large majority of these contained all three regions around the “classical” catalytic residues of the S8 family (Asp-His-Ser), while 63 proteins were identified as S53 (sedolisin) family members (Glu-Asp-Ser). More than 30 proteins were found to belong to two novel subsets with other evolutionary variations in catalytic residues, and new HMMs were generated to search for them. In one subset the catalytic Asp is replaced by an equivalent Glu (i.e. Glu-His-Ser family). The other subset resembles sedolisins, but the conserved catalytic Asp is not located on the same helix as the nucleophile Glu, but rather on a β-sheet strand in a topologically similar position, as suggested by homology modeling. The Prokaryotic Subtilase Database (www.cmbi.ru.nl/subtilases) provides access to all information on the identified subtilases, the conserved sequence regions, the proposed family subdivision, and the appropriate HMMs to search for them. Over 100 proteins were predicted to be subtilases for the first time by our improved searching methods, thereby improving genome annotation. Proteins 2007.
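As an illustration only, the family subdivision by catalytic residues described in the abstract can be sketched as a small lookup. This is not the authors' method (which relies on iterative HMM searches), and the names `TRIAD_TO_GROUP` and `classify_triad` are hypothetical. The sketch assumes the three catalytic residues have already been identified and are given as one-letter codes in the order listed in the abstract. Residue identity alone cannot separate classical sedolisins from the sedolisin-like subset, because that subset differs in the structural position of the Asp, not in the residues themselves.

```python
from typing import Iterable, Optional

# Hypothetical mapping from ordered catalytic residues (one-letter codes)
# to the groups named in the abstract. Labels are illustrative assumptions.
TRIAD_TO_GROUP = {
    ("D", "H", "S"): "S8 (Asp-His-Ser)",
    ("E", "D", "S"): "S53 or sedolisin-like subset (Glu-Asp-Ser)",
    ("E", "H", "S"): "novel subset (Glu-His-Ser)",
}


def classify_triad(residues: Iterable[str]) -> Optional[str]:
    """Return the group label for an ordered catalytic triad, or None if unknown."""
    key = tuple(r.upper() for r in residues)
    return TRIAD_TO_GROUP.get(key)


if __name__ == "__main__":
    for triad in ("DHS", "EDS", "EHS", "AAA"):
        print(triad, "->", classify_triad(triad))
```

Separating the two Glu-Asp-Ser groups would need structural or alignment information (for example, which conserved region carries the Asp), which this sketch does not model.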