The Medium‐Chain Dehydrogenase/Reductase Engineering Database: A systematic analysis of a diverse protein family to understand sequence–structure–function relationship
- 1 October 2008
- journal article
- research article
- Published by Wiley in Protein Science
- Vol. 17 (10) , 1689-1697
- https://doi.org/10.1110/ps.035428.108
Abstract
The Medium‐Chain Dehydrogenase/Reductase Engineering Database (MDRED, http://www.mdred.uni‐stuttgart.de) has been established to serve as an analysis tool for a systematic investigation of sequence–structure–function relationships. It includes sequence and structure information of 2684 and 42 medium‐chain dehydrogenases/reductases (MDRs), respectively. Although MDRs are very diverse in sequence, they have a conserved tertiary structure. MDRs are assigned to 199 homologous families and 29 superfamilies. For each family, annotated multiple sequence alignments are provided, and functionally relevant residues are annotated. Twenty‐five superfamilies were classified as zinc‐containing MDRs, four as non‐zinc‐containing MDRs. For the zinc‐containing MDRs, three subclasses were identified by systematic analysis of a variable loop region, the quaternary structure determining loop (QSDL): the class of short, medium, and long QSDL, which include 11, 3, and 5 superfamilies, respectively. The length of the QSDL is predictive for tetramer (short QSDL) and dimer (long QSDL) formation. The class of medium QSDL includes both tetrameric and dimeric MDRs. The shape of the substrate‐binding site is highly conserved in all zinc‐containing MDRs with the exception of two variable regions, the substrate recognition sites (SRS): two residues located on the QSDL (SRS1) and, for the class of long QSDL, one residue located in the catalytic domain (SRS2). The MDRED is the first online‐accessible resource of MDRs that integrates information on sequence, structure, and function. Annotation of functionally relevant residues assist the understanding of sequence–structure–function relationships. Thus, the MDRED serves as a valuable tool to identify potential hotspots for engineering properties such as substrate specificity.Keywords
This publication has 38 references indexed in Scilit:
- An efficient algorithm for large-scale detection of protein familiesNucleic Acids Research, 2002
- The Protein Data BankNucleic Acids Research, 2000
- SWISS‐MODEL and the Swiss‐Pdb Viewer: An environment for comparative protein modelingElectrophoresis, 1997
- CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choiceNucleic Acids Research, 1994
- A Super‐Family of Medium‐Chain Dehydrogenases/Reductases (MDR)European Journal of Biochemistry, 1994
- Human alcohol dehydrogenase: Dependence of secondary alcohol oxidation on the amino acids at positions 93 and 94Biochemical and Biophysical Research Communications, 1992
- Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical featuresBiopolymers, 1983
- Differences between Alcohol DehydeogenasesEuropean Journal of Biochemistry, 1977
- Three-dimensional structure of horse liver alcohol dehydrogenase at 2.4 Å resolutionJournal of Molecular Biology, 1976