The CATH Dictionary of Homologous Superfamilies (DHS): a consensus approach for identifying distant structural homologues
Open Access
- 1 March 2000
- journal article
- research article
- Published by Oxford University Press (OUP) in Protein Engineering, Design and Selection
- Vol. 13 (3) , 153-165
- https://doi.org/10.1093/protein/13.3.153
Abstract
A consensus approach has been developed for identifying distant structural homologues. This is based on the CATH Dictionary of Homologous Superfamilies (DHS), a database of validated multiple structural alignments annotated with consensus functional information for evolutionary protein superfamilies (URL: http://www.biochem.ucl.ac.uk/bsm/dhs). Multiple structural alignments have been generated for 362 well-populated superfamilies in the CATH structural domain database and annotated with secondary structure, physicochemical properties, functional sequence patterns and protein–ligand interaction data. Consensus functional information for each superfamily includes descriptions and keywords extracted from SWISS-PROT and the ENZYME database. The Dictionary provides a powerful resource to validate, examine and visualize key structural and functional features of each homologous superfamily. The value of the DHS, for assessing functional variability and identifying distant evolutionary relationships, is illustrated using the pyridoxal-5′-phosphate (PLP) binding aspartate aminotransferase superfamily. The DHS also provides a tool for examining sequence–structure relationships for proteins within each fold group.Keywords
This publication has 67 references indexed in Scilit:
- GenTHREADER: an efficient and reliable protein fold recognition method for genomic sequencesJournal of Molecular Biology, 1999
- Three-dimensional structure analysis of PROSITE patterns 1 1Edited by F. E. CohenJournal of Molecular Biology, 1999
- Sequence comparisons using multiple sequences detect three times as many remote homologues as pairwise methodsJournal of Molecular Biology, 1998
- Taking a Structured Approach to Understanding ProteinsScience, 1998
- Intermediate sequences increase the detection of homology between sequencesJournal of Molecular Biology, 1997
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programsNucleic Acids Research, 1997
- CATH – a hierarchic classification of protein domain structuresPublished by Elsevier ,1997
- Protein Structure Comparison by Alignment of Distance MatricesJournal of Molecular Biology, 1993
- Protein structure alignmentJournal of Molecular Biology, 1989
- Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical featuresBiopolymers, 1983