Finding evolutionary relations beyond superfamilies: Fold‐based superfamilies
- 1 October 2003
- journal article
- Published by Wiley in Protein Science
- Vol. 12 (10) , 2239-2251
- https://doi.org/10.1110/ps.0383603
Abstract
Superfamily classifications are based variably on similarity of sequences, global folds, local structures, or functions. We have examined the possibility of defining superfamilies purely from the viewpoint of the global fold/function relationship. For this purpose, we first classified protein domains according to the β‐sheet topology. We then introduced the concept of kinship relations among the classified β‐sheet topology by assuming that the major elementary event leading to creation of a new β‐sheet topology is either an addition or deletion of one β‐strand at the edge of an existing β‐sheet during the molecular evolution. Based on this kinship relation, a network of protein domains was constructed so that the distance between a pair of domains represents the number of evolutionary events that lead one from the other domain. We then mapped on it all known domains with a specific core chemical function (here taken, as an example, that involving ATP or its analogs). Careful analyses revealed that the domains are found distributed on the network as >20 mutually disjointed clusters. The proteins in each cluster are defined to form a fold‐based superfamily. The results indicate that >20 ATP‐binding protein superfamilies have been invented independently in the process of molecular evolution, and the conservative evolutionary diffusion of global folds and functions is the origin of the relationship between them.Keywords
This publication has 34 references indexed in Scilit:
- Insight into the Catalytic Mechanism of DNA Polymerase β: Structures of Intermediate Complexes,Biochemistry, 2001
- The Protein Data BankNucleic Acids Research, 2000
- Three-Dimensional Structure of N5-Carboxyaminoimidazole Ribonucleotide Synthetase: A Member of the ATP Grasp Protein Superfamily,Biochemistry, 1999
- Domain dislocation: a change of core structure in periplasmic binding proteins in their evolutionary historyJournal of Molecular Biology, 1999
- X-ray Crystal Structure of Glycinamide Ribonucleotide Synthetase from Escherichia coli,Biochemistry, 1998
- Method for prediction of protein function from sequence using the sequence-to-structure-to-function paradigm with application to Glutaredoxins/Thioredoxins and T 1 Ribonucleases 1 1Edited by F. CohenJournal of Molecular Biology, 1998
- CATH – a hierarchic classification of protein domain structuresPublished by Elsevier ,1997
- Derivation of 3D coordinate templates for searching structural databases: Application to ser‐His‐Asp catalytic triads in the serine proteinases and lipasesProtein Science, 1996
- An algorithm for automatically generating protein topology cartoonsProtein Engineering, Design and Selection, 1994
- The AmidotransferasesPublished by Wiley ,1973