Analysis of protein homology by assessing the (dis)similarity in protein loop regions
- 6 August 2004
- journal article
- research article
- Published by Wiley in Proteins-Structure Function and Bioinformatics
- Vol. 57 (3) , 539-547
- https://doi.org/10.1002/prot.20237
Abstract
Two proteins are considered to have a similar fold if sufficiently many of their secondary structure elements are positioned similarly in space and are connected in the same order. Such a common structural scaffold may arise due to either divergent or convergent evolution. The intervening unaligned regions (“loops”) between the superimposable helices and strands can exhibit a wide range of similarity and may offer clues to the structural evolution of folds. One might argue that more closely related proteins differ less in their nonconserved loop regions than distantly related proteins and, at the same time, the degree of variability in the loop regions in structurally similar but unrelated proteins is higher than in homologs. Here we introduce a new measure for structural (dis)similarity in loop regions that is based on the concept of the Hausdorff metric. This measure is used to gauge protein relatedness and is tested on a benchmark of homologous and analogous protein structures. It has been shown that the new measure can distinguish homologous from analogous proteins with the same or higher accuracy than the conventional measures that are based on comparing proteins in structurally aligned regions. We argue that this result can be attributed to the higher sensitivity of the Hausdorff (dis)similarity measure in detecting particularly evident dissimilarities in structures and draw some conclusions about evolutionary relatedness of proteins in the most populated protein folds. Proteins 2004.Keywords
This publication has 47 references indexed in Scilit:
- SCOP: A structural classification of proteins database for the investigation of sequences and structuresPublished by Elsevier ,2006
- Rapid evolution in conformational space: A study of loop regions in a ubiquitous GTP binding domainProtein Science, 2004
- Sequence Variations within Protein Families are Linearly Related to Structural VariationsJournal of Molecular Biology, 2002
- Threading a database of protein coresProteins-Structure Function and Bioinformatics, 1995
- Structure-guided Analysis Reveals Nine Sequence Motifs Conserved among DNA Amino-methyl-transferases, and Suggests a Catalytic Mechanism for these EnzymesJournal of Molecular Biology, 1995
- Structural Features can be Unconserved in Proteins with Similar FoldsJournal of Molecular Biology, 1994
- Empirical and Structural Models for Insertions and Deletions in the Divergent Evolution of ProteinsJournal of Molecular Biology, 1993
- Analysis of insertions/deletions in protein structuresJournal of Molecular Biology, 1992
- On the prediction of protein structure: The significance of the root-mean-square deviationJournal of Molecular Biology, 1980
- Gene duplications in the structural evolution of chymotrypsinJournal of Molecular Biology, 1979