Database searching by flexible protein structure alignment
Open Access
- 1 July 2004
- journal article
- Published by Wiley in Protein Science
- Vol. 13 (7) , 1841-1850
- https://doi.org/10.1110/ps.03602304
Abstract
We have recently developed a flexible protein structure alignment program (FATCAT) that identifies structural similarity, at the same time accounting for flexibility of protein structures. One of the most important applications of a structure alignment method is to aid in functional annotations by identifying similar structures in large structural databases. However, none of the flexible structure alignment methods were applied in this task because of a lack of significance estimation of flexible alignments. In this paper, we developed an estimate of the statistical significance of FATCAT alignment score, allowing us to use it as a database‐searching tool. The results reported here show that (1) the distribution of the similarity score of FATCAT alignment between two unrelated protein structures follows the extreme value distribution (EVD), adding one more example to the current collection of EVDs of sequence and structure similarities; (2) introducing flexibility into structure comparison only slightly influences the sensitivity and specificity of identifying similar structures; and (3) the overall performance of FATCAT as a database searching tool is comparable to that of the widely used rigid‐body structure comparison programs DALI and CE. Two examples illustrating the advantages of using flexible structure alignments in database searching are also presented. The conformational flexibilities that were detected in the first example may be involved with substrate specificity, and the conformational flexibilities detected in the second example may reflect the evolution of structures by block building.Keywords
This publication has 39 references indexed in Scilit:
- SCOP: A structural classification of proteins database for the investigation of sequences and structuresPublished by Elsevier ,2006
- Classification schemes for protein structure and functionNature Reviews Genetics, 2003
- Crystal Structure of Tabtoxin Resistance Protein Complexed with Acetyl Coenzyme A Reveals the Mechanism for β-Lactam AcetylationJournal of Molecular Biology, 2003
- An integrated approach to the analysis and modeling of protein sequences and structures. I. Protein structural alignment and a quantitative measure for protein structural distanceJournal of Molecular Biology, 2000
- The Protein Data BankNucleic Acids Research, 2000
- Empirical statistical estimates for sequence similarity searchesJournal of Molecular Biology, 1998
- CATH – a hierarchic classification of protein domain structuresPublished by Elsevier ,1997
- The use of the area under the ROC curve in the evaluation of machine learning algorithmsPattern Recognition, 1997
- Threading a database of protein coresProteins-Structure Function and Bioinformatics, 1995
- Protein Structure Comparison by Alignment of Distance MatricesJournal of Molecular Biology, 1993