Profile–profile methods provide improved fold‐recognition: A study of different profile–profile alignment methods
- 14 May 2004
- journal article
- research article
- Published by Wiley in Proteins-Structure Function and Bioinformatics
- Vol. 57 (1) , 188-197
- https://doi.org/10.1002/prot.20184
Abstract
To improve the detection of related proteins, it is often useful to include evolutionary information for both the query and target proteins. One method to include this information is by the use of profile–profile alignments, where a profile from the query protein is compared with the profiles from the target proteins. Profile–profile alignments can be implemented in several fundamentally different ways. The similarity between two positions can be calculated using a dot‐product, a probabilistic model, or an information theoretical measure. Here, we present a large‐scale comparison of different profile–profile alignment methods. We show that the profile–profile methods perform at least 30% better than standard sequence‐profile methods both in their ability to recognize superfamily‐related proteins and in the quality of the obtained alignments. Although the performance of all methods is quite similar, profile–profile methods that use a probabilistic scoring function have an advantage as they can create good alignments and show a good fold recognition capacity using the same gap‐penalties, while the other methods need to use different parameters to obtain comparable performances. Proteins 2004.Keywords
This publication has 32 references indexed in Scilit:
- Using evolutionary information for the query and target improves fold recognitionProteins-Structure Function and Bioinformatics, 2003
- SATCHMO: sequence alignment and tree construction using hidden Markov modelsBioinformatics, 2003
- Within the twilight zone: a sensitive profile-profile comparison tool based on information theoryJournal of Molecular Biology, 2002
- Comparison of sequence profiles. Strategies for structural predictions using sequence informationProtein Science, 2000
- Improving the quality of twilight‐zone alignmentsProtein Science, 2000
- Identification of related proteins on family, superfamily and fold level 1 1Edited by F. C. CohenJournal of Molecular Biology, 2000
- Sequence comparisons using multiple sequences detect three times as many remote homologues as pairwise methodsJournal of Molecular Biology, 1998
- Comprehensive assessment of automatic structural alignment against a manual standard, the scop classification of proteinsProtein Science, 1998
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programsNucleic Acids Research, 1997
- Prediction of Protein Secondary Structure at Better than 70% AccuracyJournal of Molecular Biology, 1993