Evaluation of PSI‐BLAST alignment accuracy in comparison to structural alignments
- 1 January 2000
- journal article
- Published by Wiley in Protein Science
- Vol. 9 (11) , 2278-2284
- https://doi.org/10.1110/ps.9.11.2278
Abstract
The PSI-BLAST algorithm has been acknowledged as one of the most powerful tools for detecting remote evolutionary relationships by sequence considerations only. This has been demonstrated by its ability to recognize remote structural homologues and by the greatest coverage it enables in annotation of a complete genome. Although recognizing the correct fold of a sequence is of major importance, the accuracy of the alignment is crucial for the success of modeling one sequence by the structure of its remote homologue. Here we assess the accuracy of PSI-BLAST alignments on a stringent database of 123 structurally similar, sequence-dissimilar pairs of proteins, by comparing them to the alignments defined on a structural basis. Each protein sequence is compared to a nonredundant database of the protein sequences by PSI-BLAST. Whenever a pair member detects its pair-mate, the positions that are aligned both in the sequential and structural alignments are determined, and the alignment sensitivity is expressed as the percentage of these positions out of the structural alignment. Fifty-two sequences detected their pair-mates (for 16 pairs the success was bi-directional when either pair member was used as a query). The average percentage of correctly aligned residues per structural alignment was 43.5+/-2.2%. Other properties of the alignments were also examined, such as the sensitivity vs. specificity and the change in these parameters over consecutive iterations. Notably, there is an improvement in alignment sensitivity over consecutive iterations, reaching an average of 50.9+/-2.5% within the five iterations tested in the current study.Keywords
This publication has 37 references indexed in Scilit:
- Structure-based evaluation of sequence comparison and fold recognition alignment accuracyJournal of Molecular Biology, 2000
- Characterization of novel proteins based on known protein structuresJournal of Molecular Biology, 2000
- Comparison of sequence profiles. Strategies for structural predictions using sequence informationProtein Science, 2000
- GenTHREADER: an efficient and reliable protein fold recognition method for genomic sequencesJournal of Molecular Biology, 1999
- Sequence comparisons using multiple sequences detect three times as many remote homologues as pairwise methodsJournal of Molecular Biology, 1998
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programsNucleic Acids Research, 1997
- Enlarged representative set of protein structuresProtein Science, 1994
- Hidden Markov Models in Computational BiologyJournal of Molecular Biology, 1994
- A new approach to protein fold recognitionNature, 1992
- One thousand families for the molecular biologistNature, 1992