A structural pattern‐based method for protein fold recognition

14 May 2004

journal article
research article
Published by Wiley in Proteins-Structure Function and Bioinformatics

Vol. 56 (2) , 222-234
https://doi.org/10.1002/prot.20073

Abstract

A method (SPREK) was developed to evaluate the register of a sequence on a structure based on the matching of structural patterns against a library derived from the protein structure databank. The scores obtained were normalized against random background distributions derived from sequence shuffling and permutation methods. ‘Random’ structures were also used to evaluate the effectiveness of the method. These were generated by a simple random-walk and a more sophisticated structure prediction method that produced protein-like folds. For comparison with other methods, the performance of the method was assessed using collections of models including decoys and models from the CASP-5 exercise. The performance of SPREK on the decoy models was equivalent to (and sometimes better than) those obtained with more complex approaches. An exception was the two smallest proteins, for which SPREK did not perform well due to a lack of patterns. Using the best parameter combination from trials on decoy models, the CASP models of intermediate difficulty were evaluated by SPREK and the quality of the top scoring model was evaluated by its CASP ranking. Of the 14 targets in this class, half lie in the top 10% (out of around 140 models for each target). The two worst rankings resulted from the selection by our method of a well-packed model that was based on the wrong fold. Of the other poor rankings, one was the smallest protein and the others were the four largest (all over 250 residues). Proteins 2004.

Keywords

This publication has 25 references indexed in Scilit:

Threading Using Neural nEtwork (TUNE): the measure of protein sequence—structure compatibility
Bioinformatics, 2002
Defining linear segments in protein structure 1 1Edited by J. Thornton
Journal of Molecular Biology, 2001
Searching the Protein Structure Databank with Weak Sequence Patterns and Structural Constraints
Journal of Molecular Biology, 2000
Protein secondary structure prediction based on position-specific scoring matrices 1 1Edited by G. Von Heijne
Journal of Molecular Biology, 1999
HOMSTRAD: A database of protein structure alignments for homologous families
Protein Science, 1998
CAMPASS: a database of structurally aligned protein superfamilies
Structure, 1998
Multiple sequence threading: an analysis of alignment quality and stability
Journal of Molecular Biology, 1997
Energy Functions that Discriminate X-ray and Near-native Folds from Well-constructed Decoys
Journal of Molecular Biology, 1996
Prediction of Protein Structure by Evaluation of Sequence-structure Fitness
Journal of Molecular Biology, 1993
Calculation of conformational ensembles from potentials of mena force
Journal of Molecular Biology, 1990