A structural pattern‐based method for protein fold recognition
- 14 May 2004
- journal article
- research article
- Published by Wiley in Proteins-Structure Function and Bioinformatics
- Vol. 56 (2) , 222-234
- https://doi.org/10.1002/prot.20073
Abstract
A method (SPREK) was developed to evaluate the register of a sequence on a structure based on the matching of structural patterns against a library derived from the protein structure databank. The scores obtained were normalized against random background distributions derived from sequence shuffling and permutation methods. ‘Random’ structures were also used to evaluate the effectiveness of the method. These were generated by a simple random walk and by a more sophisticated structure prediction method that produced protein-like folds. For comparison with other methods, the performance of the method was assessed using collections of models including decoys and models from the CASP-5 exercise. The performance of SPREK on the decoy models was equivalent to (and sometimes better than) that obtained with more complex approaches. An exception was the two smallest proteins, for which SPREK did not perform well due to a lack of patterns. Using the best parameter combination from trials on decoy models, the CASP models of intermediate difficulty were evaluated by SPREK, and the quality of the top-scoring model was assessed by its CASP ranking. Of the 14 targets in this class, half lie in the top 10% (out of around 140 models for each target). The two worst rankings resulted from the selection by our method of a well-packed model that was based on the wrong fold. Of the other poor rankings, one was the smallest protein and the others were the four largest (all over 250 residues). Proteins 2004.
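The normalization step described in the abstract can be illustrated with a minimal sketch. The abstract does not give the SPREK scoring function or the exact normalization formula, so this example assumes a Z-score against a background of shuffled sequences. The names `shuffled_z_score` and `toy_score`, the buried-position set, and the residue alphabet are all hypothetical and are not taken from the paper.

```python
import random
import statistics
from typing import Callable


def shuffled_z_score(
    sequence: str,
    score_fn: Callable[[str], float],
    n_shuffles: int = 200,
    seed: int = 0,
) -> float:
    """Z-score of the native score against a shuffled-sequence background.

    Assumption: a Z-score is used here only for illustration; the abstract
    says scores were normalized against random backgrounds but gives no formula.
    """
    rng = random.Random(seed)
    native = score_fn(sequence)

    residues = list(sequence)
    background = []
    for _ in range(n_shuffles):
        rng.shuffle(residues)  # keeps composition, destroys residue order
        background.append(score_fn("".join(residues)))

    mean = statistics.fmean(background)
    sd = statistics.pstdev(background)
    if sd == 0.0:
        # The score does not depend on residue order, so it carries no signal.
        return 0.0
    return (native - mean) / sd


# Hypothetical stand-in for a sequence-structure compatibility score:
# count hydrophobic residues that fall on positions assumed to be buried.
BURIED_POSITIONS = {0, 1, 2, 3, 4}
HYDROPHOBIC = set("AVLIMFW")


def toy_score(seq: str) -> float:
    return float(sum(1 for i, aa in enumerate(seq)
                     if i in BURIED_POSITIONS and aa in HYDROPHOBIC))


if __name__ == "__main__":
    z = shuffled_z_score("LLLLLDDDDDDDDDD", toy_score)
    print(f"Z-score of the native register: {z:.2f}")
```

Shuffling preserves amino acid composition, so a positive Z-score reflects the ordering of residues along the structure rather than composition alone. Any real scoring function with the same signature could replace `toy_score`.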