Optimization of the Sliding Window Size for Protein Structure Prediction
- 1 September 2006
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
Abstract
Sliding-window-based methods are frequently applied to predict various aspects of protein structure. Despite their widespread use, researchers have not established a standard window size; sizes ranging from 7 to 17 residues have been used in the past. To this end, this paper performs a computational study, based on a probabilistic approach, that aims to find an optimal sliding window size. The results show that formation of helical structure can be affected by amino acids (AAs) that are up to 9 positions away in the sequence, while the formation of coils and strands can be affected by AAs that are up to 3 and 6 positions away, respectively. Overall, our results suggest that a sliding window of 19 residues is optimal for secondary structure prediction, while for specific prediction tasks, such as prediction of β-strands, a smaller window size is sufficient. Finally, the 20 AAs are categorized into five groups based on their influence on the formation of secondary structure. The finding related to the optimal window size was confirmed by an independent experimental study on the prediction of protein secondary structure.
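To make the window sizes concrete: a window that covers neighbors up to k positions away on each side of a central residue contains 2k + 1 residues, so a reach of 9 positions corresponds to the 19-residue window reported above. The following is a minimal illustrative sketch of how such windows could be extracted from a sequence; it is not the authors' implementation. The function name `sliding_windows`, the padding symbol "X", and the choice to pad the sequence ends are all assumptions made for this example.

```python
def sliding_windows(sequence, window_size=19, pad="X"):
    """Return one window per residue, centered on that residue.

    Assumptions (not taken from the paper): the window size is odd, and
    positions beyond the sequence ends are filled with a padding symbol.
    """
    if window_size < 1 or window_size % 2 == 0:
        raise ValueError("window_size must be a positive odd integer")
    half = window_size // 2  # number of neighbors on each side
    padded = pad * half + sequence + pad * half
    # The window starting at index i of the padded string is centered
    # on residue i of the original sequence.
    return [padded[i:i + window_size] for i in range(len(sequence))]


if __name__ == "__main__":
    # Hypothetical short sequence, used only for illustration.
    for window in sliding_windows("ACDEFGH", window_size=3):
        print(window)
```

Under the same assumptions, a smaller window for a specific task would be obtained by passing a different `window_size`, for example 13 for a reach of 6 positions on each side.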