A neural-network based method for prediction of γ-turns in proteins from multiple sequence alignment
- 1 May 2003
- journal article
- research article
- Published by Wiley in Protein Science
- Vol. 12 (5) , 923-929
- https://doi.org/10.1110/ps.0241703
Abstract
In the present study, an attempt has been made to develop a method for predicting γ‐turns in proteins. First, we have implemented the commonly used statistical and machine‐learning techniques in the field of protein structure prediction, for the prediction of γ‐turns. All the methods have been trained and tested on a set of 320 nonhomologous protein chains by a fivefold cross‐validation technique. It has been observed that the performance of all methods is very poor, having a Matthew's Correlation Coefficient (MCC) ≤ 0.06. Second, predicted secondary structure obtained from PSIPRED is used in γ‐turn prediction. It has been found that machine‐learning methods outperform statistical methods and achieve an MCC of 0.11 when secondary structure information is used. The performance of γ‐turn prediction is further improved when multiple sequence alignment is used as the input instead of a single sequence. Based on this study, we have developed a method, GammaPred, for γ‐turn prediction (MCC = 0.17). The GammaPred is a neural‐network‐based method, which predicts γ‐turns in two steps. In the first step, a sequence‐to‐structure network is used to predict the γ‐turns from multiple alignment of protein sequence. In the second step, it uses a structure‐to‐structure network in which input consists of predicted γ‐turns obtained from the first step and predicted secondary structure obtained from PSIPRED. (A Web server based on GammaPred is available at http://www.imtech.res.in/raghava/gammapred/.)Keywords
This publication has 20 references indexed in Scilit:
- Prediction of β‐turns in proteins from multiple alignment using neural networkProtein Science, 2003
- An evaluation of β-turn prediction methodsBioinformatics, 2002
- BetaTPred: prediction of β-TURNS in a protein using statistical algorithmsBioinformatics, 2002
- Protein secondary structure prediction based on position-specific scoring matrices 1 1Edited by G. Von HeijneJournal of Molecular Biology, 1999
- Prediction of the location and type of β‐turns in proteins using neural networksProtein Science, 1999
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programsNucleic Acids Research, 1997
- PROMOTIF—A program to identify and analyze structural motifs in proteinsProtein Science, 1996
- Learning representations by back-propagating errorsNature, 1986
- Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical featuresBiopolymers, 1983
- Analysis of the accuracy and implications of simple methods for predicting the secondary structure of globular proteinsJournal of Molecular Biology, 1978