A neural-network based method for prediction of γ-turns in proteins from multiple sequence alignment

1 May 2003

journal article
research article
Published by Wiley in Protein Science

Vol. 12 (5) , 923-929
https://doi.org/10.1110/ps.0241703

Abstract

In the present study, an attempt has been made to develop a method for predicting γ‐turns in proteins. First, we have implemented the commonly used statistical and machine‐learning techniques in the field of protein structure prediction, for the prediction of γ‐turns. All the methods have been trained and tested on a set of 320 nonhomologous protein chains by a fivefold cross‐validation technique. It has been observed that the performance of all methods is very poor, having a Matthew's Correlation Coefficient (MCC) ≤ 0.06. Second, predicted secondary structure obtained from PSIPRED is used in γ‐turn prediction. It has been found that machine‐learning methods outperform statistical methods and achieve an MCC of 0.11 when secondary structure information is used. The performance of γ‐turn prediction is further improved when multiple sequence alignment is used as the input instead of a single sequence. Based on this study, we have developed a method, GammaPred, for γ‐turn prediction (MCC = 0.17). The GammaPred is a neural‐network‐based method, which predicts γ‐turns in two steps. In the first step, a sequence‐to‐structure network is used to predict the γ‐turns from multiple alignment of protein sequence. In the second step, it uses a structure‐to‐structure network in which input consists of predicted γ‐turns obtained from the first step and predicted secondary structure obtained from PSIPRED. (A Web server based on GammaPred is available at http://www.imtech.res.in/raghava/gammapred/.)

Keywords

This publication has 20 references indexed in Scilit:

Prediction of β‐turns in proteins from multiple alignment using neural network
Protein Science, 2003
An evaluation of β-turn prediction methods
Bioinformatics, 2002
BetaTPred: prediction of β-TURNS in a protein using statistical algorithms
Bioinformatics, 2002
Protein secondary structure prediction based on position-specific scoring matrices 1 1Edited by G. Von Heijne
Journal of Molecular Biology, 1999
Prediction of the location and type of β‐turns in proteins using neural networks
Protein Science, 1999
Gapped BLAST and PSI-BLAST: a new generation of protein database search programs
Nucleic Acids Research, 1997
PROMOTIF—A program to identify and analyze structural motifs in proteins
Protein Science, 1996
Learning representations by back-propagating errors
Nature, 1986
Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical features
Biopolymers, 1983
Analysis of the accuracy and implications of simple methods for predicting the secondary structure of globular proteins
Journal of Molecular Biology, 1978