Prediction of β-Turns in Proteins Using the First-Order Markov Models

29 December 2001

journal article
research article
Published by American Chemical Society (ACS) in Journal of Chemical Information and Computer Sciences

Vol. 42 (1) , 123-133
https://doi.org/10.1021/ci0103020

Abstract

[[abstract]]We present a method based on the first-order Markov models for predicting simple beta-turns and loops containing multiple turns in proteins. Sequences of 338 proteins in a database are divided using the published turn criteria into the following three regions, namely, the turn, the boundary, and the nonturn ones. A transition probability matrix is constructed for either the turn or the nonturn region using the weighted, transition probabilities computed for dipeptides identified from each region. There are two such matrices constructed for the boundary region since the transition probabilities for dipeptides immediately preceding or following a turn are different. The window used for scanning a protein sequence from amino (N-) to carboxyl (C-) terminal is a hexapeptide since the transition probability computed for a turn tetrapeptide is capped at both the N- and C- termini with a boundary transition probability indexed respectively from the two boundary transition matrices. A sum of the averaged product of the transition probabilities of all the hexapeptides involving each residue is computed. This is then weighted with a probability computed from assuming that all the hexapeptides are from the nonturn region to give the final prediction quantity. Both simple beta-turns and loops containing multiple turns in a protein are then identified by the rising of the prediction quantity computed. The performance of the prediction scheme or the percentage (%) of correct prediction is evaluated through computation of Matthews correlation coefficients for each protein predicted. It is found that the prediction method is capable of giving prediction results with better correlation between the percent of correct prediction and the Matthews correlation coefficients for a group of test proteins as compared with those predicted using some secondary structural prediction methods. The prediction accuracy for about 40% of proteins in the database or 50% of proteins in the test set is better than 70%. Such a percentage for the test set is reduced to 30 if the structures of all the proteins in the set are treated as unknown.[[fileno]]2050126010030[[department]]生科

This publication has 24 references indexed in Scilit:

Analysis and prediction of the different types of β-turn in proteins
Published by Elsevier ,2004
Are turns required for the folding of ribonuclease T1?
Protein Science, 1996
Detection of new genes in a bacterial genome using Markov models for three gene classes
Nucleic Acids Research, 1995
Hidden Markov Models in Computational Biology
Journal of Molecular Biology, 1994
The role of turns in the structure of an α-helical protein
Nature, 1993
Analysis of the effectiveness of proline substitutions and glycine replacements in increasing the stability of phage T4 lysozyme
Biopolymers, 1992
Turn prediction in proteins using a pattern-matching approach
Biochemistry, 1986
Stabilization of λ repressor against thermal denaturation by site‐directed Gly→Ala changes in α‐helix 3
Proteins-Structure Function and Bioinformatics, 1986
Influence of temperature on the intrinsic viscosities of proteins in random coil conformation
Biochemistry, 1974
Stereochemical criteria for polypeptides and proteins. V. Conformation of a system of three linked peptide units
Biopolymers, 1968