Prediction of β-Turns in Proteins Using the First-Order Markov Models
- 29 December 2001
- journal article
- research article
- Published by American Chemical Society (ACS) in Journal of Chemical Information and Computer Sciences
- Vol. 42 (1), 123-133
- https://doi.org/10.1021/ci0103020
Abstract
We present a method based on first-order Markov models for predicting simple β-turns and loops containing multiple turns in proteins. Using published turn criteria, the sequences of 338 proteins in a database are divided into three regions: turn, boundary, and nonturn. A transition probability matrix is constructed for each of the turn and nonturn regions from the weighted transition probabilities of the dipeptides identified in that region. Two such matrices are constructed for the boundary region, because the transition probabilities of dipeptides immediately preceding a turn differ from those of dipeptides immediately following one. A protein sequence is scanned from the amino (N-) to the carboxyl (C-) terminus with a hexapeptide window: the transition probability computed for a turn tetrapeptide is capped at the N- and C-termini with boundary transition probabilities taken from the respective boundary matrices. For each residue, a sum of the averaged product of the transition probabilities of all hexapeptides involving that residue is computed. This sum is then weighted by a probability computed under the assumption that all the hexapeptides come from the nonturn region, giving the final prediction quantity. Both simple β-turns and loops containing multiple turns are then identified by a rise in the prediction quantity. The performance of the prediction scheme, that is, the percentage of correct predictions, is evaluated by computing the Matthews correlation coefficient for each protein predicted. For a group of test proteins, the method gives a better correlation between the percentage of correct predictions and the Matthews correlation coefficients than some secondary-structure prediction methods do.
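The scanning scheme described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not specify the weighting of the dipeptide counts or the exact averaging, so the pseudocount smoothing, the ratio used as the prediction quantity, and all function names (`transition_matrix`, `chain_probability`, `residue_scores`, `matthews_cc`) are assumptions made for illustration.

```python
from math import prod, sqrt

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

def transition_matrix(segments, pseudocount=1.0):
    """Row-normalized dipeptide transition probabilities P(next | current).

    `segments` are sequence fragments from one region (turn, nonturn, or a
    boundary). The pseudocount is an assumption, not the paper's weighting.
    """
    counts = {a: {b: pseudocount for b in AMINO_ACIDS} for a in AMINO_ACIDS}
    for seg in segments:
        for a, b in zip(seg, seg[1:]):
            counts[a][b] += 1.0
    matrix = {}
    for a, row in counts.items():
        total = sum(row.values())
        matrix[a] = {b: c / total for b, c in row.items()}
    return matrix

def chain_probability(peptide, matrices):
    """Product of transition probabilities, one matrix per consecutive pair."""
    return prod(m[a][b] for m, a, b in zip(matrices, peptide, peptide[1:]))

def residue_scores(seq, pre, turn, post, nonturn, window=6):
    """Per-residue prediction quantity (assumed form: turn model / nonturn model).

    Each hexapeptide is scored as a turn tetrapeptide (three turn transitions)
    capped by one preceding and one following boundary transition.
    """
    turn_model = [pre, turn, turn, turn, post]
    null_model = [nonturn] * (window - 1)
    turn_sum = [0.0] * len(seq)
    null_sum = [0.0] * len(seq)
    for start in range(len(seq) - window + 1):
        hexa = seq[start:start + window]
        p_turn = chain_probability(hexa, turn_model)
        p_null = chain_probability(hexa, null_model)
        # Credit every residue covered by this window.
        for i in range(start, start + window):
            turn_sum[i] += p_turn
            null_sum[i] += p_null
    # Both sums cover the same windows, so their ratio equals the ratio of averages.
    return [t / u if u > 0 else 0.0 for t, u in zip(turn_sum, null_sum)]

def matthews_cc(predicted, observed):
    """Matthews correlation coefficient for two equal-length 0/1 sequences."""
    pairs = list(zip(predicted, observed))
    tp = sum(1 for p, o in pairs if p and o)
    tn = sum(1 for p, o in pairs if not p and not o)
    fp = sum(1 for p, o in pairs if p and not o)
    fn = sum(1 for p, o in pairs if not p and o)
    denom = sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    return (tp * tn - fp * fn) / denom if denom else 0.0
```

In this sketch, residues whose score rises above that of their neighbors would be candidate turn positions; the threshold for calling a "rise" is left open, as the abstract does not state one.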
The prediction accuracy is better than 70% for about 40% of the proteins in the database and for 50% of the proteins in the test set. The latter figure falls to 30% if the structures of all the proteins in the test set are treated as unknown.
This publication has 24 references indexed in Scilit:
- Analysis and prediction of the different types of β-turn in proteins. Published by Elsevier, 2004
- Are turns required for the folding of ribonuclease T1? Protein Science, 1996
- Detection of new genes in a bacterial genome using Markov models for three gene classes. Nucleic Acids Research, 1995
- Hidden Markov Models in Computational Biology. Journal of Molecular Biology, 1994
- The role of turns in the structure of an α-helical protein. Nature, 1993
- Analysis of the effectiveness of proline substitutions and glycine replacements in increasing the stability of phage T4 lysozyme. Biopolymers, 1992
- Turn prediction in proteins using a pattern-matching approach. Biochemistry, 1986
- Stabilization of λ repressor against thermal denaturation by site‐directed Gly→Ala changes in α‐helix 3. Proteins-Structure Function and Bioinformatics, 1986
- Influence of temperature on the intrinsic viscosities of proteins in random coil conformation. Biochemistry, 1974
- Stereochemical criteria for polypeptides and proteins. V. Conformation of a system of three linked peptide units. Biopolymers, 1968