Protein 8‐class secondary structure prediction using conditional neural fields
- 1 August 2011
- journal article
- research article
- Published by Wiley in Proteomics
- Vol. 11 (19) , 3786-3792
- https://doi.org/10.1002/pmic.201100196
Abstract
Compared with the protein 3‐class secondary structure (SS) prediction, the 8‐class prediction gains less attention and is also much more challenging, especially for proteins with few sequence homologs. This paper presents a new probabilistic method for 8‐class SS prediction using conditional neural fields (CNFs), a recently invented probabilistic graphical model. This CNF method not only models the complex relationship between sequence features and SS, but also exploits the interdependency among SS types of adjacent residues. In addition to sequence profiles, our method also makes use of non‐evolutionary information for SS prediction. Tested on the CB513 and RS126 data sets, our method achieves Q8 accuracy of 64.9 and 64.7%, respectively, which are much better than the SSpro8 web server (51.0 and 48.0%, respectively). Our method can also be used to predict other structure properties (e.g. solvent accessibility) of a protein or the SS of RNA.Keywords
Funding Information
- National Institutes of Health (R01GM089753)
- National Science Foundation (DBI-0960390)
- TeraGrid for their computational resources (TG-MCB100062, TG-CCR100005)
This publication has 33 references indexed in Scilit:
- Protein Secondary Structure PredictionPublished by Springer Nature ,2009
- Mimicking the folding pathway to improve homology-free protein structure predictionProceedings of the National Academy of Sciences, 2009
- Improved residue contact prediction using support vector machines and a large feature setBMC Bioinformatics, 2007
- Identifying sequence regions undergoing conformational change via predicted continuum secondary structureBioinformatics, 2006
- Prediction of protein continuum secondary structure with probabilistic models based on NMR solved structuresBMC Bioinformatics, 2006
- Preorganized secondary structure as an important determinant of fast protein folding.Nature Structural & Molecular Biology, 2001
- Protein folding dynamics: The diffusion‐collision model and experimental dataProtein Science, 1994
- Predicting the secondary structure of globular proteins using neural network modelsJournal of Molecular Biology, 1988
- Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical featuresBiopolymers, 1983
- The structure of proteins: Two hydrogen-bonded helical configurations of the polypeptide chainProceedings of the National Academy of Sciences, 1951