Improving disulfide connectivity prediction with sequential distance between oxidized cysteines
Open Access
- 13 October 2005
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 21 (24) , 4416-4419
- https://doi.org/10.1093/bioinformatics/bti715
Abstract
Summary: Predicting disulfide connectivity precisely helps towards the solution of protein structure prediction. In this study, a descriptor derived from the sequential distance between oxidized cysteines (denoted as DOC) is proposed. An approach using support vector machine (SVM) method based on weighted graph matching was further developed to predict the disulfide connectivity pattern in proteins. When DOC was applied, prediction accuracy of 63% for our SVM models could be achieved, which is significantly higher than those obtained from previous approaches. The results show that using the non-local descriptor DOC coupled with local sequence profiles significantly improves the prediction accuracy. These improvements demonstrate that DOC, with a proper scaling scheme, is an effective feature for the prediction of disulfide connectivity. The method developed in this work is available at the web server PreCys (prediction of cys–cys linkages of proteins). Availability: Contact:cykao@csie.ntu.edu.tw Supplementary information: Supplementary data, detailed results, tables and information are available atKeywords
This publication has 10 references indexed in Scilit:
- Disulfide connectivity prediction using secondary structure information and diresidue frequenciesBioinformatics, 2005
- Cysteine separations profiles on protein sequences infer disulfide connectivityBioinformatics, 2004
- Disulfide connectivity prediction using recursive neural networks and evolutionary informationBioinformatics, 2004
- Prediction of disulfide connectivity in proteinsBioinformatics, 2001
- Probabilities for SV MachinesPublished by MIT Press ,2000
- Disulfide Bonds and Protein FoldingBiochemistry, 2000
- The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000Nucleic Acids Research, 2000
- Ab Initio fold prediction of small helical proteins using distance geometry and knowledge-based scoring functions 1 1Edited by F. CohenJournal of Molecular Biology, 1999
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programsNucleic Acids Research, 1997
- Analysis and Classification of Disulphide Connectivity in ProteinsJournal of Molecular Biology, 1994