Prediction of Ubiquitination Sites by Using the Composition of k-Spaced Amino Acid Pairs
Open Access
- 29 July 2011
- journal article
- research article
- Published by Public Library of Science (PLoS) in PLOS ONE
- Vol. 6 (7) , e22930
- https://doi.org/10.1371/journal.pone.0022930
Abstract
As one of the most important reversible protein post-translation modifications, ubiquitination has been reported to be involved in lots of biological processes and closely implicated with various diseases. To fully decipher the molecular mechanisms of ubiquitination-related biological processes, an initial but crucial step is the recognition of ubiquitylated substrates and the corresponding ubiquitination sites. Here, a new bioinformatics tool named CKSAAP_UbSite was developed to predict ubiquitination sites from protein sequences. With the assistance of Support Vector Machine (SVM), the highlight of CKSAAP_UbSite is to employ the composition of k-spaced amino acid pairs surrounding a query site (i.e. any lysine in a query sequence) as input. When trained and tested in the dataset of yeast ubiquitination sites (Radivojac et al, Proteins, 2010, 78: 365–380), a 100-fold cross-validation on a 1∶1 ratio of positive and negative samples revealed that the accuracy and MCC of CKSAAP_UbSite reached 73.40% and 0.4694, respectively. The proposed CKSAAP_UbSite has also been intensively benchmarked to exhibit better performance than some existing predictors, suggesting that it can be served as a useful tool to the community. Currently, CKSAAP_UbSite is freely accessible at http://protein.cau.edu.cn/cksaap_ubsite/. Moreover, we also found that the sequence patterns around ubiquitination sites are not conserved across different species. To ensure a reasonable prediction performance, the application of the current CKSAAP_UbSite should be limited to the proteome of yeast.Keywords
This publication has 39 references indexed in Scilit:
- DescFold: A web server for protein fold recognitionBMC Bioinformatics, 2009
- Computational Identification of Protein Methylation Sites through Bi-Profile Bayes Feature ExtractionPLOS ONE, 2009
- GPS 2.0, a Tool to Predict Kinase-specific Phosphorylation Sites in HierarchyMolecular & Cellular Proteomics, 2008
- GANNPhos: a new phosphorylation site predictor based on a genetic algorithm integrated neural networkProtein Engineering, Design and Selection, 2007
- NetPhosYeast: prediction of protein phosphorylation sites in yeastBioinformatics, 2007
- PPSP: prediction of PK-specific phosphorylation site with Bayesian decision theoryBMC Bioinformatics, 2006
- A subset of membrane-associated proteins is ubiquitinated in response to mutations in the endoplasmic reticulum degradation machineryProceedings of the National Academy of Sciences, 2003
- Predicting Membrane Protein Types Using Residue-pair Models Based on Reduced Similarity DatasetJournal of Biomolecular Structure and Dynamics, 2002
- The Ubiquitin-Proteasome Pathway and Pathogenesis of Human DiseasesAnnual Review of Medicine, 1999
- THE UBIQUITIN SYSTEMAnnual Review of Biochemistry, 1998