Using support vector machine combined with auto covariance to predict protein–protein interactions from protein sequences
Open Access
- 4 April 2008
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 36 (9) , 3025-3030
- https://doi.org/10.1093/nar/gkn159
Abstract
Compared with the number of protein sequences available for different organisms, the number of known protein–protein interactions (PPIs) is still very limited, so many computational methods have been developed to facilitate the identification of novel PPIs. Methods that use only protein sequence information are more universal than those that depend on additional information or predictions about the proteins. In this article, a sequence-based method is proposed that combines a new feature representation using auto covariance (AC) with a support vector machine (SVM). AC accounts for the interactions between residues a certain distance apart in the sequence, so the method takes neighbouring effects into account. Applied to the PPI data of the yeast Saccharomyces cerevisiae, the method achieved a very promising prediction result. When the prediction model was evaluated on an independent data set of 11 474 yeast PPIs, the prediction accuracy was 88.09%. The performance of this method is superior to that of existing sequence-based methods, so it can be a useful supplementary tool for future proteomics studies. The prediction software and all data sets used in this article are freely available at http://www.scucic.cn/Predict_PPI/index.htm.
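To make the idea of auto covariance more concrete, the following is a minimal sketch rather than the authors' implementation. It assumes the common definition of AC for a lag `lag` as the average product of mean-centred descriptor values of residues `lag` positions apart. The Kyte-Doolittle hydropathy scale is used only as an illustrative stand-in descriptor; the abstract does not state which descriptors, lag range, or normalisation the method uses, and the function names (`auto_covariance`, `ac_features`, `pair_features`) are hypothetical.

```python
# Illustrative descriptor: Kyte-Doolittle hydropathy values per amino acid.
# This is an assumption for the sketch, not the descriptor set from the article.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def auto_covariance(values, max_lag):
    """Return AC values for lags 1..max_lag of a numeric sequence.

    AC(lag) = 1/(n - lag) * sum_i (x_i - mean) * (x_{i+lag} - mean)
    """
    n = len(values)
    if max_lag >= n:
        raise ValueError("max_lag must be smaller than the sequence length")
    mean = sum(values) / n
    centred = [v - mean for v in values]
    return [
        sum(centred[i] * centred[i + lag] for i in range(n - lag)) / (n - lag)
        for lag in range(1, max_lag + 1)
    ]


def ac_features(sequence, max_lag, scale=KYTE_DOOLITTLE):
    """Map a protein sequence to descriptor values, then to AC features."""
    values = [scale[residue] for residue in sequence]
    return auto_covariance(values, max_lag)


def pair_features(seq_a, seq_b, max_lag):
    """Concatenate the AC features of two proteins into one fixed-length vector."""
    return ac_features(seq_a, max_lag) + ac_features(seq_b, max_lag)


if __name__ == "__main__":
    # Two short made-up sequences, used only to show the output shape.
    vector = pair_features("ACDEFGHIK", "LMNPQRSTVWY", max_lag=3)
    print(len(vector), vector)
```

Because the feature vector has a fixed length regardless of protein length (here `2 * max_lag` for one descriptor), vectors of this kind can be passed to a standard classifier such as an SVM, with interacting pairs as positive examples and non-interacting pairs as negative examples.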