Prediction and Classification of Human G-protein Coupled Receptors Based on Support Vector Machines

Abstract
A computational system for the prediction and classification of human G-protein coupled receptors (GPCRs) has been developed based on the support vector machine (SVM) method and protein sequence information. The feature vectors used to develop the SVM prediction models consist of statistically significant features selected from single amino acid, dipeptide, and tripeptide compositions of protein sequences. Furthermore, the length distribution difference between GPCRs and non-GPCRs has also been exploited to improve the prediction performance. The testing results with annotated human protein sequences demonstrate that this system can get good performance for both prediction and classification of human GPCRs.
Funding Information
  • Ministry of Education of the People's Republic of China (505010, CG2003-GA002)
  • National Natural Science Foundation of China (30370354, 90203011)