POPI: predicting immunogenicity of MHC class I binding peptides by mining informative physicochemical properties
Open Access
- 24 March 2007
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 23 (8) , 942-949
- https://doi.org/10.1093/bioinformatics/btm061
Abstract
Motivation: Both modeling of antigen-processing pathway including major histocompatibility complex (MHC) binding and immunogenicity prediction of those MHC-binding peptides are essential to develop a computer-aided system of peptide-based vaccine design that is one goal of immunoinformatics. Numerous studies have dealt with modeling the immunogenic pathway but not the intractable problem of immunogenicity prediction due to complex effects of many intrinsic and extrinsic factors. Moderate affinity of the MHC–peptide complex is essential to induce immune responses, but the relationship between the affinity and peptide immunogenicity is too weak to use for predicting immunogenicity. This study focuses on mining informative physicochemical properties from known experimental immunogenicity data to understand immune responses and predict immunogenicity of MHC-binding peptides accurately. Results: This study proposes a computational method to mine a feature set of informative physicochemical properties from MHC class I binding peptides to design a support vector machine (SVM) based system (named POPI) for the prediction of peptide immunogenicity. High performance of POPI arises mainly from an inheritable bi-objective genetic algorithm, which aims to automatically determine the best number m out of 531 physicochemical properties, identify these m properties and tune SVM parameters simultaneously. The dataset consisting of 428 human MHC class I binding peptides belonging to four classes of immunogenicity was established from MHCPEP, a database of MHC-binding peptides (Brusic et al., 1998). POPI, utilizing the m = 23 selected properties, performs well with the accuracy of 64.72% using leave-one-out cross-validation, compared with two sequence alignment-based prediction methods ALIGN (54.91%) and PSI-BLAST (53.23%). POPI is the first computational system for prediction of peptide immunogenicity based on physicochemical properties. Availability: A web server for prediction of peptide immunogenicity (POPI) and the used dataset of MHC class I binding peptides (PEPMHCI) are available at http://iclab.life.nctu.edu.tw/POPI Contact:syho@mail.nctu.edu.twKeywords
This publication has 30 references indexed in Scilit:
- Prediction of protein structural class with Rough SetsBMC Bioinformatics, 2006
- Integrated modeling of the major events in the MHC class I antigen processing pathwayProtein Science, 2005
- Pcleavage: an SVM based method for prediction of constitutive proteasome and immunoproteasome cleavage sites in antigenic sequencesNucleic Acids Research, 2005
- Benchmarking B cell epitope prediction: Underperformance of existing methodsProtein Science, 2005
- Analysis and prediction of affinity of TAP binding peptides using cascade SVMProtein Science, 2004
- Prediction of MHC class I binding peptides, using SVMHCBMC Bioinformatics, 2002
- MHCPEP, a database of MHC-binding peptides: update 1997Nucleic Acids Research, 1998
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programsNucleic Acids Research, 1997
- Statistical comparison of established T-cell epitope predictors against a large database of human and murine antigensMolecular Immunology, 1996
- Efficient MHC class I-peptide binding is required but does not ensure MHC class I-restricted immunogenicityMolecular Immunology, 1994