Machine learning approaches for prediction of linear B‐cell epitopes on proteins
- 5 April 2006
- journal article
- research article
- Published by Wiley in Journal of Molecular Recognition
- Vol. 19 (3) , 200-208
- https://doi.org/10.1002/jmr.771
Abstract
Identification and characterization of antigenic determinants on proteins has received considerable attention utilizing both, experimental as well as computational methods. For computational routines mostly structural as well as physicochemical parameters have been utilized for predicting the antigenic propensity of protein sites. However, the performance of computational routines has been low when compared to experimental alternatives. Here we describe the construction of machine learning based classifiers to enhance the prediction quality for identifying linear B-cell epitopes on proteins. Our approach combines several parameters previously associated with antigenicity, and includes novel parameters based on frequencies of amino acids and amino acid neighborhood propensities. We utilized machine learning algorithms for deriving antigenicity classification functions assigning antigenic propensities to each amino acid of a given protein sequence. We compared the prediction quality of the novel classifiers with respect to established routines for epitope scoring, and tested prediction accuracy on experimental data available for HIV proteins. The major finding is that machine learning classifiers clearly outperform the reference classification systems on the HIV epitope validation set. Copyright © 2006 John Wiley & Sons, Ltd.Keywords
This publication has 20 references indexed in Scilit:
- Benchmarking B cell epitope prediction: Underperformance of existing methodsProtein Science, 2005
- Analysis of known bacterial protein vaccine antigens reveals biased physical properties and amino acid compositionComparative and Functional Genomics, 2003
- BEPITOPE: predicting the location of continuous epitopes and patterns in proteinsJournal of Molecular Recognition, 2003
- Identification of in vivo expressed vaccine candidate antigens from Staphylococcus aureusProceedings of the National Academy of Sciences, 2002
- Antigenicity and Immunogenicity of Synthetic PeptidesBiologicals, 2001
- A semi‐empirical method for prediction of antigenic determinants on protein antigensFEBS Letters, 1990
- Prediction of sequential antigenic regions in proteinsFEBS Letters, 1985
- Prediction of chain flexibility in proteinsThe Science of Nature, 1985
- Correlation between segmental mobility and the location of antigenic determinants in proteinsNature, 1984
- Prediction of protein antigenic determinants from amino acid sequences.Proceedings of the National Academy of Sciences, 1981