AllerHunter: A SVM-Pairwise System for Assessment of Allergenicity and Allergic Cross-Reactivity in Proteins
Open Access
- 10 June 2009
- journal article
- research article
- Published by Public Library of Science (PLoS) in PLOS ONE
- Vol. 4 (6) , e5861
- https://doi.org/10.1371/journal.pone.0005861
Abstract
Allergy is a major health problem in industrialized countries. The number of transgenic food crops is growing rapidly, creating the need for allergenicity assessment before they are introduced into the human food chain. While existing bioinformatic methods have achieved good accuracies for highly conserved sequences, the discrimination of allergens and non-allergens from allergen-like non-allergen sequences remains difficult. We describe AllerHunter, a web-based computational system for the assessment of potential allergenicity and allergic cross-reactivity in proteins. It combines an iterative pairwise sequence similarity encoding scheme with a support vector machine (SVM) as the discriminating engine. The pairwise vectorization framework allows the system to model essential features in allergens that are involved in cross-reactivity but are not limited to distinct sets of physicochemical properties. The system was rigorously trained and tested using 1,356 known allergen and 13,449 putative non-allergen sequences. Extensive testing was performed for validation of the prediction models. The system is effective for distinguishing allergens and non-allergens from allergen-like non-allergen sequences. Testing results showed that AllerHunter, with a sensitivity of 83.4% and specificity of 96.4% (accuracy = 95.3%, area under the receiver operating characteristic curve AROC = 0.928±0.004 and Matthews correlation coefficient MCC = 0.738), performs significantly better than a number of existing methods using an independent dataset of 1,443 protein sequences. AllerHunter is available at http://tiger.dbs.nus.edu.sg/AllerHunter
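The pairwise encoding idea and the reported evaluation metrics can be illustrated with a minimal Python sketch. It is not AllerHunter's implementation: the abstract does not specify the alignment scoring, the iterative procedure, or the SVM configuration. The function names (`local_alignment_score`, `pairwise_vector`, `binary_metrics`), the toy match/mismatch/gap scores, and the example sequences are all assumptions made for illustration. Each protein is turned into a vector of similarity scores against a fixed reference set; such vectors could then be passed to any SVM implementation.

```python
from math import sqrt


def local_alignment_score(a, b, match=2, mismatch=-1, gap=-2):
    """Smith-Waterman local alignment score.

    The match/mismatch/gap values are toy assumptions; a real system
    would more likely use a substitution matrix such as BLOSUM62.
    """
    prev = [0] * (len(b) + 1)
    best = 0
    for i in range(1, len(a) + 1):
        curr = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            # Local alignment: a cell never drops below zero.
            curr[j] = max(0, diag, prev[j] + gap, curr[j - 1] + gap)
            best = max(best, curr[j])
        prev = curr
    return best


def pairwise_vector(seq, reference_set):
    """Encode a sequence as its similarity scores against every reference."""
    return [local_alignment_score(seq, ref) for ref in reference_set]


def binary_metrics(tp, fp, tn, fn):
    """Sensitivity, specificity, accuracy and MCC from confusion counts."""
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    accuracy = (tp + tn) / (tp + fp + tn + fn)
    denom = sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0
    return {
        "sensitivity": sensitivity,
        "specificity": specificity,
        "accuracy": accuracy,
        "mcc": mcc,
    }


if __name__ == "__main__":
    # Hypothetical short peptides, not real allergen sequences.
    references = ["MKVLA", "GAVLK"]
    print(pairwise_vector("MKVLA", references))
    # Hypothetical confusion counts, not the paper's results.
    print(binary_metrics(tp=8, fp=5, tn=85, fn=2))
```

Because every sequence is described by its similarity to the same reference set, the feature vectors have a fixed length regardless of protein length, which is what makes them usable as SVM input.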