High-performance signal peptide prediction based on sequence alignment techniques
Open Access
- 12 August 2008
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 24 (19), 2172-2176
- https://doi.org/10.1093/bioinformatics/btn422
Abstract
Summary: The accuracy of current signal peptide predictors is outstanding. The most successful predictors are based on neural networks and hidden Markov models, reaching a sensitivity of 99% and an accuracy of 95%. Here, we demonstrate that the popular BLASTP alignment tool can be tuned for signal peptide prediction, reaching the same high level of prediction success. Alignment-based techniques provide additional benefits. In spite of their high success rates, signal peptide predictors still yield false predictions. Simple sequences such as polyvaline, for example, are predicted as signal peptides. The general architecture of learning systems makes it difficult to trace the cause of such problems. False predictions of this kind can be recognized or avoided altogether by using sequence comparison techniques. Based on these results, we have implemented a public web service called Signal-BLAST. Predictions returned by Signal-BLAST are transparent and easy to analyze.
Availability: Signal-BLAST is available online at http://sigpep.services.came.sbg.ac.at/signalblast.html
Contact: sippl@came.sbg.ac.at
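The abstract does not describe how BLASTP was tuned, so the following is only a minimal sketch of the general idea behind alignment-based prediction: a query inherits the annotation of its best-scoring hit in a labeled reference set. It assumes BLASTP was run with the standard 12-column tabular output (`-outfmt 6`); the function names, the E-value cutoff of 1e-3, and the `reference_labels` mapping are hypothetical choices made for illustration and are not taken from Signal-BLAST.

```python
from typing import Dict, Tuple


def best_hits(blast_tabular: str, max_evalue: float = 1e-3) -> Dict[str, Tuple[str, float]]:
    """Map each query ID to (subject ID, bit score) of its top-scoring hit.

    Expects BLAST tabular text with the standard 12 tab-separated columns:
    qseqid sseqid pident length mismatch gapopen qstart qend sstart send
    evalue bitscore. Hits with an E-value above max_evalue are ignored.
    """
    best: Dict[str, Tuple[str, float]] = {}
    for line in blast_tabular.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comment lines
        fields = line.split("\t")
        query, subject = fields[0], fields[1]
        evalue, bitscore = float(fields[10]), float(fields[11])
        if evalue > max_evalue:
            continue
        if query not in best or bitscore > best[query][1]:
            best[query] = (subject, bitscore)
    return best


def predict_from_best_hit(
    blast_tabular: str,
    reference_labels: Dict[str, bool],
    max_evalue: float = 1e-3,
) -> Dict[str, Tuple[str, bool]]:
    """Transfer the signal-peptide label of the best hit to each query.

    reference_labels is a hypothetical mapping from reference sequence ID
    to True (has a signal peptide) or False. Queries without an acceptable
    hit, or whose best hit has no label, are omitted from the result.
    """
    predictions: Dict[str, Tuple[str, bool]] = {}
    for query, (subject, _score) in best_hits(blast_tabular, max_evalue).items():
        if subject in reference_labels:
            predictions[query] = (subject, reference_labels[subject])
    return predictions
```

Because each prediction carries the ID of the reference sequence it was derived from, the reason for any single call can be checked by inspecting that alignment, which is the kind of transparency the abstract attributes to alignment-based methods. A low-complexity query such as polyvaline would typically have no significant hit and would therefore receive no prediction in this sketch.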