Performance Evaluation of Existing De Novo Sequencing Algorithms
- 6 October 2006
- journal article
- research article
- Published by American Chemical Society (ACS) in Journal of Proteome Research
- Vol. 5 (11) , 3018-3028
- https://doi.org/10.1021/pr060222h
Abstract
Two methods have been developed for protein identification from tandem mass spectra: database searching and de novo sequencing. De novo sequencing identifies peptide directly from tandem mass spectra. Among many proposed algorithms, we evaluated the performance of the five de novo sequencing algorithms, AUDENS, Lutefisk, NovoHMM, PepNovo, and PEAKS. Our evaluation methods are based on calculation of relative sequence distance (RSD), algorithm sensitivity, and spectrum quality. We found that de novo sequencing algorithms have different performance in analyzing QSTAR and LCQ mass spectrometer data, but in general, perform better in analyzing QSTAR data than LCQ data. For the QSTAR data, the performance order of the five algorithms is PEAKS > Lutefisk, PepNovo > AUDENS, NovoHMM. The performance of PEAKS, Lutefisk, and PepNovo strongly depends on the spectrum quality and increases with an increase of spectrum quality. However, AUDENS and NovoHMM are not sensitive to the spectrum quality. Compared with other four algorithms, PEAKS has the best sensitivity and also has the best performance in the entire range of spectrum quality. For the LCQ data, the performance order is NovoHMM > PepNovo, PEAKS > Lutefisk > AUDENS. NovoHMM has the best sensitivity, and its performance is the best in the entire range of spectrum quality. But the overall performance of NovoHMM is not significantly different from the performance of PEAKS and PepNovo. AUDENS does not give a good performance in analyzing either QSTAR and LCQ data.Keywords
This publication has 27 references indexed in Scilit:
- Proteomics-Grade de Novo Sequencing ApproachJournal of Proteome Research, 2005
- NovoHMM: A Hidden Markov Model for de Novo Peptide SequencingAnalytical Chemistry, 2005
- An algorithm for interpretation of low‐energy collision‐induced dissociation product ion spectra for de novo sequencing of peptidesRapid Communications in Mass Spectrometry, 2005
- Prediction of post‐translational glycosylation and phosphorylation of proteins from the amino acid sequenceProteomics, 2004
- Intensity-based protein identification by machine learning from a library of tandem mass spectraNature Biotechnology, 2004
- New computational approaches for de novo peptide sequencing from MS/MS experimentsProceedings of the IEEE, 2002
- Probability-based protein identification by searching sequence databases using mass spectrometry dataElectrophoresis, 1999
- Application of Correlation Analysis Techniques to Mass Spectral DataApplied Spectroscopy Reviews, 1992
- Contributions of mass spectrometry to peptide and protein structureJournal of Mass Spectrometry, 1988
- Computer-aided peptide sequencing by fast atom bombardment mass spectrometryJournal of Mass Spectrometry, 1986