On de novo interpretation of tandem mass spectra for peptide identification
- 10 April 2003
- proceedings article
- Published by Association for Computing Machinery (ACM)
Abstract
The correct interpretation of tandem mass spectra is a difficult problem, even when it is limited to scoring peptides against a database. De novo sequencing is considerably harder, but critical when sequence databases are incomplete or not available. In this paper we build upon earlier work due to Dancik et al., and Chen et al. to provide a dynamic programming algorithm for interpreting de novo spectra. Our method can handle most of the commonly occurring ions, including a; b; y, and their neutral losses. Additionally, we shift the emphasis away from sequencing to assigning ion types to peaks. In particular, we introduce the notion of core interpretations, which allow us to give confidence values to individual peak assignments, even in the absence of a strong interpretation. Finally, we introduce a systematic approach to evaluating de novo algorithms as a function of spectral quality. We show that our algorithm, in particular the core-interpretation, is robust in the presence of measurement error, and low fragmentation probability.Keywords
This publication has 11 references indexed in Scilit:
- Algorithms for Identifying Protein Cross-Links via Tandem Mass SpectrometryJournal of Computational Biology, 2001
- Mutation-tolerant protein identification by mass-spectrometryPublished by Association for Computing Machinery (ACM) ,2000
- De NovoPeptide Sequencing via Tandem Mass SpectrometryJournal of Computational Biology, 1999
- Protein indentification using mass spectrometric informationElectrophoresis, 1998
- Sequence database searches viade novo peptide sequencing by tandem mass spectrometryRapid Communications in Mass Spectrometry, 1997
- Error-Tolerant Identification of Peptides in Sequence Databases by Peptide Sequence TagsAnalytical Chemistry, 1994
- An approach to correlate tandem mass spectral data of peptides with amino acid sequences in a protein databaseJournal of the American Society for Mass Spectrometry, 1994
- Fast algorithm for peptide sequencing by mass spectroscopyJournal of Mass Spectrometry, 1990
- Computer program (SEQPEP) to aid in the interpretation of high-energy collision tandem mass spectra of peptidesJournal of Mass Spectrometry, 1989
- Rapid and Sensitive Protein Similarity SearchesScience, 1985