Spectral Probabilities and Generating Functions of Tandem Mass Spectra: A Strike against Decoy Databases
Top Cited Papers
- 3 July 2008
- journal article
- research article
- Published by American Chemical Society (ACS) in Journal of Proteome Research
- Vol. 7 (8) , 3354-3363
- https://doi.org/10.1021/pr8001244
Abstract
A key problem in computational proteomics is distinguishing between correct and false peptide identifications. We argue that evaluating the error rates of peptide identifications is not unlike computing generating functions in combinatorics. We show that the generating functions and their derivatives (spectral energy and spectral probability) represent new features of tandem mass spectra that, similarly to Δ-scores, significantly improve peptide identifications. Furthermore, the spectral probability provides a rigorous solution to the problem of computing statistical significance of spectral identifications. The spectral energy/probability approach improves the sensitivity-specificity tradeoff of existing MS/MS search tools, addresses the notoriously difficult problem of “one-hit-wonders” in mass spectrometry, and often eliminates the need for decoy database searches. We therefore argue that the generating function approach has the potential to increase the number of peptide identifications in MS/MS searches.Keywords
This publication has 53 references indexed in Scilit:
- Assigning Significance to Peptides Identified by Tandem Mass Spectrometry Using Decoy DatabasesJournal of Proteome Research, 2007
- Proteomic Parsimony through Bipartite Graph Analysis Improves Accuracy and TransparencyJournal of Proteome Research, 2007
- Target-decoy search strategy for increased confidence in large-scale protein identifications by mass spectrometryNature Methods, 2007
- MyriMatch: Highly Accurate Tandem Mass Spectral Peptide Identification by Multivariate Hypergeometric AnalysisJournal of Proteome Research, 2007
- De Novo Peptide Identification via Tandem Mass Spectrometry and Integer Linear OptimizationAnalytical Chemistry, 2007
- De Novo Peptide Sequencing and Identification with Precision Mass SpectrometryJournal of Proteome Research, 2006
- PepHMM: A Hidden Markov Model Based Scoring Function for Mass Spectrometry Database SearchAnalytical Chemistry, 2005
- NovoHMM: A Hidden Markov Model for de Novo Peptide SequencingAnalytical Chemistry, 2005
- Empirical Statistical Model To Estimate the Accuracy of Peptide Identifications Made by MS/MS and Database SearchAnalytical Chemistry, 2002
- An approach to correlate tandem mass spectral data of peptides with amino acid sequences in a protein databaseJournal of the American Society for Mass Spectrometry, 1994