Predicting Intensity Ranks of Peptide Fragment Ions
- 3 March 2009
- journal article
- research article
- Published by American Chemical Society (ACS) in Journal of Proteome Research
- Vol. 8 (5) , 2226-2240
- https://doi.org/10.1021/pr800677f
Abstract
Accurate modeling of peptide fragmentation is necessary for the development of robust scoring functions for peptide−spectrum matches, which are the cornerstone of MS/MS-based identification algorithms. Unfortunately, peptide fragmentation is a complex process that can involve several competing chemical pathways, which makes it difficult to develop generative probabilistic models that describe it accurately. However, the vast amounts of MS/MS data being generated now make it possible to use data-driven machine learning methods to develop discriminative ranking-based models that predict the intensity ranks of a peptide’s fragment ions. We use simple sequence-based features that get combined by a boosting algorithm into models that make peak rank predictions with high accuracy. In an accompanying manuscript, we demonstrate how these prediction models are used to significantly improve the performance of peptide identification algorithms. The models can also be useful in the design of optimal multiple reaction monitoring (MRM) transitions, in cases where there is insufficient experimental data to guide the peak selection process. The prediction algorithm can also be run independently through PepNovo+, which is available for download from http://bix.ucsd.edu/Software/PepNovo.html.Keywords
This publication has 64 references indexed in Scilit:
- A Ranking-Based Scoring Function for Peptide−Spectrum MatchesJournal of Proteome Research, 2009
- Modeling peptide fragmentation with dynamic Bayesian networks for peptide identificationBioinformatics, 2008
- Whole proteome analysis of post-translational modifications: Applications of mass-spectrometry for proteogenomic annotationGenome Research, 2007
- Target-decoy search strategy for increased confidence in large-scale protein identifications by mass spectrometryNature Methods, 2007
- Improving gene annotation using peptide mass spectrometryGenome Research, 2006
- De Novo Peptide Sequencing and Identification with Precision Mass SpectrometryJournal of Proteome Research, 2006
- PepHMM: A Hidden Markov Model Based Scoring Function for Mass Spectrometry Database SearchAnalytical Chemistry, 2005
- Mass spectrometry-based proteomicsNature, 2003
- A Decision-Theoretic Generalization of On-Line Learning and an Application to BoostingJournal of Computer and System Sciences, 1997
- An approach to correlate tandem mass spectral data of peptides with amino acid sequences in a protein databaseJournal of the American Society for Mass Spectrometry, 1994