Lookup Peaks: A Hybrid of de Novo Sequencing and Database Search for Protein Identification by Tandem Mass Spectrometry
- 23 January 2007
- journal article
- research article
- Published by American Chemical Society (ACS) in Analytical Chemistry
- Vol. 79 (4) , 1393-1400
- https://doi.org/10.1021/ac0617013
Abstract
A powerful technique for peptide and protein identification is tandem mass spectrometry followed by database search using a program such as SEQUEST or Mascot. These programs, however, become slow and lose sensitivity when allowing nonspecific cleavages or peptide modifications. De novo sequencing and hybrid methods such as sequence tagging offer speed and robustness for wider searches, yet these approaches require better spectra with more complete and consecutive fragmentation and, hence, are less sensitive to low-abundance peptides. Here we describe a new hybrid method that retains the sensitivity of pure database search. The method uses a small amount of de novo analysis to identify likely b- and y-ion peaks“lookup peaks”that can then be used to extract candidate peptides from the database, with the number of candidates tunable to fit a computing budget. We describe a program called ByOnic that implements this method, and we benchmark ByOnic on several data sets, including one of mouse blood plasma spiked with low concentrations of recombinant human proteins. We demonstrate that ByOnic is more sensitive than sequence tagging and, indeed, more sensitive than the three most popular pure database search toolsSEQUEST, Mascot, and X!Tandemon both the peptide and protein levels. On the mouse plasma samples, ByOnic consistently found spiked proteins missed by the other tools.Keywords
This publication has 27 references indexed in Scilit:
- De Novo Analysis of Peptide Tandem Mass Spectra by Spectral Graph PartitioningJournal of Computational Biology, 2006
- A proteomic study of the HUPO Plasma Proteome Project's pilot samples using an accurate mass and time tag strategyProteomics, 2005
- Peptide Sequence Tags for Fast Database Search in Mass-SpectrometryJournal of Proteome Research, 2005
- Identification of Protein Modifications Using MS/MS de Novo Sequencing and the OpenSea Alignment AlgorithmJournal of Proteome Research, 2005
- Automated approach for quantitative analysis of complex peptide mixtures from tandem mass spectraNature Methods, 2004
- Automatic Quality Assessment of Peptide Tandem Mass SpectraBioinformatics, 2004
- Intensity-based protein identification by machine learning from a library of tandem mass spectraNature Biotechnology, 2004
- A method for reducing the time required to match protein sequences with tandem mass spectraRapid Communications in Mass Spectrometry, 2003
- Mutation-Tolerant Protein Identification by Mass SpectrometryJournal of Computational Biology, 2000
- Probability-based protein identification by searching sequence databases using mass spectrometry dataElectrophoresis, 1999