Database searching and accounting of multiplexed precursor and product ion spectra from the data independent analysis of simple and complex peptide mixtures

Top Cited Papers

17 March 2009

journal article
research article
Published by Wiley in Proteomics

Vol. 9 (6) , 1696-1719
https://doi.org/10.1002/pmic.200800564

Abstract

A novel database search algorithm is presented for the qualitative identification of proteins over a wide dynamic range, both in simple and complex biological samples. The algorithm has been designed for the analysis of data originating from data independent acquisitions, whereby multiple precursor ions are fragmented simultaneously. Measurements used by the algorithm include retention time, ion intensities, charge state, and accurate masses on both precursor and product ions from LC‐MS data. The search algorithm uses an iterative process whereby each iteration incrementally increases the selectivity, specificity, and sensitivity of the overall strategy. Increased specificity is obtained by utilizing a subset database search approach, whereby for each subsequent stage of the search, only those peptides from securely identified proteins are queried. Tentative peptide and protein identifications are ranked and scored by their relative correlation to a number of models of known and empirically derived physicochemical attributes of proteins and peptides. In addition, the algorithm utilizes decoy database techniques for automatically determining the false positive identification rates. The search algorithm has been tested by comparing the search results from a four‐protein mixture, the same four‐protein mixture spiked into a complex biological background, and a variety of other “system” type protein digest mixtures. The method was validated independently by data dependent methods, while concurrently relying on replication and selectivity. Comparisons were also performed with other commercially and publicly available peptide fragmentation search algorithms. The presented results demonstrate the ability to correctly identify peptides and proteins from data independent acquisition strategies with high sensitivity and specificity. They also illustrate a more comprehensive analysis of the samples studied; providing approximately 20% more protein identifications, compared to a more conventional data directed approach using the same identification criteria, with a concurrent increase in both sequence coverage and the number of modified peptides.

Keywords

This publication has 71 references indexed in Scilit:

The detection, correlation, and comparison of peptide precursor and product ions from data independent LC‐MS with data dependant LC‐MS/MS
Proteomics, 2009
In-Depth Proteomic Profiling of the Normal Human Kidney Glomerulus Using Two-Dimensional Protein Prefractionation in Combination with Liquid Chromatography-Tandem Mass Spectrometry
Journal of Proteome Research, 2007
High-Speed Data Reduction, Feature Detection, and MS/MS Spectrum Quality Assessment of Shotgun Proteomics Data Sets Using High-Resolution Mass Spectrometry
Analytical Chemistry, 2007
Target-decoy search strategy for increased confidence in large-scale protein identifications by mass spectrometry
Nature Methods, 2007
The utility of ETD mass spectrometry in proteomic analysis
Published by Elsevier ,2006
Improved Peptide Elution Time Prediction for Reversed-Phase Liquid Chromatography-MS by Incorporating Peptide Sequence Information
Analytical Chemistry, 2006
Open Source System for Analyzing, Validating, and Storing Protein Identification Data
Journal of Proteome Research, 2004
ProbSeq—a fragmentation model for interpretation of electrospray tandem mass spectrometry data
Comparative and Functional Genomics, 2004
Mass spectrometry-based proteomics
Nature, 2003
An approach to correlate tandem mass spectral data of peptides with amino acid sequences in a protein database
Journal of the American Society for Mass Spectrometry, 1994