Application of Peptide LC Retention Time Information in a Discriminant Function for Peptide Identification by Tandem Mass Spectrometry
- 9 July 2004
- journal article
- research article
- Published by American Chemical Society (ACS) in Journal of Proteome Research
- Vol. 3 (4), 760-769
- https://doi.org/10.1021/pr049965y
Abstract
We describe the application of a peptide retention time reversed phase liquid chromatography (RPLC) prediction model previously reported (Petritis et al. Anal. Chem. 2003, 75, 1039) for improved peptide identification. The model uses peptide sequence information to generate a theoretical (predicted) elution time that can be compared with the observed elution time. Using data from a set of known proteins, the retention time parameter was incorporated into a discriminant function for use with tandem mass spectrometry (MS/MS) data analyzed with the peptide/protein identification program SEQUEST. For singly charged ions, the number of confident identifications increased by 12% when the elution time metric is included compared to when mass spectral data is the sole source of information in the context of a Drosophila melanogaster database. A 3−4% improvement was obtained for doubly and triply charged ions for the same biological system. Application to the larger Rattus norvegicus (rat) and human proteome databases resulted in an 8−9% overall increase in the number of confident identifications, when both the discriminant function and elution time are used. The effect of adding “runner-up” hits (peptide matches that are not the highest scoring for a spectrum) from SEQUEST is also explored, and we find that the number of confident identifications is further increased by 1% when these hits are also considered. Finally, application of the discriminant functions derived in this work with ∼2.2 million spectra from over three hundred LC−MS/MS analyses of peptides from human plasma protein resulted in a 16% increase in confident peptide identifications (9022 vs 7779) using elution time information. Further improvements from the use of elution time information can be expected as both the experimental control of elution time reproducibility and the predictive capability are improved.
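The idea in the abstract, combining search scores with the gap between predicted and observed elution time in one discriminant function, can be illustrated with a minimal Python sketch. This is not the authors' function: the choice of features (SEQUEST XCorr and delta-Cn plus an elution-time error), the assumption that elution times are normalized to a 0..1 scale, and every weight below are hypothetical and chosen only for illustration. The last helper simply recomputes the relative increase from the two counts reported for the human plasma data.

```python
from dataclasses import dataclass


@dataclass
class PeptideMatch:
    xcorr: float           # SEQUEST cross-correlation score
    delta_cn: float        # SEQUEST delta-Cn score
    observed_time: float   # observed elution time (assumed normalized to 0..1)
    predicted_time: float  # elution time predicted from the peptide sequence


# Hypothetical weights for illustration only; they are NOT taken from the paper.
WEIGHTS = {"xcorr": 1.0, "delta_cn": 2.0, "time_error": -5.0}
BIAS = -2.0


def discriminant_score(match: PeptideMatch) -> float:
    """Linear discriminant: higher values suggest a more plausible match."""
    time_error = abs(match.observed_time - match.predicted_time)
    return (
        WEIGHTS["xcorr"] * match.xcorr
        + WEIGHTS["delta_cn"] * match.delta_cn
        + WEIGHTS["time_error"] * time_error
        + BIAS
    )


def relative_increase(with_time: int, without_time: int) -> float:
    """Fractional gain in confident identifications from adding elution time."""
    return (with_time - without_time) / without_time


if __name__ == "__main__":
    # Two matches with identical search scores but different elution-time agreement.
    close = PeptideMatch(xcorr=2.5, delta_cn=0.1, observed_time=0.40, predicted_time=0.42)
    far = PeptideMatch(xcorr=2.5, delta_cn=0.1, observed_time=0.40, predicted_time=0.80)
    print(discriminant_score(close) > discriminant_score(far))
    # Counts reported in the abstract for the human plasma analyses.
    print(f"{relative_increase(9022, 7779):.1%}")
```

With a negative weight on the elution-time error, two candidates with the same search scores are separated by how well their observed elution time agrees with the prediction, which is the extra information the abstract credits for the additional confident identifications.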
Keywords: bioinformatics • proteome • algorithm • accurate mass and time tag • multivariate statistics • capillary liquid-chromatography • retention time • FTICR