Peptide charge state determination for low-resolution tandem mass spectra
- 1 January 2005
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE) in Proceedings. IEEE Computational Systems Bioinformatics Conference
- p. 175-185
- https://doi.org/10.1109/csb.2005.44
Abstract
Mass spectrometry is a particularly useful technology for the rapid and robust identification of peptides and proteins in complex mixtures. Peptide sequences can be identified by correlating their observed tandem mass spectra (MS/MS) with theoretical spectra of peptides from a sequence database. Unfortunately, to perform this search the charge of the peptide must be known, and current charge-state-determination algorithms only discriminate singly-from multiply-charged spectra: distinguishing +2 from +3, for example, is unreliable. Thus, search software is forced to search multiply-charged spectra multiple times. To minimize this inefficiency, we present a support vector machine (SVM) that quickly and reliably classifies multiply-charged spectra as having either a +2 or +3 precursor peptide ion. By classifying multiply-charged spectra, we obtain a 40% reduction in search time while maintaining an average of 99% of peptide and 99% of protein identifications originally obtained from these spectra.Keywords
This publication has 13 references indexed in Scilit:
- Automatic Quality Assessment of Peptide Tandem Mass SpectraBioinformatics, 2004
- Improved peptide charge state assignmentProteomics, 2003
- Qscore: An algorithm for evaluating SEQUEST database search resultsJournal of the American Society for Mass Spectrometry, 2002
- Experimental Protein Mixture for Validating Tandem Mass Spectral AnalysisOMICS: A Journal of Integrative Biology, 2002
- Code Developments to Improve the Efficiency of Automated MS/MS Spectra InterpretationJournal of Proteome Research, 2002
- Protein Identification by SEQUESTPublished by Springer Nature ,2001
- De NovoPeptide Sequencing via Tandem Mass SpectrometryJournal of Computational Biology, 1999
- Direct Analysis and Identification of Proteins in Mixtures by LC/MS/MS and Database Searching at the Low-Femtomole LevelAnalytical Chemistry, 1997
- A training algorithm for optimal margin classifiersPublished by Association for Computing Machinery (ACM) ,1992
- Basic principles of ROC analysisSeminars in Nuclear Medicine, 1978