P-Mod: An Algorithm and Software To Map Modifications To Peptide Sequences Using Tandem MS Data
- 5 February 2005
- journal article
- Published by American Chemical Society (ACS) in Journal of Proteome Research
- Vol. 4 (2) , 358-368
- https://doi.org/10.1021/pr0498234
Abstract
The discovery of unanticipated protein modifications is one of the most challenging problems in proteomics. Whereas widely used algorithms such as Sequest and Mascot enable mapping of modifications when the mass and amino acid specificity are known, unexpected modifications cannot be identified with these tools. We have developed an algorithm and software called P-Mod, which enables discovery and sequence mapping of modifications to target proteins known to be represented in the analysis or identified by Sequest. P-Mod matches MS/MS spectra to peptide sequences in a search list. For spectra of modified peptides, P-Mod calculates mass differences between search peptide sequences and MS/MS precursors and localizes the mass shift to a sequence position in the peptide. Because modifications are detected as mass shifts, P-Mod does not require the user to guess at masses or sequence locations of modifications. P-Mod uses extreme value statistics to assign p value estimates to sequence-to-spectrum matches. The reported p values are scaled to account for the number of comparisons, so that error rates do not increase with the expanded search lists that result from incorporating potential peptide modifications. Combination of P-Mod searches from multiple LC-MS/MS analyses and multiple samples revealed previously unreported BSA modifications, including a novel decarboxymethylation or D-->G substitution at position 579 of the protein. P-Mod can serve a unique role in the identification of protein modifications both from exogenous and endogenous sources and may be useful for identifying modified protein forms as biomarkers for toxicity and disease processes.Keywords
This publication has 13 references indexed in Scilit:
- Chemical Modification of Proteins by Lipids in Diabetescclm, 2003
- Covalent Modification of Amino Acid Nucleophiles by the Lipid Peroxidation Products 4-Hydroxy-2-nonenal and 4-Oxo-2-nonenalChemical Research in Toxicology, 2002
- Error tolerant searching of uninterpreted tandem mass spectrometry dataProteomics, 2002
- Shotgun identification of protein modifications from protein complexes and lens tissueProceedings of the National Academy of Sciences, 2002
- Proteomic approaches to characterize protein modifications: new tools to study the effects of environmental exposures.Environmental Health Perspectives, 2002
- Peptide Sequence Motif Analysis of Tandem MS Data with the SALSA AlgorithmAnalytical Chemistry, 2001
- Probability-based protein identification by searching sequence databases using mass spectrometry dataElectrophoresis, 1999
- Mass spectrometry and the age of the proteomeJournal of Mass Spectrometry, 1998
- Issues in searching molecular sequence databasesNature Genetics, 1994
- Improved tools for biological sequence comparison.Proceedings of the National Academy of Sciences, 1988