Informatics Strategies for Large-Scale Novel Cross-Linking Analysis
- 3 August 2007
- journal article
- research article
- Published by American Chemical Society (ACS) in Journal of Proteome Research
- Vol. 6 (9) , 3412-3421
- https://doi.org/10.1021/pr070035z
Abstract
The detection of protein interactions in biological systems represents a significant challenge for today's technology. Chemical cross-linking provides the potential to impart new chemical bonds in a complex system that result in mass changes in a set of tryptic peptides detected by mass spectrometry. However, system complexity and cross-linking product heterogeneity have precluded widespread chemical cross-linking use for large-scale identification of protein−protein interactions. The development of mass spectrometry identifiable cross-linkers called protein interaction reporters (PIRs) has enabled on-cell chemical cross-linking experiments with product type differentiation. However, the complex datasets resultant from PIR experiments demand new informatics capabilities to allow interpretation. This manuscript details our efforts to develop such capabilities and describes the program X-links, which allows PIR product type differentiation. Furthermore, we also present the results from Monte Carlo simulation of PIR-type experiments to provide false discovery rate estimates for the PIR product type identification through observed precursor and released peptide masses. Our simulations also provide peptide identification calculations based on accurate masses and database complexity that can provide an estimation of false discovery rates for peptide identification. Overall, the calculations show a low rate of false discovery of PIR product types due to random mass matching of approximately 12% with 10 ppm mass measurement accuracy and spectral complexity resulting from 100 peptides. In addition, consideration of a reduced database resulting from stage 1 analysis of Shewanella oneidensis MR-1 containing 367 proteins resulted in a significant reduction of expected identification false discovery rate estimation compared to that from the entire Shewanella oneidensis MR-1 proteome.Keywords
This publication has 30 references indexed in Scilit:
- Profiling the Membrane Proteome of Shewanella oneidensis MR-1 with New Affinity Labeling ProbesJournal of Proteome Research, 2006
- Prediction of Error Associated with False-Positive Rate Determination for Peptide Identification in Large-Scale Proteomics Experiments Using a Combined Reverse and Forward Peptide Sequence Database StrategyJournal of Proteome Research, 2006
- Trade-Off between High Sensitivity and Increased Potential for False Positive Peptide Sequence Matches Using a Two-Dimensional Linear Ion Trap for Tandem Mass Spectrometry-Based ProteomicsJournal of Proteome Research, 2006
- Pro-CrossLink. Software Tool for Protein Cross-Linking and Mass SpectrometryAnalytical Chemistry, 2006
- Potential for False Positive Identifications from Large Databases through Tandem Mass SpectrometryJournal of Proteome Research, 2004
- A modified internal lock‐mass method for calibration of the product ions derived from sustained off‐resonance irradiation collision‐induced dissociation using a Fourier transform mass spectrometerRapid Communications in Mass Spectrometry, 2004
- Statistical Models for Protein Validation Using Tandem Mass Spectral Data and Protein Amino Acid Sequence DatabasesAnalytical Chemistry, 2004
- Algorithms for Identifying Protein Cross-Links via Tandem Mass SpectrometryJournal of Computational Biology, 2001
- Characterization of Cytochrome c Variants with High-Resolution FTICR Mass Spectrometry: Correlation of Fragmentation and StructureAnalytical Chemistry, 1995
- Interaction between photosystem I and ferredoxin. Identification by chemical cross-linking of the polypeptide which binds ferredoxinEuropean Journal of Biochemistry, 1987