A graph-theoretic approach for the separation of b and y ions in tandem mass spectra
Open Access
- 28 September 2004
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 21 (5) , 563-574
- https://doi.org/10.1093/bioinformatics/bti044
Abstract
Motivation: Ion-type identification is a fundamental problem in computational proteomics. Methods for accurate identification of ion types provide the basis for many mass spectrometry data interpretation problems, including (a) de novo sequencing, (b) identification of post-translational modifications and mutations and (c) validation of database search results.
Results: Here, we present a novel graph-theoretic approach for solving the problem of separating b ions from y ions in a set of tandem mass spectra. We represent each spectral peak as a node and consider two types of edges: type-1 edges, which connect two peaks that are probably of the same ion type, and type-2 edges, which connect two peaks that are probably of different ion types. The ion-separation problem is formulated and solved as a graph partition problem: partition the graph into three subgraphs, representing b ions, y ions and other ions, respectively, by maximizing the total weight of type-1 edges while minimizing the total weight of type-2 edges within each partitioned subgraph. We have developed a dynamic programming algorithm for rigorously solving this graph partition problem and implemented it as a computer program, PRIME (PaRtition of Ion types in tandem Mass spEctra). Tests on a large number of simulated mass spectra and 19 sets of high-quality experimental Fourier transform ion cyclotron resonance tandem mass spectra indicate that an accuracy level of ∼90% was achieved for the separation of b and y ions.
Availability: The executable code of PRIME is available upon request.
Contact: xyn@bmb.uga.edu
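The partition objective described in the abstract can be illustrated with a minimal sketch. This is not the PRIME dynamic programming algorithm, whose details are not given here; it is an exhaustive search over a toy graph, and the function names, the edge weights and the way the two edge types are combined into a single score (within-group type-1 weight minus within-group type-2 weight) are all assumptions made for illustration.

```python
from itertools import product

# The three groups named in the abstract.
LABELS = ("b", "y", "other")


def partition_score(labels, type1, type2):
    """Assumed objective: reward type-1 edges and penalize type-2 edges
    whose two endpoints fall in the same group."""
    score = 0.0
    for (u, v), w in type1.items():
        if labels[u] == labels[v]:
            score += w
    for (u, v), w in type2.items():
        if labels[u] == labels[v]:
            score -= w
    return score


def best_partition(n_peaks, type1, type2):
    """Brute-force search over all 3**n_peaks labelings (toy sizes only)."""
    best_labels, best_score = None, float("-inf")
    for labels in product(LABELS, repeat=n_peaks):
        s = partition_score(labels, type1, type2)
        if s > best_score:
            best_labels, best_score = labels, s
    return best_labels, best_score


# Hypothetical toy graph: six peaks, edges keyed by (peak, peak) with weights.
type1 = {(0, 1): 1.0, (1, 2): 1.0, (3, 4): 1.0}  # probably the same ion type
type2 = {(0, 4): 1.0, (1, 3): 1.0}               # probably different ion types

labels, score = best_partition(6, type1, type2)
print(labels, score)
```

In this toy objective the b and y labels are interchangeable, so the search only groups peaks; deciding which group holds the b ions would need additional information that the sketch does not model. The exhaustive search is exponential in the number of peaks and is meant only to make the objective concrete.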