MassMatrix: A database search program for rapid characterization of proteins and peptides from tandem mass spectrometry data
- 17 March 2009
- journal article
- research article
- Published by Wiley in Proteomics
- Vol. 9 (6) , 1548-1555
- https://doi.org/10.1002/pmic.200700322
Abstract
MassMatrix is a program that matches tandem mass spectra with theoretical peptide sequences derived from a protein database. The program uses a mass-accuracy-sensitive probabilistic score model to rank peptide matches. The MS/MS search software was evaluated using a high-mass-accuracy dataset, and its results were compared with those from MASCOT, SEQUEST, X!Tandem, and OMSSA. For the high-mass-accuracy data, MassMatrix provided better sensitivity than MASCOT, SEQUEST, X!Tandem, and OMSSA for a given specificity, and the percentage of false positives was 2%. More importantly, all manually validated true positives corresponded to a unique peptide/spectrum match. The presence of decoy sequences and additional variable PTMs did not significantly affect the results from the high-mass-accuracy search. MassMatrix performs well when compared with MASCOT, SEQUEST, X!Tandem, and OMSSA with regard to search time. MassMatrix was also run on a distributed-memory cluster and achieved search speeds of ∼100 000 spectra per hour when searching against a complete human database with eight variable modifications. The algorithm is available for public searches at http://www.massmatrix.net.
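As a rough illustration of what matching a tandem mass spectrum against a theoretical peptide involves, the sketch below computes singly charged b- and y-ion m/z values for a candidate sequence and counts how many fall within a ppm tolerance of the observed peaks. It is not the MassMatrix scoring model, which the abstract describes only as probabilistic and sensitive to mass accuracy. The function names `fragment_mz` and `count_matches`, the 10 ppm default, and the plain match count are assumptions made for this example. The residue masses are standard monoisotopic values.

```python
# Standard monoisotopic residue masses (Da) for the 20 common amino acids.
MONO = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "T": 101.04768, "C": 103.00919, "L": 113.08406,
    "I": 113.08406, "N": 114.04293, "D": 115.02694, "Q": 128.05858,
    "K": 128.09496, "E": 129.04259, "M": 131.04049, "H": 137.05891,
    "F": 147.06841, "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}
WATER = 18.010565   # monoisotopic mass of H2O
PROTON = 1.007276   # mass of a proton


def fragment_mz(peptide):
    """Return singly charged b-ion then y-ion m/z values for a peptide."""
    masses = [MONO[aa] for aa in peptide]
    b_ions, y_ions = [], []
    running = 0.0
    for m in masses[:-1]:            # b1 .. b(n-1): N-terminal prefixes
        running += m
        b_ions.append(running + PROTON)
    running = 0.0
    for m in reversed(masses[1:]):   # y1 .. y(n-1): C-terminal suffixes
        running += m
        y_ions.append(running + WATER + PROTON)
    return b_ions + y_ions


def count_matches(peptide, peaks, tol_ppm=10.0):
    """Count theoretical fragments with an observed peak within tol_ppm.

    A plain count is a hypothetical stand-in for a real score; it only
    shows how a tighter tolerance makes each match more selective.
    """
    matched = 0
    for mz in fragment_mz(peptide):
        tol = mz * tol_ppm * 1e-6
        if any(abs(p - mz) <= tol for p in peaks):
            matched += 1
    return matched


if __name__ == "__main__":
    observed = fragment_mz("GAS")    # idealized spectrum for the peptide GAS
    for candidate in ("GAS", "GSA", "AGS"):
        print(candidate, count_matches(candidate, observed))
```

Candidates from a protein database could then be ranked by such a count, and a narrower tolerance reduces the chance that an incorrect candidate matches a peak by coincidence, which is the intuition behind using high-mass-accuracy data.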