A Method for Assessing the Statistical Significance of Mass Spectrometry-Based Protein Identifications Using General Scoring Schemes
Top Cited Papers
- 10 January 2003
- journal article
- research article
- Published by American Chemical Society (ACS) in Analytical Chemistry
- Vol. 75 (4) , 768-774
- https://doi.org/10.1021/ac0258709
Abstract
This paper investigates the use of survival functions and expectation values to evaluate the results of protein identification experiments. These functions are standard statistical measures that can be used to reduce various protein identification scoring schemes to a common, easily interpretably representation. The relative merits of scoring systems were explored using this approach, as well as the effects of altering primary identification parameters. We would advocate the widespread use of these simple statistical measures to simplify and standardize the reporting of the confidence of protein identification results, allowing the users of different identification algorithms to compare their results in a straightforward and statistically significant manner. A method is described for measuring these distributions using information that is being discarded by most protein identification search engines, resulting in accurate survival functions that are specific to any combination of scoring algorithms, sequence databases, and mass spectra.Keywords
This publication has 11 references indexed in Scilit:
- Getting More from LessMolecular & Cellular Proteomics, 2002
- What does it mean to identify a protein in proteomics?Trends in Biochemical Sciences, 2002
- Functional organization of the yeast proteome by systematic analysis of protein complexesNature, 2002
- Directed Proteomic Analysis of the Human NucleolusCurrent Biology, 2002
- Alternative nucleotide incision repair pathway for oxidative DNA damageNature, 2002
- Peptide Sequence Motif Analysis of Tandem MS Data with the SALSA AlgorithmAnalytical Chemistry, 2001
- Charting the Proteomes of Organisms with Unsequenced Genomes by MALDI-Quadrupole Time-of-Flight Mass Spectrometry and BLAST Homology SearchingAnalytical Chemistry, 2001
- Mass Spectrometry in ProteomicsChemical Reviews, 2001
- Error-Tolerant Identification of Peptides in Sequence Databases by Peptide Sequence TagsAnalytical Chemistry, 1994
- Methods for assessing the statistical significance of molecular sequence features by using general scoring schemes.Proceedings of the National Academy of Sciences, 1990