A Model for Random Sampling and Estimation of Relative Protein Abundance in Shotgun Proteomics
Top Cited Papers
- 2 June 2004
- journal article
- research article
- Published by American Chemical Society (ACS) in Analytical Chemistry
- Vol. 76 (14) , 4193-4201
- https://doi.org/10.1021/ac0498563
Abstract
Proteomic analysis of complex protein mixtures using proteolytic digestion and liquid chromatography in combination with tandem mass spectrometry is a standard approach in biological studies. Data-dependent acquisition is used to automatically acquire tandem mass spectra of peptides eluting into the mass spectrometer. In more complicated mixtures, for example, whole cell lysates, data-dependent acquisition incompletely samples among the peptide ions present rather than acquiring tandem mass spectra for all ions available. We analyzed the sampling process and developed a statistical model to accurately predict the level of sampling expected for mixtures of a specific complexity. The model also predicts how many analyses are required for saturated sampling of a complex protein mixture. For a yeast-soluble cell lysate 10 analyses are required to reach a 95% saturation level on protein identifications based on our model. The statistical model also suggests a relationship between the level of sampling observed for a protein and the relative abundance of the protein in the mixture. We demonstrate a linear dynamic range over 2 orders of magnitude by using the number of spectra (spectral sampling) acquired for each protein.Keywords
This publication has 24 references indexed in Scilit:
- Proteomic characterization of the human centrosome by protein correlation profilingNature, 2003
- Nuclear Membrane Proteins with Potential Disease Links Found by Subtractive ProteomicsScience, 2003
- Proteome analyses using accurate mass and elution time peptide tags with capillary LC time-of-flight mass spectrometryJournal of the American Society for Mass Spectrometry, 2003
- A proteomics approach to understanding protein ubiquitinationNature Biotechnology, 2003
- Analysis of the Plasmodium falciparum proteome by high-accuracy mass spectrometryNature, 2002
- A proteomic view of the Plasmodium falciparum life cycleNature, 2002
- Phosphoproteome analysis by mass spectrometry and its application to Saccharomyces cerevisiaeNature Biotechnology, 2002
- Integrated Genomic and Proteomic Analyses of a Systematically Perturbed Metabolic NetworkScience, 2001
- Direct Analysis and Identification of Proteins in Mixtures by LC/MS/MS and Database Searching at the Low-Femtomole LevelAnalytical Chemistry, 1997
- Serial Analysis of Gene ExpressionScience, 1995