Deterministic protein inference for shotgun proteomics data provides new insights into Arabidopsis pollen development and function
- 22 June 2009
- journal article
- research article
- Published by Cold Spring Harbor Laboratory in Genome Research
- Vol. 19 (10) , 1786-1800
- https://doi.org/10.1101/gr.089060.108
Abstract
Pollen, the male gametophyte of flowering plants, represents an ideal biological system to study developmental processes, such as cell polarity, tip growth, and morphogenesis. Upon hydration, the metabolically quiescent pollen rapidly switches to an active state, exhibiting extremely fast growth. This rapid switch requires relevant proteins to be stored in the mature pollen, where they have to retain functionality in a desiccated environment. Using a shotgun proteomics approach, we unambiguously identified ∼3500 proteins in Arabidopsis pollen, including 537 proteins that were not identified in genetic or transcriptomic studies. To generate this comprehensive reference data set, which extends the previously reported pollen proteome by a factor of 13, we developed a novel deterministic peptide classification scheme for protein inference. This generally applicable approach considers the gene model–protein sequence–protein accession relationships. It allowed us to classify and eliminate ambiguities inherently associated with any shotgun proteomics data set, to report a conservative list of protein identifications, and to seamlessly integrate data from previous transcriptomics studies. Manual validation of proteins unambiguously identified by a single, information-rich peptide enabled us to significantly reduce the false discovery rate, while keeping valuable identifications of shorter and lower abundant proteins. Bioinformatic analyses revealed a higher stability of pollen proteins compared to those of other tissues and implied a protein family of previously unknown function in vesicle trafficking. Interestingly, the pollen proteome is most similar to that of seeds, indicating physiological similarities between these developmentally distinct tissues.This publication has 82 references indexed in Scilit:
- Comprehensive mass-spectrometry-based proteome quantification of haploid versus diploid yeastNature, 2008
- A distinct mechanism regulating a pollen-specific guanine nucleotide exchange factor for the small GTPase Rop in Arabidopsis thalianaProceedings of the National Academy of Sciences, 2007
- Proteomic Parsimony through Bipartite Graph Analysis Improves Accuracy and TransparencyJournal of Proteome Research, 2007
- Target-decoy search strategy for increased confidence in large-scale protein identifications by mass spectrometryNature Methods, 2007
- Computational prediction of proteotypic peptides for quantitative proteomicsNature Biotechnology, 2006
- Quantification of protein half-lives in the budding yeast proteomeProceedings of the National Academy of Sciences, 2006
- Scoring proteomes with proteotypic peptide probesNature Reviews Molecular Cell Biology, 2005
- A gene expression map of Arabidopsis thaliana developmentNature Genetics, 2005
- The FunCat, a functional annotation scheme for systematic classification of proteins from whole genomesNucleic Acids Research, 2004
- Empirical Statistical Model To Estimate the Accuracy of Peptide Identifications Made by MS/MS and Database SearchAnalytical Chemistry, 2002