Comparative proteogenomics: Combining mass spectrometry and comparative genomics to analyze multiple genomes
- 21 April 2008
- journal article
- research article
- Published by Cold Spring Harbor Laboratory in Genome Research
- Vol. 18 (7) , 1133-1142
- https://doi.org/10.1101/gr.074344.107
Abstract
Recent proliferation of low-cost DNA sequencing techniques will soon lead to an explosive growth in the number of sequenced genomes and will turn manual annotations into a luxury. Mass spectrometry recently emerged as a valuable technique for proteogenomic annotations that improves on the state-of-the-art in predicting genes and other features. However, previous proteogenomic approaches were limited to a single genome and did not take advantage of analyzing mass spectrometry data from multiple genomes at once. We show that such a comparative proteogenomics approach (like comparative genomics) allows one to address the problems that remained beyond the reach of the traditional “single proteome” approach in mass spectrometry. In particular, we show how comparative proteogenomics addresses the notoriously difficult problem of “one-hit-wonders” in proteomics, improves on the existing gene prediction tools in genomics, and allows identification of rare post-translational modifications. We therefore argue that complementing DNA sequencing projects by comparative proteogenomics projects can be a viable approach to improve both genomic and proteomic annotations.Keywords
This publication has 58 references indexed in Scilit:
- Distinguishing protein-coding and noncoding genes in the human genomeProceedings of the National Academy of Sciences, 2007
- Computational prediction of proteotypic peptides for quantitative proteomicsNature Biotechnology, 2006
- Age-Related Changes in Human Crystallins Determined from Comparative Analysis of Post-translational Modifications in Young and Aged Lens: Does Deamidation Contribute to Crystallin Insolubility?Journal of Proteome Research, 2006
- Scoring proteomes with proteotypic peptide probesNature Reviews Molecular Cell Biology, 2005
- Systematic discovery of regulatory motifs in human promoters and 3′ UTRs by comparison of several mammalsNature, 2005
- MUSCLE: multiple sequence alignment with high accuracy and high throughputNucleic Acids Research, 2004
- Proteogenomic mapping as a complementary method to perform genome annotationProteomics, 2004
- Multiple sequence alignment with the Clustal series of programsNucleic Acids Research, 2003
- Sequencing and comparison of yeast species to identify genes and regulatory elementsNature, 2003
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programsNucleic Acids Research, 1997