Ortho-proteogenomics: Multiple proteomes investigation through orthology and a new MS-based protocol
- 27 October 2008
- journal article
- research article
- Published by Cold Spring Harbor Laboratory in Genome Research
- Vol. 19 (1) , 128-135
- https://doi.org/10.1101/gr.081901.108
Abstract
The progress in sequencing technologies irrigates biology with an ever-increasing number of genome sequences. In most cases, the gene repertoire is predicted in silico and conceptually translated into proteins. As recently highlighted, the predicted genes exhibit frequent errors, particularly in start codons, with a serious impact on subsequent biological studies. A new “ortho-proteogenomic” approach is presented here for the annotation refinement of multiple genomes at once. It combines comparative genomics with an original proteomic protocol that allows the characterization of both N-terminal and internal peptides in a single experiment. This strategy was applied to the Mycobacterium genus with Mycobacterium smegmatis as the reference, and identified 946 distinct proteins, including 443 characterized N termini. These experimental data allowed the correction of 19% of the characterized start codons, the identification of 29 proteins missed during the annotation process, and the curation, thanks to comparative genomics, of 4328 sequences of 16 other Mycobacterium proteomes.This publication has 41 references indexed in Scilit:
- Comparative proteogenomics: Combining mass spectrometry and comparative genomics to analyze multiple genomesGenome Research, 2008
- Improving gene annotation using peptide mass spectrometryGenome Research, 2006
- Re-annotation of the genome sequence of Mycobacterium tuberculosis H37RvMicrobiology, 2002
- Parallel Identification of New Genes in Saccharomyces cerevisiaeGenome Research, 2002
- Mass spectrometry allows direct identification of proteins in large genomesProteomics, 2001
- Interrogating the human genome using uninterpreted mass spectrometry dataProteomics, 2001
- Mass spectrometric evidence for mechanisms of fragmentation of charge-derivatized peptidesJournal of the American Society for Mass Spectrometry, 2001
- Genomics and Bacterial PathogenesisEmerging Infectious Diseases, 2000
- Operons in Escherichia coli : Genomic analyses and predictionsProceedings of the National Academy of Sciences, 2000
- Investigation of the tris(trimethoxyphenyl)phosphonium acetyl charged derivatives of peptides by electrospray ionization mass spectrometry and tandem mass spectrometryJournal of the American Society for Mass Spectrometry, 2000