Analysis of the cytosolic proteome of Halobacterium salinarum and its implication for genome annotation
- 25 January 2005
- journal article
- research article
- Published by Wiley in Proteomics
- Vol. 5 (1) , 168-179
- https://doi.org/10.1002/pmic.200400910
Abstract
The halophilic archaeon Halobacterium salinarum (strain R1, DSM 671) contains 2784 protein‐coding genes as derived from the genome sequence. The cytosolic proteome containing 2042 proteins was separated by two‐dimensional gel electrophoresis (2‐DE) and systematically analyzed by a semi‐automatic procedure. A reference map was established taking into account the narrow isoelectric point (pI) distribution of halophilic proteins between 3.5 and 5.5. Proteins were separated on overlapping gels covering the essential areas of pI and molecular weight. Every silver‐stained spot was analyzed resulting in 661 identified proteins out of about 1800 different protein spots using matrix‐assisted laser desorption/ionization time of flight mass spectrometry (MALDI‐TOF MS) peptide mass fingerprinting (PMF). There were 94 proteins that were found in multiple spots, indicating post‐translational modification. An additional 141 soluble proteins were identified on 2‐D gels not corresponding to the reference map. Thus about 40% of the cytosolic proteome was identified. In addition to the 2784 protein‐coding genes, the H. salinarum genome contains more than 6000 spurious open reading frames longer than 100 codons. Proteomic information permitted an improvement in genome annotation by validating and correcting gene assignments. The correlation between theoretical pI and gel position is exceedingly good and was used as a tool to improve start codon assignments. The fraction of identified chromosomal proteins was much higher than that of those encoded on the plasmids. In combination with analysis of the GC content this observation permitted an unambiguous identification of an episomal insert of 60 kbp (“AT‐rich island”) in the chromosome, as well as a 70 kbp region from the chromosome that has integrated into one of the megaplasmids and carries a series of essential genes. About 63% of the chromosomally encoded proteins larger than 25 kDa were identified, proving the efficacy of 2‐DE MALDI‐TOF MS PMF technology. The analysis of the integral membrane proteome by tandem mass spectrometric techniques added another 141 identified proteins not identified by the 2‐DE approach (see following paper).Keywords
This publication has 42 references indexed in Scilit:
- Improvement of the two‐dimensional gel electrophoresis analysis for the proteome study of Halobacterium salinarumProteomics, 2003
- Proteomic Analysis of an Extreme Halophilic Archaeon, Halobacterium sp. NRC-1Molecular & Cellular Proteomics, 2003
- Predicting transmembrane protein topology with a hidden markov model: application to complete genomes11Edited by F. CohenJournal of Molecular Biology, 2001
- Probability-based protein identification by searching sequence databases using mass spectrometry dataElectrophoresis, 1999
- Improved microbial gene identification with GLIMMERNucleic Acids Research, 1999
- R: A Language for Data Analysis and GraphicsJournal of Computational and Graphical Statistics, 1996
- The ecology of the extremely halophilic archaeaFEMS Microbiology Reviews, 1994
- Rapid identification of proteins by peptide-mass fingerprintingCurrent Biology, 1993
- Anaerobic growth of halophilic archaeobacteria by reduction of dimethysulfoxide and trimethylamineN-oxideFEMS Microbiology Letters, 1990
- Light‐induced changes of the pH gradient and the membrane potential in H. halobiumFEBS Letters, 1976