Identifying the major proteome components of Haemophilus influenzae type‐strain NCTC 8143

1 January 1997

journal article
proteome analysis
Published by Wiley in Electrophoresis

Vol. 18 (8) , 1314-1334
https://doi.org/10.1002/elps.1150180808

Abstract

With the completion of the Haemophilus influenzae Rd genomic sequence, we know the identity of most of the theoretical proteins in the proteome of this bacterium. However, the most abundant components of the actual proteome are unknown. Using mass spectrometry and two‐dimensional gel electrophoresis (2‐DE), we sequenced and analyzed the most abundant proteins observed in the ATCC reference strain of H. influenzae, NCTC 8143 (303 of ≈ 400 Coomassie‐stained 2‐DE spots). To automate the identification of 2‐DE spots, we coupled a liquid autosampler to a microcolumn liquid chromatography electrospray ionization tandem mass spectrometer capable of identifying 22 spots per day. From the 303 sequenced spots, we identified 263 unique proteins. Most of the abundant proteins lie in an isoelectric point range of pH 4–7 and a molecular mass range of 10–100 kDa. Of the observed proteins, the most abundant is the outer membrane protein P2. Based on variety and abundance, proteins involved in energy metabolism and macromolecular synthesis are the dominant classes of proteins. Unexpectedly, tryptophanase, whose sequence is not present in the genome of the Rd strain, was identified as a highly abundant protein in strain NCTC 8143. By searching the tandem mass spectra against the translated genomic sequence, we identified several proteins that were not annotated in the genomic sequence. Surprisingly, 22% of the identified 2‐DE spots represent isoforms in which gene products with the same primary sequence have different observed pI and M_r, indicating that these proteins are post‐translationally processed. Although the predicted and observed isoelectric points and molecular masses of most proteins show reasonable concordance, the observed values for several proteins deviate significantly from the predicted values. These anomalies may represent either highly processed proteins or misinterpretations of the genomic sequence. Using the technology developed in this project, the protein expression of other strains of H. influenzae grown under different environmental conditions can be compared to identify differences in their proteomes.
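
The isoform bookkeeping described in the abstract (several 2‐DE spots mapping to one gene product) can be illustrated with a minimal sketch. It is not the authors' software. The function name `summarize_isoforms`, the spot labels, and the gene names are hypothetical placeholders, and the way isoform spots are counted here (every spot whose gene appears in more than one spot) is an assumption that may differ from the definition behind the reported 22%.

```python
from collections import defaultdict


def summarize_isoforms(spot_to_gene):
    """Group identified 2-DE spots by gene and count multi-spot gene products.

    spot_to_gene: dict mapping a spot label to the gene it was identified as.
    Assumption: a spot counts as an isoform spot when its gene was identified
    in at least one other spot.
    """
    spots_by_gene = defaultdict(list)
    for spot, gene in spot_to_gene.items():
        spots_by_gene[gene].append(spot)

    # Genes seen in more than one spot are candidate isoform sets.
    multi = {g: sorted(s) for g, s in spots_by_gene.items() if len(s) > 1}
    isoform_spots = sum(len(s) for s in multi.values())
    total = len(spot_to_gene)

    return {
        "spots": total,
        "unique_proteins": len(spots_by_gene),
        "isoform_spots": isoform_spots,
        "isoform_fraction": isoform_spots / total if total else 0.0,
        "genes_with_isoforms": multi,
    }


# Hypothetical toy input, not data from the paper.
example = {
    "s1": "geneA",
    "s2": "geneA",
    "s3": "geneB",
    "s4": "geneC",
    "s5": "geneC",
    "s6": "geneD",
}
summary = summarize_isoforms(example)
```

In this toy input, six spots collapse to four unique gene products, and four of the six spots belong to genes seen more than once.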
