Identifying the major proteome components of Haemophilus influenzae type‐strain NCTC 8143
- 1 January 1997
- journal article
- proteome analysis
- Published by Wiley in Electrophoresis
- Vol. 18 (8) , 1314-1334
- https://doi.org/10.1002/elps.1150180808
Abstract
With the completion of the Haemophilus influenzae Rd genomic sequence, we know the identity of most of the theoretical proteins in the proteome of this bacterium. However, the most abundant components of the actual proteome are unknown. Using mass spectrometry and two‐dimensional gel electrophoresis (2‐DE), we sequenced and analyzed the most abundant proteins observed in the ATCC reference strain of H. influenzae, NCTC 8143 (303 of ≈︁ 400 Coomassie‐stained 2‐DE spots). To automate the identification of 2‐DE spots, we coupled a liquid autosampler to a microcolumn liquid chromatography electrospray ionization tandem mass spectrometer capable of identifying 22 spots per day. From the 303 sequenced spots, we identified 263 unique proteins. Most of the abundant proteins lie in an isoelectric point range of pH 4–7 and a molecular mass range of 10–100 kDa. Of the observed proteins, the most abundant is the outer membrane protein P2. Based on variety and abundance, proteins involved in energy metabolism and macromolecular synthesis are the dominant classes of proteins. Unexpectedly, tryptophanase was identified as a highly abundant protein in the strain NCTC 8143 whose sequence is rot present in the genome of the Rd strain. By searching the tandem mass spectra against the translated genomic sequence, we identified several proteins which were not annotated in the genomic sequence. Surprisingly, 22% of the identified 2‐DE spots represent isoforms in which gene products with the same primary sequence have different observed pI and Mr, indicating that these proteins are post‐translationally processed. Although most proteins' predicted and observed isoelectric points and molecular masses show reasonable concordance, the observed values for several proteins deviate significantly from the predicted values. These anomalies may represent either highly processed proteins or misinterpretations of the genomic sequence. Using the technology developed in this project, the protein expression of other strains of H. influenzae grown under different environmental conditions can be compared to identify differences in their proteomes.Keywords
This publication has 30 references indexed in Scilit:
- Rapid Protein Identification Using N-Terminal “Sequence Tag” and Amino Acid AnalysisBiochemical and Biophysical Research Communications, 1996
- Metabolism and evolution of Haemophilus influenzae deduced from a whole-genome comparison with Escherichia coliCurrent Biology, 1996
- Whole-Genome Random Sequencing and Assembly of Haemophilus influenzae RdScience, 1995
- An approach to correlate tandem mass spectral data of peptides with amino acid sequences in a protein databaseJournal of the American Society for Mass Spectrometry, 1994
- Prevention of Haemophilus influenzae type b diseaseVaccine, 1993
- Micropreparative two‐dimensional electrophoresis allowing the separation of samples containing milligram amounts of proteinsElectrophoresis, 1993
- Approved Lists of Bacterial NamesInternational Journal of Systematic and Evolutionary Microbiology, 1980
- High resolution two-dimensional electrophoresis of basic as well as acidic proteinsCell, 1977
- A Taxonomic Study of the Genus Haemophilus, with the Proposal of a New SpeciesJournal of General Microbiology, 1976
- Cleavage of Structural Proteins during the Assembly of the Head of Bacteriophage T4Nature, 1970