Probing genomic diversity and evolution of Escherichia coli O157 by single nucleotide polymorphisms
- 10 April 2006
- journal article
- research article
- Published by Cold Spring Harbor Laboratory in Genome Research
- Vol. 16 (6) , 757-767
- https://doi.org/10.1101/gr.4759706
Abstract
Infections by Shiga toxin-producing Escherichia coli O157:H7 (STEC O157) are the predominant cause of bloody diarrhea and hemolytic uremic syndrome in the United States. In silico comparison of the two complete STEC O157 genomes (Sakai and EDL933) revealed a strikingly high level of sequence identity in orthologous protein-coding genes, limiting the use of nucleotide sequences to study the evolution and epidemiology of this bacterial pathogen. To systematically examine single nucleotide polymorphisms (SNPs) at a genome scale, we designed comparative genome sequencing microarrays and analyzed 1199 chromosomal genes (a total of 1,167,948 bp) and 92,721 bp of the large virulence plasmid (pO157) of eleven outbreak-associated STEC O157 strains. We discovered 906 SNPs in 523 chromosomal genes and observed a high level of DNA polymorphisms among the pO157 plasmids. Based on a uniform rate of synonymous substitution for Escherichia coli and Salmonella enterica (4.7 × 10−9 per site per year), we estimate that the most recent common ancestor of the contemporary β-glucuronidase-negative, non-sorbitolfermenting STEC O157 strains existed ca. 40 thousand years ago. The phylogeny of the STEC O157 strains based on the informative synonymous SNPs was compared to the maximum parsimony trees inferred from pulsed-field gel electrophoresis and multilocus variable numbers of tandem repeats analysis. The topological discrepancies indicate that, in contrast to the synonymous mutations, parts of STEC O157 genomes have evolved through different mechanisms with highly variable divergence rates. The SNP loci reported here will provide useful genetic markers for developing high-throughput methods for fine-resolution genotyping of STEC O157. Functional characterization of nucleotide polymorphisms should shed new insights on the evolution, epidemiology, and pathogenesis of STEC O157 and related pathogens.Keywords
This publication has 74 references indexed in Scilit:
- Acquisition ofstcE,a C1 Esterase Inhibitor–Specific Metalloprotease, during the Evolution ofEscherichia coliO157:H7The Journal of Infectious Diseases, 2003
- Genetic and Evolutionary Analysis of Mutations in thegusAGene That Cause the Absence of β‐Glucuronidase Activity inEscherichia coliO157:H7The Journal of Infectious Diseases, 2001
- PulseNet: The Molecular Subtyping Network for Foodborne Bacterial Disease Surveillance, United StatesEmerging Infectious Diseases, 2001
- PulseNet: The Molecular Subtyping Network for Foodborne Bacterial Disease Surveillance, United StatesEmerging Infectious Diseases, 2001
- The Complete Genome Sequence of Escherichia coli K-12Science, 1997
- A program for calculating and displaying compatibility matrices as an aid in determining reticulate evolution in molecular sequencesBioinformatics, 1996
- CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choiceNucleic Acids Research, 1994
- Split decomposition: A new and useful approach to phylogenetic analysis of distance dataMolecular Phylogenetics and Evolution, 1992
- A nested PCR followed by magnetic separation of amplified fragments for detection of Escherichia coli Shiga-like toxin genesMolecular and Cellular Probes, 1991
- Toxin Genotypes and Plasmid Profiles as Determinants of Systemic Sequelae in Escherichia coli O157:H7 InfectionsThe Journal of Infectious Diseases, 1989