Identification of Escherichia coli O157:H7 Genomic Regions Conserved in Strains with a Genotype Associated with Human Infection

Abstract
Beta-glucuronidase-negative, sorbitol-nonfermenting isolates of Shiga toxin-producing Escherichia coli O157 comprise part of a clone complex of related enterohemorrhagic E. coli isolates. High-resolution genotyping shows that the O157 populations have diverged into two different lineages that appear to have different ecologies. To identify genomic regions unique to the most common human-associated genotype, suppression subtractive hybridization was used to identify DNA sequences present in two clinical strains representing the human lineage I O157:H7 strains but absent from two bovine-derived lineage II strains. PCR assays were then used to test for the presence of these regions in 10 lineage I strains and 20 lineage II strains. Twelve conserved regions of genomic difference for lineage I (CRD I ) were identified that were each present in at least seven of the lineage I strains but absent in most of the lineage II strains tested. The boundaries of the lineage I conserved regions were further delimited by PCR. Eleven of these CRD I were associated with E. coli Sakai S-loops 14, 16, 69, 72, 78, 82, 83, 91 to 93, 153, and 286, and the final CRD I was located on the pO157 virulence plasmid. Several potential virulence factors were identified within these regions, including a putative hemolysin-activating protein, an iron transport system, and several possible regulatory genes. Cluster analysis based on lineage I conserved regions showed that the presence/absence of these regions was congruent with the inferred phylogeny of the strains.