Comparison of 61 Sequenced Escherichia coli Genomes
Top Cited Papers
Open Access
- 11 July 2010
- journal article
- research article
- Published by Springer Nature in Microbial Ecology
- Vol. 60 (4) , 708-720
- https://doi.org/10.1007/s00248-010-9717-3
Abstract
Escherichia coli is an important component of the biosphere and is an ideal model for studies of processes involved in bacterial genome evolution. Sixty-one publically available E. coli and Shigella spp. sequenced genomes are compared, using basic methods to produce phylogenetic and proteomics trees, and to identify the pan- and core genomes of this set of sequenced strains. A hierarchical clustering of variable genes allowed clear separation of the strains into clusters, including known pathotypes; clinically relevant serotypes can also be resolved in this way. In contrast, when in silico MLST was performed, many of the various strains appear jumbled and less well resolved. The predicted pan-genome comprises 15,741 gene families, and only 993 (6%) of the families are represented in every genome, comprising the core genome. The variable or ‘accessory’ genes thus make up more than 90% of the pan-genome and about 80% of a typical genome; some of these variable genes tend to be co-localized on genomic islands. The diversity within the species E. coli, and the overlap in gene content between this and related species, suggests a continuum rather than sharp species borders in this group of Enterobacteriaceae.Keywords
This publication has 53 references indexed in Scilit:
- Clonal Relationship among Atypical Enteropathogenic Escherichia coli Strains Isolated from Different Animal Species and HumansApplied and Environmental Microbiology, 2009
- Comparative genomics reveal the mechanism of the parallel evolution of O157 and non-O157 enterohemorrhagic Escherichia coliProceedings of the National Academy of Sciences, 2009
- Genome Project Standards in a New Era of SequencingScience, 2009
- Complete Genome Sequence and Comparative Analysis of the Wild-type Commensal Escherichia coli Strain SE11 Isolated from a Healthy AdultDNA Research, 2008
- Then and now: use of 16S rDNA gene sequencing for bacterial identification and discovery of novel bacteria in clinical microbiology laboratoriesClinical Microbiology & Infection, 2008
- Clustal W and Clustal X version 2.0Bioinformatics, 2007
- RNAmmer: consistent and rapid annotation of ribosomal RNA genesNucleic Acids Research, 2007
- Identification of genes subject to positive selection in uropathogenic strains of Escherichia coli : A comparative genomics approachProceedings of the National Academy of Sciences, 2006
- Highly accurate genome sequences ofEscherichia coliK‐12 strains MG1655 and W3110Molecular Systems Biology, 2006
- The Complete Genome Sequence of Escherichia coli K-12Science, 1997