Genome Sequence of Avery's Virulent Serotype 2 Strain D39 ofStreptococcus pneumoniaeand Comparison with That of Unencapsulated Laboratory Strain R6
Top Cited Papers
- 1 January 2007
- journal article
- review article
- Published by American Society for Microbiology in Journal of Bacteriology
- Vol. 189 (1) , 38-51
- https://doi.org/10.1128/jb.01148-06
Abstract
Streptococcus pneumoniae (pneumococcus) is a leading human respiratory pathogen that causes a variety of serious mucosal and invasive diseases. D39 is an historically important serotype 2 strain that was used in experiments by Avery and coworkers to demonstrate that DNA is the genetic material. Although isolated nearly a century ago, D39 remains extremely virulent in murine infection models and is perhaps the strain used most frequently in current studies of pneumococcal pathogenesis. To date, the complete genome sequences have been reported for only two S. pneumoniae strains: TIGR4, a recent serotype 4 clinical isolate, and laboratory strain R6, an avirulent, unencapsulated derivative of strain D39. We report here the genome sequences and new annotation of two different isolates of strain D39 and the corrected sequence of strain R6. Comparisons of these three related sequences allowed deduction of the likely sequence of the D39 progenitor and mutations that arose in each isolate. Despite its numerous repeated sequences and IS elements, the serotype 2 genome has remained remarkably stable during cultivation, and one of the D39 isolates contains only five relatively minor mutations compared to the deduced D39 progenitor. In contrast, laboratory strain R6 contains 71 single-basepair changes, six deletions, and four insertions and has lost the cryptic pDP1 plasmid compared to the D39 progenitor strain. Many of these mutations are in or affect the expression of genes that play important roles in regulation, metabolism, and virulence. The nature of the mutations that arose spontaneously in these three strains, the relative global transcription patterns determined by microarray analyses, and the implications of the D39 genome sequences to studies of pneumococcal physiology and pathogenesis are presented and discussed.Keywords
This publication has 122 references indexed in Scilit:
- Genomic Diversity between Strains of the Same Serotype and Multilocus Sequence Type among Pneumococcal Clinical IsolatesInfection and Immunity, 2006
- Catabolite Control Protein A (CcpA) Contributes to Virulence and Regulation of Sugar Metabolism in Streptococcus pneumoniaeJournal of Bacteriology, 2005
- Choline-Binding Protein D (CbpD) inStreptococcus pneumoniaeIs Essential for Competence-Induced Cell LysisJournal of Bacteriology, 2005
- Improved Prediction of Signal Peptides: SignalP 3.0Journal of Molecular Biology, 2004
- MUSCLE: multiple sequence alignment with high accuracy and high throughputNucleic Acids Research, 2004
- Predicting transmembrane protein topology with a hidden markov model: application to complete genomes11Edited by F. CohenJournal of Molecular Biology, 2001
- Improved microbial gene identification with GLIMMERNucleic Acids Research, 1999
- tRNAscan-SE: A Program for Improved Detection of Transfer RNA Genes in Genomic SequenceNucleic Acids Research, 1997
- The genetic basis of colony opacity in Streptococcus pneumoniae: evidence for the effect of box elements on the frequency of phenotypic variationMolecular Microbiology, 1995
- Basic local alignment search toolJournal of Molecular Biology, 1990