Inferring genomic flux in bacteria
- 17 November 2008
- journal article
- research article
- Published by Cold Spring Harbor Laboratory in Genome Research
- Vol. 19 (2) , 306-317
- https://doi.org/10.1101/gr.082263.108
Abstract
Acquisition and loss of genetic material are essential forces in bacterial microevolution. They have been repeatedly linked with adaptation of lineages to new lifestyles, and in particular, pathogenicity. Comparative genomics has the potential to elucidate this genetic flux, but there are many methodological challenges involved in inferring evolutionary events from collections of genome sequences. Here we describe a model-based method for using whole-genome sequences to infer the patterns of genome content evolution. A fundamental property of our model is that it allows the rates at which genetic elements are gained or lost to vary in time and from one lineage to another. Our approach is purely sequence based, and does not rely on gene identification. We show how inference can be performed under our model and illustrate its use on three datasets fromFrancisella tularensis, Streptococcus pyogenes, andEscherichia coli. In all three examples, we found interesting variations in the rates of genetic material gain and loss, which strongly correlate with their lifestyle. The algorithms we describe are implemented in a computer software named GenoPlast.This publication has 68 references indexed in Scilit:
- A bimodal pattern of relatedness between the Salmonella Paratyphi A and Typhi genomes: Convergence or divergence by homologous recombination?Genome Research, 2006
- How to become a uropathogen: Comparative genomic analysis of extraintestinal pathogenic Escherichia coli strainsProceedings of the National Academy of Sciences, 2006
- Molecular genetic anatomy of inter- and intraserotype variation in the human bacterial pathogen group A StreptococcusProceedings of the National Academy of Sciences, 2006
- Identification of genes subject to positive selection in uropathogenic strains of Escherichia coli : A comparative genomics approachProceedings of the National Academy of Sciences, 2006
- The complete genome sequence of Francisella tularensis, the causative agent of tularemiaNature Genetics, 2005
- Massive gene decay in the leprosy bacillusNature, 2001
- Reversible jump Markov chain Monte Carlo computation and Bayesian model determinationBiometrika, 1995
- A tutorial on hidden Markov models and selected applications in speech recognitionProceedings of the IEEE, 1989
- Hybrid penicillin-binding proteins in penicillin-resistant strains of Neisseria gonorrhoeaeNature, 1988
- Evolutionary trees from DNA sequences: A maximum likelihood approachJournal of Molecular Evolution, 1981