Bayesian modeling of recombination events in bacterial populations
Open Access
- 7 October 2008
- journal article
- Published by Springer Nature in BMC Bioinformatics
- Vol. 9 (1) , 421
- https://doi.org/10.1186/1471-2105-9-421
Abstract
We consider the discovery of recombinant segments jointly with their origins within multilocus DNA sequences from bacteria representing heterogeneous populations of fairly closely related species. The currently available methods for recombination detection capable of probabilistic characterization of uncertainty have a limited applicability in practice as the number of strains in a data set increases. We introduce a Bayesian spatial structural model representing the continuum of origins over sites within the observed sequences, including a probabilistic characterization of uncertainty related to the origin of any particular site. To enable a statistically accurate and practically feasible approach to the analysis of large-scale data sets representing a single genus, we have developed a novel software tool (BRAT, Bayesian Recombination Tracker) implementing the model and the corresponding learning algorithm, which is capable of identifying the posterior optimal structure and to estimate the marginal posterior probabilities of putative origins over the sites. A multitude of challenging simulation scenarios and an analysis of real data from seven housekeeping genes of 120 strains of genus Burkholderia are used to illustrate the possibilities offered by our approach. The software is freely available for download at URL .Keywords
This publication has 39 references indexed in Scilit:
- Recodon: Coalescent simulation of coding DNA sequences with recombination, migration and demographyBMC Bioinformatics, 2007
- A Systematics for Discovering the Fundamental Units of Bacterial DiversityCurrent Biology, 2007
- Phylogenetic Mapping of Recombination Hotspots in Human Immunodeficiency Virus via Spatially Smoothed Change-Point ProcessesGenetics, 2007
- EnvironmentalBurkholderia cepaciaComplex Isolates from Human InfectionsEmerging Infectious Diseases, 2007
- Inference of Bacterial Microevolution Using Multilocus Sequence DataGenetics, 2007
- The Mre11 Complex Influences DNA Repair, Synapsis, and Crossing Over in Murine MeiosisCurrent Biology, 2007
- Recombination and the Nature of Bacterial SpeciationScience, 2007
- The multifarious, multireplicon Burkholderia cepacia complexNature Reviews Microbiology, 2005
- Theoretical models for heterogeneity of base composition in DNAJournal of Theoretical Biology, 1974
- Segmental distribution of nucleotides in the DNA of bacteriophage lambdaJournal of Molecular Biology, 1968