Shared genomic variants: identification of transmission routes using pathogen deep sequence data
Preprint
- 20 November 2015
- preprint
- Published by Cold Spring Harbor Laboratory in bioRxiv
- p. 032458
- https://doi.org/10.1101/032458
Abstract
Sequencing pathogen samples during a communicable disease outbreak is becoming an increasingly common procedure in epidemiological investigations. Identifying who infected whom sheds considerable light on transmission patterns, high-risk settings and subpopulations, and infection control effectiveness. Genomic data shed new light on transmission dynamics, and can be used to identify clusters of individuals likely to be linked by direct transmission. However, identification of individual routes of infection via single genome samples typically remains uncertain. Here, we investigate the potential of deep sequence data to provide greater resolution on transmission routes, via the identification of shared genomic variants. We assess several easily implemented methods to identify transmission routes using both shared variants and genetic distance, demonstrating that shared variants can provide considerable additional information in most scenarios. While shared variant approaches identify relatively few links in the presence of a small transmission bottleneck, these links are highly confident. Furthermore, we proposed hybrid approach additionally incorporating phylogenetic distance to provide greater resolution. We apply our methods to data collected during the 2014 Ebola outbreak, identifying several likely routes of transmission. Our study highlights the power of pathogen deep sequence data as a component of outbreak investigation and epidemiological analyses.Keywords
All Related Versions
- Published version: American Journal of Epidemiology, 186 (10), 1209.
This publication has 39 references indexed in Scilit:
- Bayesian Inference of Infectious Disease Transmission from Whole-Genome Sequence DataMolecular Biology and Evolution, 2014
- Within-Host Bacterial Diversity Hinders Accurate Reconstruction of Transmission Networks from Genomic Distance DataPLoS Computational Biology, 2014
- Bayesian Reconstruction of Disease Outbreaks by Combining Epidemiologic and Genomic DataPLoS Computational Biology, 2014
- Relating Phylogenetic Trees to Transmission Trees of Infectious Disease OutbreaksGenetics, 2013
- Unravelling transmission trees of infectious diseases by combining genetic and epidemiological dataProceedings Of The Royal Society B-Biological Sciences, 2011
- Whole-Genome Sequencing and Social-Network Analysis of a Tuberculosis OutbreakNew England Journal of Medicine, 2011
- Reconstructing disease outbreaks from genetic data: a graph approachHeredity, 2010
- spa Typing of Staphylococcus aureus as a Frontline Tool in Epidemiological TypingJournal of Clinical Microbiology, 2008
- Integrating genetic and epidemiological data to determine transmission pathways of foot-and-mouth disease virusProceedings Of The Royal Society B-Biological Sciences, 2008
- spa Typing Method for Discriminating among Staphylococcus aureus Isolates: Implications for Use of a Single Marker To Detect Genetic Micro- and MacrovariationJournal of Clinical Microbiology, 2004