Within-Host Bacterial Diversity Hinders Accurate Reconstruction of Transmission Networks from Genomic Distance Data
Top Cited Papers
Open Access
- 27 March 2014
- journal article
- research article
- Published by Public Library of Science (PLoS) in PLoS Computational Biology
- Vol. 10 (3) , e1003549
- https://doi.org/10.1371/journal.pcbi.1003549
Abstract
The prospect of using whole genome sequence data to investigate bacterial disease outbreaks has been keenly anticipated in many quarters, and the large-scale collection and sequencing of isolates from cases is becoming increasingly feasible. While sequence data can provide many important insights into disease spread and pathogen adaptation, it remains unclear how successfully they may be used to estimate individual routes of transmission. Several studies have attempted to reconstruct transmission routes using genomic data; however, these have typically relied upon restrictive assumptions, such as a shared topology of the phylogenetic tree and a lack of within-host diversity. In this study, we investigated the potential for bacterial genomic data to inform transmission network reconstruction. We used simulation models to investigate the origins, persistence and onward transmission of genetic diversity, and examined the impact of such diversity on our estimation of the epidemiological relationship between carriers. We used a flexible distance-based metric to provide a weighted transmission network, and used receiver-operating characteristic (ROC) curves and network entropy to assess the accuracy and uncertainty of the inferred structure. Our results suggest that sequencing a single isolate from each case is inadequate in the presence of within-host diversity, and is likely to result in misleading interpretations of transmission dynamics – under many plausible conditions, this may be little better than selecting transmission links at random. Sampling more frequently improves accuracy, but much uncertainty remains, even if all genotypes are observed. While it is possible to discriminate between clusters of carriers, individual transmission routes cannot be resolved by sequence data alone. Our study demonstrates that bacterial genomic distance data alone provide only limited information on person-to-person transmission dynamics. With the advent of affordable large-scale genome sequencing for bacterial pathogens, there is much interest in using such data to identify who infected whom in a disease outbreak. Many methods exist to reconstruct the phylogeny of sampled bacteria, but the resulting tree does not necessarily share the same structure as the transmission tree linking infected persons. We explored the potential of sampled genomic data to inform the transmission tree, measuring the accuracy and precision of estimated networks based on simulated data. We demonstrated that failing to account for within-host diversity can lead to poor network reconstructions - even with repeated sampling of each carrier, there is still much uncertainty in the estimated structure. While it may be possible to identify clusters of potential sources, identifying individual transmission links is not possible using bacterial sequence data alone. This work highlights potential limitations of genomic data to investigate transmission dynamics, lending support to methods unifying all available data sources.Keywords
This publication has 43 references indexed in Scilit:
- Within-Host Evolution of Staphylococcus aureus during Asymptomatic CarriagePLOS ONE, 2013
- Whole-genome sequencing for analysis of an outbreak of meticillin-resistant Staphylococcus aureus: a descriptive studyPublished by Elsevier ,2012
- Evolutionary dynamics of Staphylococcus aureus during progression from carriage to diseaseProceedings of the National Academy of Sciences, 2012
- Parallel bacterial evolution within multiple patients identifies candidate pathogenicity genesNature Genetics, 2011
- Unravelling transmission trees of infectious diseases by combining genetic and epidemiological dataProceedings Of The Royal Society B-Biological Sciences, 2011
- Reconstructing disease outbreaks from genetic data: a graph approachHeredity, 2010
- Evolution of MRSA During Hospital Transmission and Intercontinental SpreadScience, 2010
- Integrating genetic and epidemiological data to determine transmission pathways of foot-and-mouth disease virusProceedings Of The Royal Society B-Biological Sciences, 2008
- Tracking the in vivo evolution of multidrug resistance in Staphylococcus aureus by whole-genome sequencingProceedings of the National Academy of Sciences, 2007
- Genetic adaptation by Pseudomonas aeruginosa to the airways of cystic fibrosis patientsProceedings of the National Academy of Sciences, 2006