Temporal signal and the phylodynamic threshold of SARS-CoV-2
Preprint
- 4 May 2020
- preprint
- Published by Cold Spring Harbor Laboratory in bioRxiv
Abstract
The ongoing SARS-CoV-2 outbreak marks the first time that large amounts of genome sequence data have been generated and made publicly available in near real-time. Early analyses of these data revealed low sequence variation, a finding that is consistent with a recently emerging outbreak, but which raises the question of whether such data are sufficiently informative for phylogenetic inferences of evolutionary rates and time scales. The phylodynamic threshold is a key concept that refers to the point in time at which sufficient molecular evolutionary change has accumulated in available genome samples to obtain robust phylodynamic estimates. For example, before the phylodynamic threshold is reached, genomic variation is so low that even large amounts of genome sequences may be insufficient to estimate the virus’s evolutionary rate and the time scale of an outbreak. We collected genome sequences of SARS-CoV-2 from public databases at 8 different points in time and conducted a range of tests of temporal signal to determine if and when the phylodynamic threshold was reached, and the range of inferences that could be reliably drawn from these data. Our results indicate that by February 2nd 2020, estimates of evolutionary rates and time scales had become possible. Analyses of subsequent data sets, that included between 47 to 122 genomes, converged at an evolutionary rate of about 1.1×10−3 subs/site/year and a time of origin of around late November 2019. Our study provides guidelines to assess the phylodynamic threshold and demonstrates that establishing this threshold constitutes a fundamental step for understanding the power and limitations of early data in outbreak genome surveillance.Keywords
All Related Versions
- Published version: Virus Evolution, 6 (2), veaa061.
This publication has 24 references indexed in Scilit:
- Bayesian Evaluation of Temporal Signal in Measurably Evolving PopulationsbioRxiv, 2019
- Emerging Concepts of Data Integration in Pathogen PhylodynamicsSystematic Biology, 2016
- Genealogical Working Distributions for Bayesian Model Testing with Phylogenetic UncertaintySystematic Biology, 2015
- Measurably evolving pathogens in the genomic eraPublished by Elsevier ,2015
- The Performance of the Date-Randomization Test in Phylogenetic Analyses of Time-Structured Virus DataMolecular Biology and Evolution, 2015
- Inference of Epidemiological Dynamics Based on Simulated Phylogenies Using Birth-Death and Coalescent ModelsPLoS Computational Biology, 2014
- Accurate Model Selection of Relaxed Molecular Clocks in Bayesian PhylogeneticsMolecular Biology and Evolution, 2012
- Choosing among Partition Models in Bayesian PhylogeneticsMolecular Biology and Evolution, 2010
- Measurably evolving populationsPublished by Elsevier ,2003
- Molecular clock of viral evolution, and the neutral theory.Proceedings of the National Academy of Sciences, 1990