Exploring the temporal structure of heterochronous sequences using TempEst (formerly Path-O-Gen)
Top Cited Papers
Open Access
- 1 January 2016
- journal article
- research article
- Published by Oxford University Press (OUP) in Virus Evolution
- Vol. 2 (1) , vew007
- https://doi.org/10.1093/ve/vew007
Abstract
Gene sequences sampled at different points in time can be used to infer molecular phylogenies on a natural timescale of months or years, provided that the sequences in question undergo measurable amounts of evolutionary change between sampling times. Data sets with this property are termed heterochronous and have become increasingly common in several fields of biology, most notably the molecular epidemiology of rapidly evolving viruses. Here we introduce the cross-platform software tool, TempEst (formerly known as Path-O-Gen), for the visualization and analysis of temporally sampled sequence data. Given a molecular phylogeny and the dates of sampling for each sequence, TempEst uses an interactive regression approach to explore the association between genetic divergence through time and sampling dates. TempEst can be used to (1) assess whether there is sufficient temporal signal in the data to proceed with phylogenetic molecular clock analysis, and (2) identify sequences whose genetic divergence and sampling date are incongruent. Examination of the latter can help identify data quality problems, including errors in data annotation, sample contamination, sequence recombination, or alignment error. We recommend that all users of the molecular clock models implemented in BEAST first check their data using TempEst prior to analysis.Keywords
This publication has 28 references indexed in Scilit:
- Systematic phylogenetic analysis of influenza A virus reveals many novel mosaic genome segmentsInfection, Genetics and Evolution, 2013
- Bayesian Phylogenetics with BEAUti and the BEAST 1.7Molecular Biology and Evolution, 2012
- Bayesian random local clocks, or one rate to rule them allBMC Biology, 2010
- Using Time-Structured Data to Estimate Evolutionary Rates of Double-Stranded DNA VirusesMolecular Biology and Evolution, 2010
- The evolutionary rate dynamically tracks changes in HIV-1 epidemics: Application of a simple method for optimizing the evolutionary rate in phylogenetic trees with longitudinal dataEpidemics, 2009
- Recent human-to-poultry host jump, adaptation, and pandemic spread of Staphylococcus aureusProceedings of the National Academy of Sciences, 2009
- The Global Circulation of Seasonal Influenza A (H3N2) VirusesScience, 2008
- Phylogenetic Evidence against Evolutionary Stasis and Natural Abiotic Reservoirs of Influenza A VirusJournal of Virology, 2008
- PAML 4: Phylogenetic Analysis by Maximum LikelihoodMolecular Biology and Evolution, 2007
- Relaxed Phylogenetics and Dating with ConfidencePLoS Biology, 2006