On the identifiability of transmission dynamic models for infectious diseases
Preprint
- 5 July 2015
- preprint
- Published by Cold Spring Harbor Laboratory in bioRxiv
- p. 021972
- https://doi.org/10.1101/021972
Abstract
Understanding the transmission dynamics of infectious diseases is important for both biological research and public health applications. It has been widely demonstrated that statistical modeling provides a firm basis for inferring relevant epidemiological quantities from incidence and molecular data. However, the complexity of transmission dynamic models causes two challenges: Firstly, the likelihood function of the models is generally not computable and computationally intensive simulation-based inference methods need to be employed. Secondly, the model may not be fully identifiable from the available data. While the first difficulty can be tackled by computational and algorithmic advances, the second obstacle is more fundamental. Identifiability issues may lead to inferences which are more driven by the prior assumptions than the data themselves. We here consider a popular and relatively simple, yet analytically intractable model for the spread of tuberculosis based on classical IS6110 fingerprinting data. We report on the identifiability of the model, presenting also some methodological advances regarding the inference. Using likelihood approximations, it is shown that the reproductive value cannot be identified from the data available and that the posterior distributions obtained in previous work have likely been substantially dominated by the assumed prior distribution. Further, we show that the inferences are influenced by the assumed infectious population size which has generally been kept fixed in previous work. We demonstrate that the infectious population size can be inferred if the remaining epidemiological parameters are already known with sufficient precision.Keywords
All Related Versions
- Published version: Genetics, 202 (3), 911.
This publication has 20 references indexed in Scilit:
- Bayesian computation: a summary of the current state, and samples backwards and forwardsStatistics and Computing, 2015
- A simulated annealing approach to approximate Bayes computationsStatistics and Computing, 2014
- Likelihood-free parallel temperingStatistics and Computing, 2012
- An adaptive sequential Monte Carlo method for approximate Bayesian computationStatistics and Computing, 2011
- Modeling the joint epidemics of TB and HIV in a South African townshipJournal of Mathematical Biology, 2008
- Sequential Monte Carlo without likelihoodsProceedings of the National Academy of Sciences, 2007
- Logistic Regression, AdaBoost and Bregman DistancesMachine Learning, 2002
- Inferring Coalescence Times From DNA Sequence DataGenetics, 1997
- The Epidemiology of Tuberculosis in San Francisco -- A Population-Based Study Using Conventional and Molecular MethodsNew England Journal of Medicine, 1994
- The relaxation method of finding the common point of convex sets and its application to the solution of problems in convex programmingUSSR Computational Mathematics and Mathematical Physics, 1967