Automated time series forecasting for biosurveillance
- 5 March 2007
- journal article
- research article
- Published by Wiley in Statistics in Medicine
- Vol. 26 (22) , 4202-4218
- https://doi.org/10.1002/sim.2835
Abstract
For robust detection performance, traditional control chart monitoring for biosurveillance is based on input data free of trends, day-of-week effects, and other systematic behaviour. Time series forecasting methods may be used to remove this behaviour by subtracting forecasts from observations to form residuals for algorithmic input. We describe three forecast methods and compare their predictive accuracy on each of 16 authentic syndromic data streams. The methods are (1) a non-adaptive regression model using a long historical baseline, (2) an adaptive regression model with a shorter, sliding baseline, and (3) the Holt–Winters method for generalized exponential smoothing. Criteria for comparing the forecasts were the root-mean-square error, the median absolute per cent error (MedAPE), and the median absolute deviation. The median-based criteria showed best overall performance for the Holt–Winters method. The MedAPE measures over the 16 test series averaged 16.5, 11.6, and 9.7 for the non-adaptive regression, adaptive regression, and Holt–Winters methods, respectively. The non-adaptive regression forecasts were degraded by changes in the data behaviour in the fixed baseline period used to compute model coefficients. The mean-based criterion was less conclusive because of the effects of poor forecasts on a small number of calendar holidays. The Holt–Winters method was also most effective at removing serial autocorrelation, with most 1-day-lag autocorrelation coefficients below 0.15. The forecast methods were compared without tuning them to the behaviour of individual series. We achieved improved predictions with such tuning of the Holt–Winters method, but practical use of such improvements for routine surveillance will require reliable data classification methods. Copyright © 2007 John Wiley & Sons, Ltd.Keywords
This publication has 17 references indexed in Scilit:
- Modeling emergency department visit patterns for infectious disease complaints: results and application to disease surveillanceBMC Medical Informatics and Decision Making, 2005
- Syndromic Surveillance: Is it Worth the Effort?CHANCE, 2004
- Detection of Pediatric Respiratory and Diarrheal Outbreaks from Sales of Over-the-counter Electrolyte ProductsJournal of the American Medical Informatics Association, 2003
- A monitoring system for detecting aberrations in public health surveillance reportsStatistics in Medicine, 1999
- Using Laboratory-Based Surveillance Data for Prevention: An Algorithm for Detecting Salmonella OutbreaksEmerging Infectious Diseases, 1997
- Multivariate exponential smoothing: Method and practiceInternational Journal of Forecasting, 1989
- Holt-Winters Forecasting: Some Practical IssuesJournal of the Royal Statistical Society: Series D (The Statistician), 1988
- The Holt-Winters Forecasting ProcedureJournal of the Royal Statistical Society Series C: Applied Statistics, 1978
- The Fundamental Theorem of Exponential SmoothingOperations Research, 1961
- Forecasting Sales by Exponentially Weighted Moving AveragesManagement Science, 1960