Assessing Google Flu Trends Performance in the United States during the 2009 Influenza Virus A (H1N1) Pandemic
Open Access
- 19 August 2011
- journal article
- research article
- Published by Public Library of Science (PLoS) in PLOS ONE
- Vol. 6 (8) , e23610
- https://doi.org/10.1371/journal.pone.0023610
Abstract
Google Flu Trends (GFT) uses anonymized, aggregated internet search activity to provide near real-time estimates of influenza activity. GFT estimates have shown a strong correlation with official influenza surveillance data. The 2009 influenza virus A (H1N1) pandemic (pH1N1) provided the first opportunity to evaluate GFT during a non-seasonal influenza outbreak. In September 2009, an updated United States GFT model was developed using data from the beginning of pH1N1.
We evaluated the accuracy of each U.S. GFT model by comparing its weekly estimates of influenza-like illness (ILI) activity with data from the U.S. Outpatient Influenza-like Illness Surveillance Network (ILINet). For each GFT model we calculated the correlation and root mean square error (RMSE) between model estimates and ILINet for four time periods: pre-H1N1, Summer H1N1, Winter H1N1, and H1N1 overall (Mar 2009–Dec 2009). We also compared the number of queries, query volume, and types of queries (e.g., influenza symptoms, influenza complications) in each model.
Both models' estimates were highly correlated with ILINet pre-H1N1 and over the entire surveillance period, although the original model underestimated the magnitude of ILI activity during pH1N1. During Summer H1N1, the updated model was far more strongly correlated with ILINet than the original model (r = 0.95 and 0.29, respectively). The updated model included more search query terms than the original model, with more queries directly related to influenza infection, whereas the original model contained more queries related to influenza complications. Internet search behavior changed during pH1N1, particularly in the categories “influenza complications” and “term for influenza.” The complications associated with pH1N1, the fact that pH1N1 began in the summer rather than winter, and changes in health-seeking behavior may each have played a part.
Both GFT models performed well prior to and during pH1N1, although the updated model performed better during pH1N1, especially during the summer months.
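The per-period comparison described in the abstract (correlation and RMSE between weekly model estimates and ILINet) can be sketched as follows. This is a minimal illustration, not the authors' code: the abstract does not say which correlation coefficient was used, so Pearson's r is assumed here, and the function names, the row layout, and the sample values are all hypothetical placeholders rather than data from the study.

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation between two equal-length sequences (assumed metric)."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two equal-length sequences with at least 2 points")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant series")
    return cov / math.sqrt(var_x * var_y)


def rmse(estimates, observed):
    """Root mean square error between model estimates and observed values."""
    n = len(estimates)
    if n != len(observed) or n == 0:
        raise ValueError("need two equal-length, non-empty sequences")
    squared_errors = ((e - o) ** 2 for e, o in zip(estimates, observed))
    return math.sqrt(sum(squared_errors) / n)


def evaluate_by_period(rows):
    """Group (period, model_estimate, ilinet_value) rows and score each period."""
    grouped = {}
    for period, estimate, observed in rows:
        est_list, obs_list = grouped.setdefault(period, ([], []))
        est_list.append(estimate)
        obs_list.append(observed)
    return {
        period: {"r": pearson_r(est, obs), "rmse": rmse(est, obs)}
        for period, (est, obs) in grouped.items()
    }


if __name__ == "__main__":
    # Made-up weekly ILI percentages, for illustration only (not study data).
    sample_rows = [
        ("pre-H1N1", 1.0, 1.2),
        ("pre-H1N1", 2.0, 2.1),
        ("pre-H1N1", 3.0, 3.3),
        ("Summer H1N1", 1.5, 2.0),
        ("Summer H1N1", 1.4, 3.0),
        ("Summer H1N1", 1.6, 2.5),
    ]
    for period, scores in evaluate_by_period(sample_rows).items():
        print(period, scores)
```

Reporting both metrics matters because they capture different failures: a model can track the shape of ILINet closely (high r) while still underestimating its magnitude (high RMSE), which is the pattern the abstract reports for the original model during pH1N1.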