Text and Structural Data Mining of Influenza Mentions in Web and Social Media
Open Access
- 21 February 2010
- journal article
- research article
- Published by MDPI AG in International Journal of Environmental Research and Public Health
- Vol. 7 (2), 596-615
- https://doi.org/10.3390/ijerph7020596
Abstract
Text and structural data mining of web and social media (WSM) provides a novel disease surveillance resource and can identify online communities for targeted public health communications (PHC) to ensure wide dissemination of pertinent information. WSM that mention influenza are harvested over a 24-week period, 5 October 2008 to 21 March 2009. Link analysis reveals communities for targeted PHC. Text mining is shown to identify trends in flu posts that correlate with real-world influenza-like illness patient report data. We also apply a graph-based data mining technique to detect anomalies among flu blogs connected by publisher type, links, and user tags.
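The trend comparison described in the abstract can be illustrated with a minimal sketch. This is not the authors' pipeline: the function names (`weekly_flu_counts`, `pearson`), the keyword list, and all sample posts and illness counts below are hypothetical and chosen only for illustration. The sketch assumes posts arrive as `(date, text)` pairs, bins flu-mentioning posts into weeks starting from the first day of the study period, and computes a plain Pearson correlation against a weekly influenza-like illness (ILI) series.

```python
import re
from collections import Counter
from datetime import date
from math import sqrt


def weekly_flu_counts(posts, start, n_weeks, terms=("flu", "influenza")):
    """Count posts per week that contain any flu term as a whole word.

    posts: iterable of (date, text) pairs (hypothetical input format).
    start: first day of week 0.
    """
    wanted = set(terms)
    counts = Counter()
    for day, text in posts:
        week = (day - start).days // 7
        words = set(re.findall(r"[a-z]+", text.lower()))
        if 0 <= week < n_weeks and words & wanted:
            counts[week] += 1
    return [counts[w] for w in range(n_weeks)]


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length series."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("series must have equal length of at least 2")
    mx = sum(xs) / len(xs)
    my = sum(ys) / len(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    if vx == 0 or vy == 0:
        raise ValueError("correlation is undefined for a constant series")
    return cov / sqrt(vx * vy)


# Made-up sample data; the real study covered 24 weeks of harvested WSM.
study_start = date(2008, 10, 5)
sample_posts = [
    (date(2008, 10, 6), "Got my flu shot today"),
    (date(2008, 10, 7), "Her speech was very influential"),  # not a flu mention
    (date(2008, 10, 14), "Influenza season is starting"),
    (date(2008, 10, 15), "Flu everywhere at the office"),
    (date(2008, 10, 20), "Home sick with the flu"),
    (date(2008, 10, 21), "Still down with flu"),
    (date(2008, 10, 22), "Influenza outbreak at the school"),
]
sample_ili = [10, 20, 30]  # hypothetical weekly ILI patient reports

post_counts = weekly_flu_counts(sample_posts, study_start, n_weeks=3)
print(post_counts, pearson(post_counts, sample_ili))
```

Matching on whole words rather than substrings keeps words such as "influential" from being counted as flu mentions. A real analysis would likely also normalize by total post volume per week, which this sketch omits.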