Privacy Protection Versus Cluster Detection in Spatial Epidemiology
- 1 November 2006
- journal article
- Published by American Public Health Association in American Journal of Public Health
- Vol. 96 (11), 2002-2008
- https://doi.org/10.2105/ajph.2005.069526
Abstract
Objectives. Patient data that includes precise locations can reveal patients’ identities, whereas data aggregated into administrative regions may preserve privacy and confidentiality. We investigated the effect of varying degrees of address precision (exact latitude and longitude vs the center points of zip code or census tracts) on detection of spatial clusters of cases.
Methods. We simulated disease outbreaks by adding supplementary spatially clustered emergency department visits to authentic hospital emergency department syndromic surveillance data. We identified clusters with a spatial scan statistic and evaluated detection rate and accuracy.
Results. More clusters were identified, and clusters were more accurately detected, when exact locations were used. That is, these clusters contained at least half of the simulated points and involved few additional emergency department visits. These results were especially apparent when the synthetic clustered points crossed administrative boundaries and fell into multiple zip code or census tracts.
Conclusions. The spatial cluster detection algorithm performed better when addresses were analyzed as exact locations than when they were analyzed as center points of zip code or census tracts, particularly when the clustered points crossed administrative boundaries. Use of precise addresses offers improved performance, but this practice must be weighed against privacy concerns in the establishment of public health data exchange policies.
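The boundary-crossing effect described in the abstract can be illustrated with a minimal sketch. This is not the study's method: it replaces the spatial scan statistic with a single fixed circular window, and it uses square grid cells as a hypothetical stand-in for zip codes or census tracts. All coordinates, the cell size, and the function names are assumptions made for illustration only.

```python
import math

def snap_to_cell_center(point, cell_size):
    """Replace a point with the center of the square grid cell containing it.

    The grid cell is a hypothetical stand-in for an administrative region
    (zip code or census tract); real region centroids are not grid centers.
    """
    x, y = point
    cx = (math.floor(x / cell_size) + 0.5) * cell_size
    cy = (math.floor(y / cell_size) + 0.5) * cell_size
    return (cx, cy)

def count_in_circle(points, center, radius):
    """Count points lying within `radius` of `center` (a fixed window,
    not a scan over many candidate windows)."""
    return sum(1 for p in points if math.dist(p, center) <= radius)

# Hypothetical cluster of six cases straddling the cell boundary at x = 10.
cluster = [(9.2, 5.0), (9.6, 5.4), (9.8, 4.7),
           (10.1, 5.2), (10.4, 4.9), (10.7, 5.3)]
cell_size = 10.0
window_center = (10.0, 5.0)
radius = 1.0

# With exact coordinates, the window captures the whole cluster.
exact_count = count_in_circle(cluster, window_center, radius)

# After aggregation, the cases collapse onto two distant cell centers.
snapped = [snap_to_cell_center(p, cell_size) for p in cluster]
snapped_count = count_in_circle(snapped, window_center, radius)

print("exact:", exact_count)
print("aggregated:", snapped_count)
print("aggregated locations:", sorted(set(snapped)))
```

In this toy setup the tight cluster is split evenly between two cell centers that lie far from where the cases actually occurred, which is one plausible reading of why clusters that crossed administrative boundaries were harder to detect with center-point data.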