Real time spatial cluster detection using interpoint distances among precise patient locations
Open Access
- 21 June 2005
- journal article
- Published by Springer Nature in BMC Medical Informatics and Decision Making
- Vol. 5 (1) , 19
- https://doi.org/10.1186/1472-6947-5-19
Abstract
Background: Public health departments in the United States are beginning to gain timely access to health data, often as soon as one day after a visit to a health care facility. Consequently, new approaches to outbreak surveillance are being developed. When cases cluster geographically, an analysis of their spatial distribution can facilitate outbreak detection. Our method focuses on detecting perturbations in the distribution of pair-wise distances among all patients in a geographical region. Barring outbreaks, this distribution can be quite stable over time. We sought to exemplify the method by measuring its cluster detection performance, and to determine factors affecting sensitivity to spatial clustering among patients presenting to hospital emergency departments with respiratory syndromes. Methods: The approach was to (1) define a baseline spatial distribution of home addresses for a population of patients visiting an emergency department with respiratory syndromes using historical data; (2) develop a controlled feature set simulation by inserting simulated outbreak data with varied parameters into authentic background noise, thereby creating semisynthetic data; (3) compare the observed with the expected spatial distribution; (4) establish the relative value of different alarm strategies so as to maximize sensitivity for the detection of clustering; and (5) measure factors which have an impact on sensitivity. Results: Overall sensitivity to detect spatial clustering was 62%. This contrasts with an overall alarm rate of less than 5% for the same number of extra visits when the extra visits were not characterized by geographic clustering. Clusters that produced the least number of alarms were those that were small in size (10 extra visits in a week, where visits per week ranged from 120 to 472), diffusely distributed over an area with a 3 km radius, and located close to the hospital (5 km) in a region most densely populated with patients to this hospital. Near perfect alarm rates were found for clusters that varied on the opposite extremes of these parameters (40 extra visits, within a 250 meter radius, 50 km from the hospital). Conclusion: Measuring perturbations in the interpoint distance distribution is a sensitive method for detecting spatial clustering. When cases are clustered geographically, there is clearly power to detect clustering when the spatial distribution is represented by the M statistic, even when clusters are small in size. By varying independent parameters of simulated outbreaks, we have demonstrated empirically the limits of detection of different types of outbreaks.Keywords
This publication has 8 references indexed in Scilit:
- Modeling emergency department visit patterns for infectious disease complaints: results and application to disease surveillanceBMC Medical Informatics and Decision Making, 2005
- The interpoint distance distribution as a descriptor of point patterns, with an application to spatial disease clusteringStatistics in Medicine, 2004
- Use of Emergency Department Chief Complaint and Diagnostic Codes for Identifying Respiratory Illness in a Pediatric PopulationPediatric Emergency Care, 2004
- Syndromic Surveillance in Public Health Practice, New York CityEmerging Infectious Diseases, 2004
- Measuring Outbreak-Detection Performance By Using Controlled Feature Set SimulationsMMWR Supplements, 2004
- Implementing Syndromic Surveillance: A Practical Guide Informed by the Early ExperienceJournal of the American Medical Informatics Association, 2003
- Time series modeling for syndromic surveillanceBMC Medical Informatics and Decision Making, 2003
- A Review and Discussion of Prospective Statistical Surveillance in Public HealthJournal of the Royal Statistical Society Series A: Statistics in Society, 2003