Geostatistical analysis of disease data: estimation of cancer mortality risk from empirical frequencies using Poisson kriging
Open Access
- 14 December 2005
- journal article
- research article
- Published by Springer Nature in International Journal of Health Geographics
- Vol. 4 (1) , 31
- https://doi.org/10.1186/1476-072x-4-31
Abstract
Background: Cancer mortality maps are used by public health officials to identify areas of excess and to guide surveillance and control activities. Quality of decision-making thus relies on an accurate quantification of risks from observed rates which can be very unreliable when computed from sparsely populated geographical units or recorded for minority populations. This paper presents a geostatistical methodology that accounts for spatially varying population sizes and spatial patterns in the processing of cancer mortality data. Simulation studies are conducted to compare the performances of Poisson kriging to a few simple smoothers (i.e. population-weighted estimators and empirical Bayes smoothers) under different scenarios for the disease frequency, the population size, and the spatial pattern of risk. A public-domain executable with example datasets is provided.Results: The analysis of age-adjusted mortality rates for breast and cervix cancers illustrated some key features of commonly used smoothing techniques. Because of the small weight assigned to the rate observed over the entity being smoothed (kernel weight), the population-weighted average leads to risk maps that show little variability. Other techniques assign larger and similar kernel weights but they use a different piece of auxiliary information in the prediction: global or local means for global or local empirical Bayes smoothers, and spatial combination of surrounding rates for the geostatistical estimator. Simulation studies indicated that Poisson kriging outperforms other approaches for most scenarios, with a clear benefit when the risk values are spatially correlated. Global empirical Bayes smoothers provide more accurate predictions under the least frequent scenario of spatially random risk.Conclusion: The approach presented in this paper enables researchers to incorporate the pattern of spatial dependence of mortality rates into the mapping of risk values and the quantification of the associated uncertainty, while being easier to implement than a full Bayesian model. The availability of a public-domain executable makes the geostatistical analysis of health data, and its comparison to traditional smoothers, more accessible to common users. In future papers this methodology will be generalized to the simulation of the spatial distribution of risk values and the propagation of the uncertainty attached to predicted risks in local cluster analysis.Keywords
This publication has 39 references indexed in Scilit:
- Exploring Scale‐Dependent Correlations Between Cancer Mortality Rates Using Factorial Kriging and Population‐Weighted SemivariogramsGeographical Analysis, 2005
- A Geostatistical Framework for Area‐to‐Point Spatial InterpolationGeographical Analysis, 2004
- Interpreting Posterior Relative Risk Estimates in Disease-Mapping StudiesEnvironmental Health Perspectives, 2004
- Evaluation of Methods for Classifying Epidemiological Data on Choropleth Maps in SeriesAnnals of the American Association of Geographers, 2002
- Combining Incompatible Spatial DataJournal of the American Statistical Association, 2002
- Disease map reconstructionStatistics in Medicine, 2001
- Geostatistics: Modeling Spatial UncertaintyJournal of the American Statistical Association, 2000
- Binomial cokriging for estimating and mapping the risk of childhood cancerMathematical Medicine and Biology: A Journal of the IMA, 1998