Geostatistical analysis of disease data: estimation of cancer mortality risk from empirical frequencies using Poisson kriging

Open Access

14 December 2005

journal article
research article
Published by Springer Nature in International Journal of Health Geographics

Vol. 4 (1) , 31
https://doi.org/10.1186/1476-072x-4-31

Abstract

Background: Cancer mortality maps are used by public health officials to identify areas of excess and to guide surveillance and control activities. Quality of decision-making thus relies on an accurate quantification of risks from observed rates which can be very unreliable when computed from sparsely populated geographical units or recorded for minority populations. This paper presents a geostatistical methodology that accounts for spatially varying population sizes and spatial patterns in the processing of cancer mortality data. Simulation studies are conducted to compare the performances of Poisson kriging to a few simple smoothers (i.e. population-weighted estimators and empirical Bayes smoothers) under different scenarios for the disease frequency, the population size, and the spatial pattern of risk. A public-domain executable with example datasets is provided.Results: The analysis of age-adjusted mortality rates for breast and cervix cancers illustrated some key features of commonly used smoothing techniques. Because of the small weight assigned to the rate observed over the entity being smoothed (kernel weight), the population-weighted average leads to risk maps that show little variability. Other techniques assign larger and similar kernel weights but they use a different piece of auxiliary information in the prediction: global or local means for global or local empirical Bayes smoothers, and spatial combination of surrounding rates for the geostatistical estimator. Simulation studies indicated that Poisson kriging outperforms other approaches for most scenarios, with a clear benefit when the risk values are spatially correlated. Global empirical Bayes smoothers provide more accurate predictions under the least frequent scenario of spatially random risk.Conclusion: The approach presented in this paper enables researchers to incorporate the pattern of spatial dependence of mortality rates into the mapping of risk values and the quantification of the associated uncertainty, while being easier to implement than a full Bayesian model. The availability of a public-domain executable makes the geostatistical analysis of health data, and its comparison to traditional smoothers, more accessible to common users. In future papers this methodology will be generalized to the simulation of the spatial distribution of risk values and the propagation of the uncertainty attached to predicted risks in local cluster analysis.

Keywords

This publication has 39 references indexed in Scilit:

Exploring Scale‐Dependent Correlations Between Cancer Mortality Rates Using Factorial Kriging and Population‐Weighted Semivariograms
Geographical Analysis, 2005
A Geostatistical Framework for Area‐to‐Point Spatial Interpolation
Geographical Analysis, 2004
Interpreting Posterior Relative Risk Estimates in Disease-Mapping Studies
Environmental Health Perspectives, 2004
Evaluation of Methods for Classifying Epidemiological Data on Choropleth Maps in Series
Annals of the American Association of Geographers, 2002
Combining Incompatible Spatial Data
Journal of the American Statistical Association, 2002
Disease map reconstruction
Statistics in Medicine, 2001
Geostatistics: Modeling Spatial Uncertainty
Journal of the American Statistical Association, 2000
Binomial cokriging for estimating and mapping the risk of childhood cancer
Mathematical Medicine and Biology: A Journal of the IMA, 1998