Clustering in sparse data and an analysis of rhabdomyosarcoma incidence

Abstract
Time series of epidemiologic events often contain periods of atypically low or high frequency. Correspondingly, for quite rare diseases there occur instances of long vacuous durations interrupted noticeably by periods of some disease activity. A recent community-based observation of the incidence of rhabdomyosarcoma (RMS), and an investigation of it, yielded sparse data of this general description. We introduce a combinatorial test for patchy time series and apply it to the RMS data. We comment on the prevalent practice of post hoc data analysis of alleged clusters, and on scale effects.

This publication has 10 references indexed in Scilit: