Power of the scan statistic for detection of clustering
- 1 October 1993
- journal article
- research article
- Published by Wiley in Statistics in Medicine
- Vol. 12 (19-20) , 1829-1843
- https://doi.org/10.1002/sim.4780121910
Abstract
The scan statistic is the maximum number of events in an interval of fixed length w as the subinterval moves over the entire time frame. Previous research derived the null distribution of the scan statistic under the conditional model which assumed that the total number of events was fixed, and under the unconditional model which let the total number of events be a random variable. This paper derives approximations for the power of the scan test for a pulse alternative. Under this alternative, the relative risk of disease on a subinterval )τ, τ + w(, τ unknown, is θ‐fold as high as it is for other subintervals of length w. Two sets of approximations are given for each model. The first approximation gives highly accurate results, but requires use of a personal computer. The second procedure can be performed on a hand‐held calculator and appears very accurate for the cases examined.Keywords
This publication has 17 references indexed in Scilit:
- Some Statistical Problems in the Assessment of Inhomogeneities of DNA Sequence DataJournal of the American Statistical Association, 1991
- Approximations and Bounds for the Distribution of the Scan StatisticJournal of the American Statistical Association, 1989
- A Useful Upper Bound for the Tail Probabilities of the Scan Statistic When the Sample Size is LargeJournal of the American Statistical Association, 1985
- On the Distributions of Scan StatisticsJournal of the American Statistical Association, 1984
- Approximations for Distributions of Scan StatisticsJournal of the American Statistical Association, 1982
- A Generalization of the Karlin-McGregor Theorem on Coincidence Probabilities and an Application to ClusteringThe Annals of Probability, 1977
- Probabilities for the Size of Largest Clusters and Smallest IntervalsJournal of the American Statistical Association, 1974
- Probabilities for a $k$th Nearest Neighbor Problem on the LineThe Annals of Probability, 1973
- Some Probabilities, Expectations and Variances for the Size of Largest Clusters and Smallest IntervalsJournal of the American Statistical Association, 1966
- The Distribution of the Size of the Maximum Cluster of Points on a LineJournal of the American Statistical Association, 1965