Sample size calculations for comparative studies of medical tests for detecting presence of disease
- 19 February 2002
- journal article
- research article
- Published by Wiley in Statistics in Medicine
- Vol. 21 (6) , 835-852
- https://doi.org/10.1002/sim.1058
Abstract
Technologic advances give rise to new tests for detecting disease in many fields, including cancer and sexually transmitted disease. Before a new disease screening test is approved for public use, its accuracy should be shown to be better than or at least not inferior to an existing test. Standards do not yet exist for designing and analysing studies to address this issue. Established principles for the design of therapeutic studies can be adapted for studies of screening tests. In particular, drawing upon methods for superiority and non‐inferiority studies of therapeutic agents, we propose that confidence intervals for the relative accuracy of dichotomous tests drive the design of comparative studies of disease screening tests. We derive sample size formulae for a variety of designs, including studies where patients undergo several tests and studies where patients receive only one of the tests under evaluation. Both cohort and case‐control study designs are considered. Modifications to the confidence intervals and sample size formulae are discussed to accommodate studies where, because of the invasive nature of definitive testing, true disease status can only be obtained for subjects who are positive on one or more of the screening tests. The methods proposed are applied to a study comparing a modified pap test to the conventional pap for cervical cancer screening. The impact of error in the gold standard reference test on the design and evaluation of comparative screening test studies is also discussed. Copyright © 2002 John Wiley & Sons, Ltd.
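The abstract does not reproduce the paper's formulae, so the following is only a minimal sketch of the general idea, assuming standard delta-method (log-scale Wald) approximations rather than the authors' exact expressions. It shows a confidence interval for the relative true positive fraction, rTPF = TPF_A / TPF_B, when diseased subjects receive both tests, and a sample size for a design where each diseased subject receives only one test. The function names, argument names, and the non-inferiority margin `delta0` are hypothetical and chosen for illustration.

```python
import math
from statistics import NormalDist


def paired_rtpf_ci(both_pos, only_a_pos, only_b_pos, level=0.95):
    """Log-scale Wald CI for rTPF = TPF_A / TPF_B in a paired design.

    Counts are taken among diseased subjects only (illustrative names):
    both_pos   -- positive on test A and on test B
    only_a_pos -- positive on A, negative on B
    only_b_pos -- positive on B, negative on A
    Assumes each test has at least one positive result.
    """
    a, b, c = both_pos, only_a_pos, only_b_pos
    rtpf = (a + b) / (a + c)
    # Delta-method variance of log(rTPF) under multinomial sampling.
    se_log = math.sqrt((b + c) / ((a + b) * (a + c)))
    z = NormalDist().inv_cdf(1 - (1 - level) / 2)
    return rtpf, rtpf * math.exp(-z * se_log), rtpf * math.exp(z * se_log)


def unpaired_n_diseased_per_arm(tpf_a, tpf_b, delta0, alpha=0.05, power=0.8):
    """Diseased subjects needed per arm when each subject gets one test.

    Sized so that the lower one-sided (1 - alpha) confidence limit for
    rTPF exceeds the margin delta0 with the requested power, assuming
    the true fractions are tpf_a and tpf_b (all inputs are assumptions).
    """
    gamma = tpf_a / tpf_b  # anticipated true rTPF
    if gamma <= delta0:
        raise ValueError("anticipated rTPF must exceed the margin delta0")
    z_alpha = NormalDist().inv_cdf(1 - alpha)
    z_beta = NormalDist().inv_cdf(power)
    # Per-subject variance of log(rTPF) with independent arms.
    unit_var = (1 - tpf_a) / tpf_a + (1 - tpf_b) / tpf_b
    n = ((z_alpha + z_beta) / math.log(gamma / delta0)) ** 2 * unit_var
    return math.ceil(n)
```

For instance, `unpaired_n_diseased_per_arm(0.8, 0.8, delta0=0.9)` sizes a non-inferiority comparison in which the new test is assumed equally sensitive and a 10% relative loss is the largest tolerated. Working on the log scale keeps the interval for a ratio positive and makes the superiority case (`delta0=1`) and the non-inferiority case (`delta0<1`) share one expression.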