Estimating Diagnostic Test Accuracy Using a "Fuzzy Gold Standard"
- 1 February 1995
- journal article
- Published by SAGE Publications in Medical Decision Making
- Vol. 15 (1) , 44-57
- https://doi.org/10.1177/0272989x9501500108
Abstract
This study uses Monte Carlo methods to analyze the consequences of having a criterion standard ("gold standard") that contains some error when analyzing the accuracy of a diagnostic test using ROC curves. Two phenomena emerge: 1) When diagnostic test errors are statistically independent from inaccurate ("fuzzy") gold standard (FGS) errors, estimated test accuracy declines. 2) When the test and the FGS have statistically dependent errors, test accuracy can become overstated. Two methods are proposed to eliminate the first of these errors, exploring the risk of exacerbating the second. Both require a probabilistic (rather than binary) gold-standard statement (e.g., probability that each case is abnormal). The more promising of these, the "two-truth" method, selectively eliminates those cases where the gold standard is most ambiguous (probability near 0.5). When diagnostic test and FGS errors are independent, this approach can eliminate much of the downward bias caused by FGS error, without meaningful risk of overstating test accuracy. When the test and FGS have dependent errors, the resultant upward bias can cause test accuracy to be overstated, in the most extreme cases, even before the offsetting "two-truth" approach is employed. Key words: ROC curves; diagnostic test accuracy; technology assessment. (Med Decis Making 1995;15:44-57)Keywords
This publication has 21 references indexed in Scilit:
- The Accuracy of Magnetic Resonance Imaging in Patients With Suspected Multiple SclerosisJAMA, 1993
- Receiver Operator characteristic (ROC) Analysis without TruthMedical Decision Making, 1990
- Analyzing a Portion of the ROC CurveMedical Decision Making, 1989
- Estimation of test error rates, disease prevalence and relative risk from misclassified data: a reviewJournal of Clinical Epidemiology, 1988
- Evaluating Rapid Tests for Streptococcal PharyngitisMedical Decision Making, 1987
- The influence of uninterpretability on the assessment of diagnostic testsJournal of Chronic Diseases, 1986
- Assessment of Diagnostic TechnologiesInvestigative Radiology, 1985
- Evaluating Diagnostic TestsPublished by JSTOR ,1981
- Methodology for the assessment of new dichotomous diagnostic testsJournal of Chronic Diseases, 1981
- Problems of Spectrum and Bias in Evaluating the Efficacy of Diagnostic TestsNew England Journal of Medicine, 1978