Estimating Diagnostic Test Accuracy Using a "Fuzzy Gold Standard"

1 February 1995

journal article
Published by SAGE Publications in Medical Decision Making

Vol. 15 (1) , 44-57
https://doi.org/10.1177/0272989x9501500108

Abstract

This study uses Monte Carlo methods to analyze the consequences of having a criterion standard ("gold standard") that contains some error when analyzing the accuracy of a diagnostic test using ROC curves. Two phenomena emerge: 1) When diagnostic test errors are statistically independent from inaccurate ("fuzzy") gold standard (FGS) errors, estimated test accuracy declines. 2) When the test and the FGS have statistically dependent errors, test accuracy can become overstated. Two methods are proposed to eliminate the first of these errors, exploring the risk of exacerbating the second. Both require a probabilistic (rather than binary) gold-standard statement (e.g., probability that each case is abnormal). The more promising of these, the "two-truth" method, selectively eliminates those cases where the gold standard is most ambiguous (probability near 0.5). When diagnostic test and FGS errors are independent, this approach can eliminate much of the downward bias caused by FGS error, without meaningful risk of overstating test accuracy. When the test and FGS have dependent errors, the resultant upward bias can cause test accuracy to be overstated, in the most extreme cases, even before the offsetting "two-truth" approach is employed. Key words: ROC curves; diagnostic test accuracy; technology assessment. (Med Decis Making 1995;15:44-57)

Keywords

This publication has 21 references indexed in Scilit:

The Accuracy of Magnetic Resonance Imaging in Patients With Suspected Multiple Sclerosis
JAMA, 1993
Receiver Operator characteristic (ROC) Analysis without Truth
Medical Decision Making, 1990
Analyzing a Portion of the ROC Curve
Medical Decision Making, 1989
Estimation of test error rates, disease prevalence and relative risk from misclassified data: a review
Journal of Clinical Epidemiology, 1988
Evaluating Rapid Tests for Streptococcal Pharyngitis
Medical Decision Making, 1987
The influence of uninterpretability on the assessment of diagnostic tests
Journal of Chronic Diseases, 1986
Assessment of Diagnostic Technologies
Investigative Radiology, 1985
Evaluating Diagnostic Tests
Published by JSTOR ,1981
Methodology for the assessment of new dichotomous diagnostic tests
Journal of Chronic Diseases, 1981
Problems of Spectrum and Bias in Evaluating the Efficacy of Diagnostic Tests
New England Journal of Medicine, 1978