Spectrum bias: a quantitative and graphical analysis of the variability of medical diagnostic test performance
- 29 October 2003
- journal article
- research article
- Published by Wiley in Statistics in Medicine
- Vol. 23 (1) , 125-135
- https://doi.org/10.1002/sim.1591
Abstract
A Correction to this article has been published in Statistics in Medicine 2005, 24:(11) 1782. The performance of a medical diagnostic test may vary in a subgroup of patients according to the severity and clinical presentation of the disease. This phenomenon is often referred to as the spectrum effect. There is currently no technique to determine in which situations this spectrum effect may lead to a spectrum bias, that is, a distortion of the posterior probability which can potentially affect the clinical decision. We show that spectrum bias, on either a positive or a negative test result, can be expressed as the subgroup‐specific likelihood ratio (LR) divided by the LR in the overall population of patients. This assessment of spectrum bias is independent of the pretest probability. We present here the test statistic and its variance and also propose a mode of graphically visualizing the presence of spectrum bias. We applied it to 15 examples from the literature, of which three are discussed in detail. In two examples, there was spectrum bias: (1) diagnosis of urinary tract infection in subgroups of low or high risk patients when using the dipstick test, and (2) diagnosis of left ventricular hypertrophy using electrocardiography in subjects with different body mass index. In a third example, there was no spectrum bias for the diagnosis of coronary heart disease in subgroups of patients defined by age, sex or comorbidities when using exercise electrocardiography. As sensitivity and specificity usually vary in opposite directions from one subgroup to the other, the ratio of LRs tends to remain constant, resulting in no or little spectrum bias. We conclude that spectrum effect is more common than spectrum bias and that clinical decisions are affected only under certain conditions. Copyright © 2003 John Wiley & Sons, Ltd.Keywords
This publication has 21 references indexed in Scilit:
- Limitations of Sensitivity, Specificity, Likelihood Ratio, and Bayesʼ Theorem in Assessing Diagnostic ProbabilitiesEpidemiology, 1997
- Comparison of the sensitivity and specificity of exercise electrocardiography in biased and unbiased populations of men and womenAmerican Heart Journal, 1995
- Diagnostic tests for deep vein thrombosis. Clinical usefulness depends on probability of diseaseArchives of internal medicine (1960), 1994
- Uncertainty in BayesMedical Decision Making, 1994
- The spectrum of patients strongly influences the usefulness of diagnostic tests for pneumoniaScandinavian Journal of Primary Health Care, 1993
- Comparison of mammary serum antigen (MSA) with β2-microglobulin (β2M) and carcinoembryonic antigen (CEA) assays in patients with breast cancerEuropean Journal of Cancer and Clinical Oncology, 1988
- Factors affecting sensitivity and specificity of a diagnostic test: the exercise thallium scintigramThe American Journal of Medicine, 1988
- The Threshold Approach to Clinical Decision MakingNew England Journal of Medicine, 1980
- Exercise Stress TestingNew England Journal of Medicine, 1979
- Problems of Spectrum and Bias in Evaluating the Efficacy of Diagnostic TestsNew England Journal of Medicine, 1978