Areas beneath the relative operating characteristics (ROC) and relative operating levels (ROL) curves: Statistical significance and interpretation
Top Cited Papers
- 1 July 2002
- journal article
- research article
- Published by Wiley in Quarterly Journal of the Royal Meteorological Society
- Vol. 128 (584) , 2145-2166
- https://doi.org/10.1256/003590002320603584
Abstract
The areas beneath the relative (or receiver) operating characteristics (ROC) and relative operating levels (ROL) curves can be used as summary measures of forecast quality, but statistical significance tests for these areas are conducted infrequently in the atmospheric sciences. A development of signal‐detection theory, the ROC curve has been widely applied in the medical and psychology fields where significance tests and relationships to other common statistical methods have been established and described. This valuable literature appears to be largely unknown to the atmospheric sciences where applications of ROC and related techniques are becoming more common. This paper presents a survey of that literature with a focus on the interpretation of the ROC area in the field of forecast verification. We extend these foundations to demonstrate that similar principles can be applied to the interpretation and significance testing of the ROL area. It is shown that the ROC area is equivalent to the Mann–Whitney U‐statistic testing the significance of forecast event probabilities for cases where events actually occurred with those where events did not occur. A similar derivation shows that the ROL area is equivalent to the Mann–Whitney U‐statistic testing the magnitude of events with respect to whether or not an event has been forecast. Because the Mann–Whitney U‐statistic follows a known probability distribution, under certain assumptions it can be used to define the statistical significance of ROC and ROL areas and for comparing the areas of competing forecasts. For large samples the significance of either measure can be accurately assessed using a normal‐distribution approximation. Copyright © 2002 Royal Meteorological SocietyKeywords
This publication has 67 references indexed in Scilit:
- A Dynamical Approach to Seasonal Prediction of Atlantic Tropical Cyclone ActivityWeather and Forecasting, 2001
- Quantitative Precipitation Forecasts over the United States by the ECMWF Ensemble Prediction SystemMonthly Weather Review, 2001
- Impact of model resolution and ensemble size on the performance of an Ensemble Prediction SystemQuarterly Journal of the Royal Meteorological Society, 1998
- Nonparametric and semiparametric estimation of the receiver operating characteristic curveThe Annals of Statistics, 1996
- The Application of Signal Detection Theory to Weather Forecasting BehaviorMonthly Weather Review, 1992
- ROC curves and measures of discrimination accuracy: A reply to Swets.Psychological Bulletin, 1986
- Basic principles of ROC analysisSeminars in Nuclear Medicine, 1978
- The area above the ordinal dominance graph and the area below the receiver operating characteristic graphJournal of Mathematical Psychology, 1975
- The Relative Operating Characteristic in PsychologyScience, 1973
- An Approximation to the Wilcoxon-Mann-Whitney DistributionJournal of the American Statistical Association, 1969