Probabilistic index: an intuitive non‐parametric approach to measuring the size of treatment effects
- 5 September 2005
- journal article
- research article
- Published by Wiley in Statistics in Medicine
- Vol. 25 (4) , 591-602
- https://doi.org/10.1002/sim.2256
Abstract
Effect sizes (ES) tell the magnitude of the difference between treatments and, ideally, should tell clinicians how likely their patients will benefit from the treatment. Currently used ES are expressed in statistical rather than in clinically useful terms and may not give clinicians the appropriate information. We restrict our discussion to studies with two groups: one with n patients receiving a new treatment and the other with m patients receiving the usual or no treatment. The standardized mean difference (e.g. Cohen's d) is a well‐known index for continuous outcomes. There is some intuitive value to d, but measuring improvement in standard deviations (SD) is a statistical concept that may not help a clinician. How much improvement is a half SD? A more intuitive and simple‐to‐calculate ES is the probability that the response of a patient given the new treatment (X) is better than the one for a randomly chosen patient given the old or no treatment (Y) (i.e. P(X > Y), larger values meaning better outcomes). This probability has an immediate identity with the area under the curve (AUC) measure in procedures for receiver operator characteristic (ROC) curve comparing responses to two treatments. It also can be easily calculated from the Mann–Whitney U, Wilcoxon, or Kendall τ statistics. We describe the characteristics of an ideal ES. We propose P(X > Y) as an alternative index, summarize its correspondence with well‐known non‐parametric statistics, compare it to the standardized mean difference index, and illustrate with clinical data. Copyright © 2005 John Wiley & Sons, Ltd.Keywords
This publication has 23 references indexed in Scilit:
- Editors Can Lead Researchers to Confidence Intervals, but Can't Make Them ThinkPsychological Science, 2004
- A History of Effect Size IndicesEducational and Psychological Measurement, 2002
- Effect-Size Estimates: Issues and Problems in InterpretationJournal of Consumer Research, 1996
- Probability of the superior outcome of one treatment over another.Journal of Applied Psychology, 1994
- Confidence Intervals for Probabilities and Tolerance Regions Based on a Generalization of the Mann-Whitney StatisticJournal of the American Statistical Association, 1990
- Confidence Intervals for Probabilities and Tolerance Regions Based on a Generalization of the Mann-Whitney StatisticJournal of the American Statistical Association, 1990
- Sample Size Determination for Some Common Nonparametric TestsJournal of the American Statistical Association, 1987
- The Interpretation of Rank CorrelationsJournal of the Royal Statistical Society Series C: Applied Statistics, 1975
- On Constructing Statistics and Reporting DataThe American Statistician, 1971
- The Estimation of Reliability from Stress-Strength RelationshipsTechnometrics, 1970