Testing a Point Null Hypothesis: The Irreconcilability of P Values and Evidence
- 1 March 1987
- journal article
- research article
- Published by Taylor & Francis in Journal of the American Statistical Association
- Vol. 82 (397), 112-122
- https://doi.org/10.1080/01621459.1987.10478397
Abstract
The problem of testing a point null hypothesis (or a “small interval” null hypothesis) is considered. Of interest is the relationship between the P value (or observed significance level) and conditional and Bayesian measures of evidence against the null hypothesis. Although one might presume that a small P value indicates the presence of strong evidence against the null, such is not necessarily the case. Expanding on earlier work [especially Edwards, Lindman, and Savage (1963) and Dickey (1977)], it is shown that actual evidence against a null (as measured, say, by posterior probability or comparative likelihood) can differ by an order of magnitude from the P value. For instance, data that yield a P value of .05, when testing a normal mean, result in a posterior probability of the null of at least .30 for any objective prior distribution. (“Objective” here means that equal prior weight is given the two hypotheses and that the prior is symmetric and nonincreasing away from the null; other definitions of “objective” will be seen to yield qualitatively similar results.) The overall conclusion is that P values can be highly misleading measures of the evidence provided by the data against the null hypothesis.
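The numerical claim in the abstract can be illustrated with a rough sketch. It is not taken from the article; it assumes a normal mean with known variance, summarized by a standardized statistic z, and equal prior weight on the two hypotheses. It also relies on the standard fact that a symmetric prior that is nonincreasing away from the null is a mixture of symmetric uniform priors, so the lower bound on the Bayes factor can be searched over uniforms on (-k, k). All function names, the grid search, and its parameters (`k_max`, `steps`) are illustrative assumptions.

```python
import math


def normal_pdf(z):
    """Standard normal density."""
    return math.exp(-0.5 * z * z) / math.sqrt(2.0 * math.pi)


def normal_cdf(z):
    """Standard normal distribution function, via the error function."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def two_sided_p_value(z):
    """Two-sided P value for a standardized normal test statistic z."""
    return 2.0 * (1.0 - normal_cdf(abs(z)))


def posterior_null(bayes_factor, prior_null=0.5):
    """Posterior probability of the null, given the Bayes factor
    (null over alternative) and the prior probability of the null."""
    posterior_odds = bayes_factor * prior_null / (1.0 - prior_null)
    return posterior_odds / (1.0 + posterior_odds)


def min_bayes_factor_any_prior(z):
    """Smallest Bayes factor over all priors on the alternative:
    the alternative puts all its mass at the observed value."""
    return math.exp(-0.5 * z * z)


def min_bayes_factor_symmetric_uniform(z, k_max=20.0, steps=20000):
    """Smallest Bayes factor over uniform priors on (-k, k) centered at
    the null, found by a plain grid search over k (an assumption of this
    sketch, not a method from the article)."""
    z = abs(z)
    # As k -> 0 the marginal density tends to the null density itself.
    best_marginal = normal_pdf(z)
    for i in range(1, steps + 1):
        k = k_max * i / steps
        marginal = (normal_cdf(k - z) - normal_cdf(-k - z)) / (2.0 * k)
        best_marginal = max(best_marginal, marginal)
    return normal_pdf(z) / best_marginal


if __name__ == "__main__":
    z = 1.96  # corresponds to a two-sided P value of about .05
    print("P value:", round(two_sided_p_value(z), 4))
    print("posterior lower bound, any prior:",
          round(posterior_null(min_bayes_factor_any_prior(z)), 3))
    print("posterior lower bound, symmetric nonincreasing prior:",
          round(posterior_null(min_bayes_factor_symmetric_uniform(z)), 3))
```

Under these assumptions the last bound comes out at roughly .29 for z = 1.96, in line with the figure of about .30 quoted in the abstract, while the bound over all priors is much smaller but still well above .05.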
This publication has 24 references indexed in Scilit:
- Robust Bayes and Empirical Bayes Analysis with $\epsilon$-Contaminated Priors. The Annals of Statistics, 1986
- Statistical Decision Theory and Bayesian Analysis. Published by Springer Nature, 1985
- Clinical Trials and Statistical Verdicts: Probable Grounds for Appeal. Annals of Internal Medicine, 1983
- Is the Tail Area Useful as an Approximate Bayes Factor? Journal of the American Statistical Association, 1977
- Doing What Comes Naturally: Interpreting a Tail Area as a Posterior Probability or as a Likelihood Ratio. Journal of the American Statistical Association, 1973
- The Weighted Likelihood Ratio, Linear Hypotheses on Normal Location Parameters. The Annals of Mathematical Statistics, 1971
- The Weighted Likelihood Ratio, Sharp Hypotheses about Chances, the Order of a Markov Chain. The Annals of Mathematical Statistics, 1970
- Bayesian statistical inference for psychological research. Psychological Review, 1963
- Tests of Significance Considered as Evidence. Journal of the American Statistical Association, 1942
- Some Difficulties of Interpretation Encountered in the Application of the Chi-Square Test. Journal of the American Statistical Association, 1938