Abstract
It is conventionally thought that a smallp‐value confers high credibility on the observed alternative hypothesis, and that a repetition of the same experiment will have a high probability of resulting again in statistical significance. It is shown that if the observed difference is the true one, the probability of repeating a statistically significant result, the ‘replication probability’, is substantially lower than expected. The reason for this is a mistake that generates other seeming paradoxes: the interpretation of the post‐trialp‐value in the same way as the pre‐trial α error. The replication probability can be used as a frequentist counterpart of Bayesian and likelihood methods to show thatp‐values overstate the evidence against the null hypothesis.

This publication has 7 references indexed in Scilit: