Behavioristic, Evidentialist, and Learning Models of Statistical Testing

1 December 1985

journal article
Published by Cambridge University Press (CUP) in Philosophy of Science

Vol. 52 (4) , 493-516
https://doi.org/10.1086/289272

Abstract

While orthodox (Neyman-Pearson) statistical tests enjoy widespread use in science, the philosophical controversy over their appropriateness for obtaining scientific knowledge remains unresolved. I shall suggest an explanation and a resolution of this controversy. The source of the controversy, I argue, is that orthodox tests are typically interpreted as rules for makingoptimal decisionsas to how tobehave–-where optimality is measured by the frequency of errors the test would commit in a long series of trials. Most philosophers of statistics, however, view the task of statistical methods as providing appropriate measures of theevidential-strengththat data affords hypotheses. Since tests appropriate for the behavioral-decision task fail to provide measures of evidential-strength, philosophers of statistics claim the use of orthodox tests in science is misleading and unjustified. What critics of orthodox tests overlook, I argue, is that the primary function of statistical tests in science is neither to decide how to behave nor to assign measures of evidential strength to hypotheses. Rather, tests provide a tool for using incomplete data tolearnabout the process that generated it. This they do, I show, by providing astandardfor distinguishing differences (between observed and hypothesized results) due to accidental or trivial errors from those due to systematic or substantively important discrepancies. I propose a reinterpretation of a commonly used orthodox test to make thislearning modelof tests explicit.

Keywords

This publication has 20 references indexed in Scilit:

Logic of Statistical Inference
Published by Cambridge University Press (CUP) ,2016
An objective theory of statistical testing
Synthese, 1983
Scientific Knowledge
Published by Springer Nature ,1981
The analogy between decision and inference
Synthese, 1977
The Neyman-Pearson theory as decision theory, and as inference theory; with a criticism of the Lindley-savage argument for Bayesian theory
Synthese, 1977
Inference, Method and Decision
Published by Springer Nature ,1977
Empirical Probability, Objective Statistical Methods, and Scientific Inquiry
Published by Springer Nature ,1976
The Logical Foundations of Statistical Interference
Published by Springer Nature ,1974
Bayesian statistics and biased procedures
Synthese, 1969
Introduction to Probability and Statistics from a Bayesian Viewpoint
Published by Cambridge University Press (CUP) ,1965