The Philosophy of Exploratory Data Analysis
- 1 June 1983
- journal article
- research article
- Published by Cambridge University Press (CUP) in Philosophy of Science
- Vol. 50 (2) , 283-295
- https://doi.org/10.1086/289110
Abstract
This paper attempts to define Exploratory Data Analysis (EDA) more precisely than usual, and to produce the beginnings of a philosophy of this topical and somewhat novel branch of statistics.Adata setis, roughly speaking, a collection ofk-tuples for somek.In both descriptive statistics and in EDA, thesek-tuples, or functions of them, are represented in a manner matched to human and computer abilities with a view to finding patterns that are not “kinkera”. Akinkusis a pattern that has a negligible probability of being even partly potentially explicable. A potentially explicable pattern is one for which there probably exists a hypothesis of adequate “explicativity”, which is another technical probabilistic concept. A pattern can be judged to be probably potentially explicable even if we cannot find an explanation. The theory of probability understood here is one of partially ordered (interval-valued), subjective (personal) probabilities. Among other topics relevant to a philosophy of EDA are the “reduction” of data; Francis Bacon's philosophy of science; the automatic formulation of hypotheses; successive deepening of hypotheses; neurophysiology; and rationality of type II.Keywords
This publication has 8 references indexed in Scilit:
- An Empirical Inquiry concerning Human Understanding of Two-Variable Color MapsThe American Statistician, 1980
- Patterns in Residuals in the Two-Way LayoutTechnometrics, 1978
- Does an Observed Sequence of Numbers Follow a Simple Rule? (Another Look at Bode's Law)Journal of the American Statistical Association, 1971
- Topics in the Investigation of Linear Relations Fitted by the Method of Least SquaresJournal of the Royal Statistical Society Series B: Statistical Methodology, 1967
- Maximum Entropy for Hypothesis Formulation, Especially for Multidimensional Contingency TablesThe Annals of Mathematical Statistics, 1963
- Residual AnalysisJournal of the American Statistical Association, 1961
- The Interaction Algorithm and Practical Fourier AnalysisJournal of the Royal Statistical Society Series B: Statistical Methodology, 1958
- Information Theory and Statistical MechanicsPhysical Review B, 1957