Data Mining: Statistics and More?

1 May 1998

journal article
research article
Published by Taylor & Francis in The American Statistician

Vol. 52 (2) , 112-118
https://doi.org/10.1080/00031305.1998.10480549

Abstract

Data mining is a new discipline lying at the interface of statistics, database technology, pattern recognition, machine learning, and other areas. It is concerned with the secondary analysis of large databases in order to find previously unsuspected relationships which are of interest or value to the database owners. New problems arise, partly as a consequence of the sheer size of the data sets involved, and partly because of issues of pattern matching. However, since statistics provides the intellectual glue underlying the effort, it is important for statisticians to become involved. There are very real opportunities for statisticians to make significant contributions.

Keywords

This publication has 8 references indexed in Scilit:

Inference for Non-random Samples
Journal of the Royal Statistical Society Series B: Statistical Methodology, 1997
Bayesian Networks for Data Mining
Data Mining and Knowledge Discovery, 1997
Advanced Scout: Data Mining and Knowledge Discovery in NBA Data
Data Mining and Knowledge Discovery, 1997
Data Cube: A Relational Aggregation Operator Generalizing Group-By, Cross-Tab, and Sub-Totals
Data Mining and Knowledge Discovery, 1997
Editorial
Data Mining and Knowledge Discovery, 1997
A database perspective on knowledge discovery
Communications of the ACM, 1996
IDEA
ACM SIGMOD Record, 1996
Role of Models in Statistical Analysis
Statistical Science, 1990