Data Mining: Statistics and More?
- 1 May 1998
- journal article
- research article
- Published by Taylor & Francis in The American Statistician
- Vol. 52 (2) , 112-118
- https://doi.org/10.1080/00031305.1998.10480549
Abstract
Data mining is a new discipline lying at the interface of statistics, database technology, pattern recognition, machine learning, and other areas. It is concerned with the secondary analysis of large databases in order to find previously unsuspected relationships which are of interest or value to the database owners. New problems arise, partly as a consequence of the sheer size of the data sets involved, and partly because of issues of pattern matching. However, since statistics provides the intellectual glue underlying the effort, it is important for statisticians to become involved. There are very real opportunities for statisticians to make significant contributions.Keywords
This publication has 8 references indexed in Scilit:
- Inference for Non-random SamplesJournal of the Royal Statistical Society Series B: Statistical Methodology, 1997
- Bayesian Networks for Data MiningData Mining and Knowledge Discovery, 1997
- Advanced Scout: Data Mining and Knowledge Discovery in NBA DataData Mining and Knowledge Discovery, 1997
- Data Cube: A Relational Aggregation Operator Generalizing Group-By, Cross-Tab, and Sub-TotalsData Mining and Knowledge Discovery, 1997
- EditorialData Mining and Knowledge Discovery, 1997
- A database perspective on knowledge discoveryCommunications of the ACM, 1996
- IDEAACM SIGMOD Record, 1996
- Role of Models in Statistical AnalysisStatistical Science, 1990