Web mining: information and pattern discovery on the World Wide Web
Top Cited Papers
- 22 November 2002
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- p. 558-567
- https://doi.org/10.1109/tai.1997.632303
Abstract
Application of data mining techniques to the World Wide Web, referred to as Web mining, has been the focus of several recent research projects and papers. However, there is no established vocabulary, leading to confusion when comparing research efforts. The term Web mining has been used in two distinct ways. The first, called Web content mining in this paper, is the process of information discovery from sources across the World Wide Web. The second, called Web usage mining, is the process of mining for user browsing and access patterns. We define Web mining and present an overview of the various research issues, techniques, and development efforts. We briefly describe WEBMINER, a system for Web usage mining, and conclude the paper by listing research issues.Keywords
This publication has 23 references indexed in Scilit:
- A declarative language for querying and restructuring the WebPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Information gathering in the World-Wide WebACM Transactions on Database Systems, 1998
- ParaSite: mining structural information on the WebComputer Networks and ISDN Systems, 1997
- Finding salient features for personal Web page categoriesComputer Networks and ISDN Systems, 1997
- Implementing data cubes efficientlyACM SIGMOD Record, 1996
- Automatically organizing bookmarks per contentsComputer Networks and ISDN Systems, 1996
- HyPursuitPublished by Association for Computing Machinery (ACM) ,1996
- Myriad: Design and implementation of a federated database prototypeSoftware: Practice and Experience, 1995
- ALIWEB - Archie-like indexing in the WEBComputer Networks and ISDN Systems, 1994
- Data-driven discovery of quantitative rules in relational databasesIEEE Transactions on Knowledge and Data Engineering, 1993