A client‐side Web agent for document categorization
- 1 December 1998
- journal article
- Published by Emerald Publishing in Internet Research
- Vol. 8 (5) , 387-399
- https://doi.org/10.1108/10662249810241257
Abstract
The authors propose a client‐side agent for exploring and categorizing documents on the World Wide Web. As the user browses the Web using a usual Web browser, this agent is designed to aid the user by classifying the documents the user finds most interesting into clusters. The agent carries out the task completely automatically and autonomously, with as little user intervention as the user desires. The principal novel components in this agent that make it possible are a scalable hierarchical clustering algorithm and a taxonomic label generator. In this paper, the overall architecture of this agent is described and the details of the algorithms within its key components are discussed.Keywords
This publication has 18 references indexed in Scilit:
- Customizable multi-engine search tool with clusteringComputer Networks and ISDN Systems, 1997
- Finding salient features for personal Web page categoriesComputer Networks and ISDN Systems, 1997
- Syntactic clustering of the WebComputer Networks and ISDN Systems, 1997
- Learning to Teach through Action ResearchAction in Teacher Education, 1997
- Automatically organizing bookmarks per contentsComputer Networks and ISDN Systems, 1996
- Using Linear Algebra for Intelligent Information RetrievalSIAM Review, 1995
- Acquiring recursive and iterative concepts with explanation-based learningMachine Learning, 1990
- Clustering Analysis and Its ApplicationsPublished by Springer Nature ,1981
- An algorithm for suffix strippingProgram: electronic library and information systems, 1980
- Clustering Methodologies in Exploratory Data AnalysisPublished by Elsevier ,1980