A client‐side Web agent for document categorization

1 December 1998

journal article
Published by Emerald Publishing in Internet Research

Vol. 8 (5) , 387-399
https://doi.org/10.1108/10662249810241257

Abstract

The authors propose a client‐side agent for exploring and categorizing documents on the World Wide Web. As the user browses the Web using a usual Web browser, this agent is designed to aid the user by classifying the documents the user finds most interesting into clusters. The agent carries out the task completely automatically and autonomously, with as little user intervention as the user desires. The principal novel components in this agent that make it possible are a scalable hierarchical clustering algorithm and a taxonomic label generator. In this paper, the overall architecture of this agent is described and the details of the algorithms within its key components are discussed.

Keywords

This publication has 18 references indexed in Scilit:

Customizable multi-engine search tool with clustering
Computer Networks and ISDN Systems, 1997
Finding salient features for personal Web page categories
Computer Networks and ISDN Systems, 1997
Syntactic clustering of the Web
Computer Networks and ISDN Systems, 1997
Learning to Teach through Action Research
Action in Teacher Education, 1997
Automatically organizing bookmarks per contents
Computer Networks and ISDN Systems, 1996
Using Linear Algebra for Intelligent Information Retrieval
SIAM Review, 1995
Acquiring recursive and iterative concepts with explanation-based learning
Machine Learning, 1990
Clustering Analysis and Its Applications
Published by Springer Nature ,1981
An algorithm for suffix stripping
Program: electronic library and information systems, 1980
Clustering Methodologies in Exploratory Data Analysis
Published by Elsevier ,1980