Classification of scientific documents by means of self‐generated groups employing free language

1 September 1973

journal article
Published by Wiley in Journal of the American Society for Information Science

Vol. 24 (5) , 382-396
https://doi.org/10.1002/asi.4630240510

Abstract

A study was undertaken to classify mechanically a document collection using the free‐language words in the titles and abstracts of a corpus of 261 physics research papers. Using a clustering algorithm, results were obtained which closely duplicated the clusters obtained by previous experiments with citations. A brief comparison is made with a traditional manual classification system. It is shown that the mechanical procedure is capable of achieving simultaneous average relevance and recall figures above 80%.

Keywords

This publication has 5 references indexed in Scilit:

Automatic classification and retrieval of documents by means of a bibliographic pattern discovery algorithm
Information Storage and Retrieval, 1971
An Analysis of Some Graph Theoretical Cluster Techniques
Journal of the ACM, 1970
A clustering experiment: First step towards a computer-generated classification scheme
Information Storage and Retrieval, 1968
On Some Clustering Techniques
IBM Journal of Research and Development, 1964
Bibliographic coupling between scientific papers
American Documentation, 1963