Classification of scientific documents by means of self‐generated groups employing free language
- 1 September 1973
- journal article
- Published by Wiley in Journal of the American Society for Information Science
- Vol. 24 (5) , 382-396
- https://doi.org/10.1002/asi.4630240510
Abstract
A study was undertaken to classify mechanically a document collection using the free‐language words in the titles and abstracts of a corpus of 261 physics research papers. Using a clustering algorithm, results were obtained which closely duplicated the clusters obtained by previous experiments with citations. A brief comparison is made with a traditional manual classification system. It is shown that the mechanical procedure is capable of achieving simultaneous average relevance and recall figures above 80%.Keywords
This publication has 5 references indexed in Scilit:
- Automatic classification and retrieval of documents by means of a bibliographic pattern discovery algorithmInformation Storage and Retrieval, 1971
- An Analysis of Some Graph Theoretical Cluster TechniquesJournal of the ACM, 1970
- A clustering experiment: First step towards a computer-generated classification schemeInformation Storage and Retrieval, 1968
- On Some Clustering TechniquesIBM Journal of Research and Development, 1964
- Bibliographic coupling between scientific papersAmerican Documentation, 1963