Generation and search of clustered files
- 1 December 1978
- journal article
- Published by Association for Computing Machinery (ACM) in ACM Transactions on Database Systems
- Vol. 3 (4), 321-346
- https://doi.org/10.1145/320289.320291
Abstract
A classified, or clustered file is one where related, or similar records are grouped into classes, or clusters of items in such a way that all items within a cluster are jointly retrievable. Clustered files are easily adapted to broad and narrow search strategies, and simple file updating methods are available. An inexpensive file clustering method applicable to large files is given together with appropriate file search methods. An abstract model is then introduced to predict the retrieval effectiveness of various search methods in a clustered file environment. Experimental evidence is included to test the versatility of the model and to demonstrate the role of various parameters in the cluster search process.
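The idea of broad and narrow searches over a clustered file can be illustrated with a minimal sketch. It is not the clustering or search procedure from the article: the cosine similarity measure, the averaged centroid, the toy records, and the names `cosine`, `centroid`, `cluster_search`, and `n_clusters` are all assumptions made for illustration.

```python
from math import sqrt


def cosine(a, b):
    """Cosine similarity between two sparse term-weight vectors (dicts)."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm_a = sqrt(sum(w * w for w in a.values()))
    norm_b = sqrt(sum(w * w for w in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def centroid(records):
    """Average the term weights of the records in one cluster (assumed profile)."""
    total = {}
    for rec in records:
        for term, weight in rec.items():
            total[term] = total.get(term, 0.0) + weight
    return {term: weight / len(records) for term, weight in total.items()}


def cluster_search(query, clusters, n_clusters=1):
    """Rank clusters by centroid similarity to the query, then return every
    record id in the top n_clusters clusters (all items of a cluster are
    retrieved jointly)."""
    ranked = sorted(
        clusters,
        key=lambda name: cosine(query, centroid([vec for _, vec in clusters[name]])),
        reverse=True,
    )
    hits = []
    for name in ranked[:n_clusters]:
        hits.extend(rec_id for rec_id, _ in clusters[name])
    return hits


# Hypothetical toy file: two clusters of two records each.
clusters = {
    "retrieval": [
        ("d1", {"cluster": 1.0, "search": 1.0}),
        ("d2", {"search": 1.0, "query": 1.0}),
    ],
    "storage": [
        ("d3", {"index": 1.0, "page": 1.0}),
        ("d4", {"page": 1.0, "memory": 1.0}),
    ],
}
query = {"search": 1.0}

narrow = cluster_search(query, clusters, n_clusters=1)  # best cluster only
broad = cluster_search(query, clusters, n_clusters=2)   # widen to two clusters
print(narrow)
print(broad)
```

In this sketch, raising `n_clusters` widens the search: more clusters are examined and more records are returned, at the cost of including clusters that match the query less closely.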
This publication has 14 references indexed in Scilit:
- A file organization and maintenance procedure for dynamic document collections. Information Processing & Management, 1975
- Attribute based file organization in a paged memory environment. Communications of the ACM, 1974
- An evaluation of query expansion by the addition of clustered terms for a document retrieval system. Information Storage and Retrieval, 1972
- Organization and maintenance of large ordered indexes. Acta Informatica, 1972
- The use of hierarchic clustering in information retrieval. Information Storage and Retrieval, 1971
- An Analysis of Some Graph Theoretical Cluster Techniques. Journal of the ACM, 1970
- Semantic Clustering of Index Terms. Journal of the ACM, 1968
- A clustering experiment: First step towards a computer-generated classification scheme. Information Storage and Retrieval, 1968
- Automatic Document Classification. Journal of the ACM, 1963
- Information Retrieval Based upon Latent Class Analysis. Journal of the ACM, 1962