A clustered search algorithm incorporating arbitrary term dependencies
- 1 September 1982
- journal article
- Published by Association for Computing Machinery (ACM) in ACM Transactions on Database Systems
- Vol. 7 (3) , 500-508
- https://doi.org/10.1145/319732.319756
Abstract
The documents in a database are organized into clusters, where each cluster contains similar documents and a representative of these documents. A user query is compared with all the representatives of the clusters, and on the basis of such comparisons, those clusters having many close neighbors with respect to the query are selected for searching. This paper presents an estimation of the number of close neighbors in a cluster in relation to the given query. The estimation takes into consideration the dependencies between terms. It is demonstrated by experiments that the estimate is accurate and the time to generate the estimate is small.Keywords
This publication has 7 references indexed in Scilit:
- On the Construction of Feedback QueriesJournal of the ACM, 1982
- A model of cluster searching based on classificationInformation Systems, 1980
- AN EVALUATION OF FEEDBACK IN DOCUMENT RETRIEVAL USING CO‐OCCURRENCE DATAJournal of Documentation, 1978
- On the estimation of the number of desired records with respect to a given queryACM Transactions on Database Systems, 1978
- Evaluation of information retrieval systems: A decision theory approachJournal of the American Society for Information Science, 1978
- A THEORETICAL BASIS FOR THE USE OF CO‐OCCURRENCE DATA IN INFORMATION RETRIEVALJournal of Documentation, 1977
- The Association Factor in Information RetrievalJournal of the ACM, 1961