Predicting query performance
Top Cited Papers
- 11 August 2002
- proceedings article
- Published by Association for Computing Machinery (ACM)
- p. 299-306
- https://doi.org/10.1145/564376.564429
Abstract
We develop a method for predicting query performance by computing the relative entropy between a query language model and the corresponding collection language model. The resulting clarity score measures the coherence of the language usage in documents whose models are likely to generate the query. We suggest that clarity scores measure the ambiguity of a query with respect to a collection of documents and show that they correlate positively with average precision in a variety of TREC test sets. Thus, the clarity score may be used to identify ineffective queries, on average, without relevance information. We develop an algorithm for automatically setting the clarity score threshold between predicted poorly-performing queries and acceptable queries and validate it using TREC data. In particular, we compare the automatic thresholds to optimum thresholds and also check how frequently results as good are achieved in sampling experiments that randomly assign queries to the two classes.Keywords
This publication has 12 references indexed in Scilit:
- Relevance based language modelsPublished by Association for Computing Machinery (ACM) ,2001
- Employing the resolution power of search keysJournal of the American Society for Information Science and Technology, 2001
- An information-theoretic approach to automatic query expansionACM Transactions on Information Systems, 2001
- A general language model for information retrieval (poster abstract)Published by Association for Computing Machinery (ACM) ,1999
- A language modeling approach to information retrievalPublished by Association for Computing Machinery (ACM) ,1998
- Selectional constraints: an information-theoretic model and its computational realizationCognition, 1996
- A new method of weighting query terms for ad-hoc retrievalPublished by Association for Computing Machinery (ACM) ,1996
- Viewing morphology as an inference processPublished by Association for Computing Machinery (ACM) ,1993
- An information-theoretic measure of term specificityJournal of the American Society for Information Science, 1992
- Monte Carlo MethodsPublished by Wiley ,1986