Relevance based language models
- 1 September 2001
- proceedings article
- Published by Association for Computing Machinery (ACM)
- p. 120-127
- https://doi.org/10.1145/383952.383972
Abstract
We explore the relation between classical probabilistic models of information retrieval and the emerging language modeling approaches. It has long been recognized that the primary obstacle to effective performance of classical models is the need to estimate a relevance model: probabilities of words in the relevant class. We propose a novel technique for estimating these probabilities using the query alone. We demonstrate that our technique can produce highly accurate relevance models, addressing important notions of synonymy and polysemy. Our experiments show relevance models outperforming baseline language modeling systems on TREC retrieval and TDT tracking tasks. The main contribution of this work is an effective formal method for estimating a relevance model with no training data.
This publication has 12 references indexed in Scilit:
- Bridging the lexical chasm. Published by Association for Computing Machinery (ACM), 2000
- OCELOT. Published by Association for Computing Machinery (ACM), 2000
- Improving the effectiveness of information retrieval with local context analysis. ACM Transactions on Information Systems, 2000
- A general language model for information retrieval (poster abstract). Published by Association for Computing Machinery (ACM), 1999
- Information retrieval as statistical translation. Published by Association for Computing Machinery (ACM), 1999
- A hidden Markov model information retrieval system. Published by Association for Computing Machinery (ACM), 1999
- A language modeling approach to information retrieval. Published by Association for Computing Machinery (ACM), 1998
- On-line new event detection and tracking. Published by Association for Computing Machinery (ACM), 1998
- An empirical study of smoothing techniques for language modeling. Published by Association for Computational Linguistics (ACL), 1996
- A theoretical basis for the use of co‐occurrence data in information retrieval. Journal of Documentation, 1977