Relevance-Based Language Models

Abstract
We explore the relation between classical probabilistic models of information retrieval and the emerging language modeling approaches. It has long been recognized that the primary obstacle to effective performance of classical models is the need to estimate a relevance model: the probabilities of words in the relevant class. We propose a novel technique for estimating these probabilities using the query alone. We demonstrate that our technique can produce highly accurate relevance models, addressing important notions of synonymy and polysemy. Our experiments show relevance models outperforming baseline language modeling systems on TREC retrieval and TDT tracking tasks. The main contribution of this work is an effective formal method for estimating a relevance model with no training data.
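To make the estimation idea concrete, the sketch below illustrates one way a relevance model can be built from the query alone in a pseudo-feedback setting: the term distributions of the top-retrieved documents are mixed together, each weighted by the likelihood that document assigns to the query. This is a minimal sketch, not the paper's exact procedure; the document list `docs`, the Dirichlet smoothing parameter `mu`, and the function names are illustrative assumptions.

```python
from collections import Counter

def relevance_model(query, docs, mu=2000):
    """Estimate P(w | R) from the query alone via pseudo-feedback.

    `docs` is a hypothetical list of token lists (top-ranked documents
    for the query); the result is a distribution over the vocabulary.
    """
    # Collection statistics used for Dirichlet smoothing of P(w | D).
    coll = Counter(w for d in docs for w in d)
    coll_len = sum(coll.values())

    def p_w_given_d(w, tf, dlen):
        # Dirichlet-smoothed document language model.
        return (tf.get(w, 0) + mu * coll[w] / coll_len) / (dlen + mu)

    rel = Counter()
    for d in docs:
        tf, dlen = Counter(d), len(d)
        # Query likelihood P(Q | D) under the document model.
        q_lik = 1.0
        for q in query:
            q_lik *= p_w_given_d(q, tf, dlen)
        # Each word is credited with P(w | D) weighted by P(Q | D).
        for w in coll:
            rel[w] += p_w_given_d(w, tf, dlen) * q_lik

    total = sum(rel.values())
    return {w: p / total for w, p in rel.items()}
```

For example, `relevance_model(["model", "relevance"], docs)` returns a normalized word distribution that could then stand in for the query model at retrieval time.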
