Semantic term matching in axiomatic approaches to information retrieval
- 6 August 2006
- proceedings article
- Published by Association for Computing Machinery (ACM)
- pp. 115-122
- https://doi.org/10.1145/1148170.1148193
Abstract
A common limitation of many retrieval models, including the recently proposed axiomatic approaches, is that retrieval scores are based solely on exact (i.e., syntactic) matching of terms in the queries and documents, without allowing distinct but semantically related terms to match each other and contribute to the retrieval score. In this paper, we show that semantic term matching can be naturally incorporated into the axiomatic retrieval model by defining the primitive weighting function in terms of a semantic similarity function of terms. We define several desirable retrieval constraints for semantic term matching and use these constraints to extend the axiomatic model to directly support semantic term matching based on the mutual information of terms computed on some document set. We show that such an extension can be efficiently implemented as query expansion. Experimental results on several representative data sets show that, with mutual information computed over the documents in either the target collection for retrieval or an external collection such as the Web, our semantic expansion consistently and substantially improves retrieval accuracy over the baseline axiomatic retrieval model. As a pseudo-feedback method, our method also outperforms a state-of-the-art language modeling feedback method.
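The abstract describes semantic term matching driven by the mutual information of terms over a document set and realized as query expansion. The following is a minimal sketch of that general idea, not the paper's actual weighting function: it assumes whitespace tokenization, treats each term as a binary present/absent variable per document, and keeps only positively associated terms. The names `term_doc_sets`, `mutual_information`, and `expand_query`, and the parameter `k`, are hypothetical and chosen for illustration.

```python
import math
from collections import defaultdict


def term_doc_sets(docs):
    """Map each term to the set of document indices that contain it."""
    index = defaultdict(set)
    for i, doc in enumerate(docs):
        for term in doc.lower().split():  # assumption: whitespace tokenization
            index[term].add(i)
    return dict(index)


def mutual_information(index, n_docs, s, t):
    """Mutual information (in bits) between the presence of terms s and t."""
    ds, dt = index.get(s, set()), index.get(t, set())
    n11 = len(ds & dt)            # documents containing both terms
    n10 = len(ds) - n11           # s only
    n01 = len(dt) - n11           # t only
    n00 = n_docs - n11 - n10 - n01  # neither
    cells = (
        (n11, len(ds), len(dt)),
        (n10, len(ds), n_docs - len(dt)),
        (n01, n_docs - len(ds), len(dt)),
        (n00, n_docs - len(ds), n_docs - len(dt)),
    )
    mi = 0.0
    for joint, marg_s, marg_t in cells:
        if joint > 0:  # empty cells contribute nothing
            mi += (joint / n_docs) * math.log2(joint * n_docs / (marg_s * marg_t))
    return mi


def expand_query(query_terms, docs, k=2):
    """Append up to k terms with the highest mutual information to any query term."""
    index = term_doc_sets(docs)
    n = len(docs)
    scores = {}
    for q in query_terms:
        dq = index.get(q, set())
        for t, dt in index.items():
            if t in query_terms:
                continue
            # Assumption of this sketch: skip terms that co-occur with q no more
            # often than chance, since mutual information is also high for
            # terms that systematically avoid each other.
            if len(dq & dt) * n <= len(dq) * len(dt):
                continue
            mi = mutual_information(index, n, q, t)
            scores[t] = max(scores.get(t, 0.0), mi)
    ranked = sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))
    return list(query_terms) + [t for t, _ in ranked[:k]]
```

On the toy collection `["car auto", "car auto", "tree leaf", "tree leaf"]`, calling `expand_query(["car"], docs, k=1)` returns `["car", "auto"]`. A real system would also weight the added terms rather than treat them as equal to the original query terms; this sketch leaves that out.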