Information retrieval using word senses
- 25 July 2004
- conference paper
- Published by Association for Computing Machinery (ACM)
- p. 258-265
- https://doi.org/10.1145/1008992.1009038
Abstract
Information retrieval using word senses is emerging as a good research challenge on semantic information retrieval. In this paper, we propose a new method using word senses in information retrieval: root sense tagging method. This method assigns coarse-grained word senses defined in WordNet to query terms and document terms by unsupervised way using co-occurrence information constructed automatically. Our sense tagger is crude, but performs consistent disambiguation by considering only the single most informative word as evidence to disambiguate the target word. We also allow multiple-sense assignment to alleviate the problem caused by incorrect disambiguation.Experimental results on a large-scale TREC collection show that our approach to improve retrieval effectiveness is successful, while most of the previous work failed to improve performances even on small text collection. Our method also shows promising results when is combined with pseudo relevance feedback and state-of-the-art retrieval function such as BM25.Keywords
This publication has 9 references indexed in Scilit:
- Word sense disambiguation in information retrieval revisitedPublished by Association for Computing Machinery (ACM) ,2003
- A probabilistic model of information retrieval: development and comparative experimentsInformation Processing & Management, 2000
- Retrieving with Good SenseInformation Retrieval Journal, 2000
- The impact on retrieval effectiveness of skewed frequency distributionsACM Transactions on Information Systems, 1999
- Probabilistic latent semantic indexingPublished by Association for Computing Machinery (ACM) ,1999
- Unsupervised word sense disambiguation rivaling supervised methodsPublished by Association for Computational Linguistics (ACL) ,1995
- Using WordNet to disambiguate word senses for text retrievalPublished by Association for Computing Machinery (ACM) ,1993
- Lexical ambiguity and information retrievalACM Transactions on Information Systems, 1992
- Indexing by latent semantic analysisJournal of the American Society for Information Science, 1990