The impact on retrieval effectiveness of skewed frequency distributions
- 1 October 1999
- journal article
- Published by Association for Computing Machinery (ACM) in ACM Transactions on Information Systems
- Vol. 17 (4), 440-465
- https://doi.org/10.1145/326440.326447
Abstract
We present an analysis of word senses that provides a fresh insight into the impact of word ambiguity on retrieval effectiveness, with potentially broader implications for other processes of information retrieval. Using a methodology of forming artificially ambiguous words, known as pseudowords, and through reference to other researchers' work, the analysis illustrates that the distribution of the frequency of occurrence of the senses of a word plays a strong role in ambiguity's impact on effectiveness. Further investigation shows that this analysis may also be applicable to other processes of retrieval, such as Cross Language Information Retrieval, query expansion, retrieval of OCR'ed texts, and stemming. The analysis appears to provide a means of explaining, at least in part, reasons for the processes' impact (or lack of it) on effectiveness.
This publication has 11 references indexed in Scilit:
- Corpus-based stemming using cooccurrence of word variants. ACM Transactions on Information Systems, 1998
- WordNet. Communications of the ACM, 1995
- Query Expansion using Lexical-Semantic Relations. Published by Springer Nature, 1994
- Word Sense Disambiguation and Information Retrieval. Published by Springer Nature, 1994
- Explorations in Automatic Thesaurus Discovery. Published by Springer Nature, 1994
- Lexical ambiguity and information retrieval. ACM Transactions on Information Systems, 1992
- Providing machine tractable dictionary tools. Machine Translation, 1990
- Extended Boolean information retrieval. Communications of the ACM, 1983
- Document retrieval experiments using indexing vocabularies of varying size. I. Variety generation symbols assigned to the fronts of index terms. Journal of Documentation, 1979
- Learning to disambiguate. Information Storage and Retrieval, 1973