Analysis of performance variation using query expansion
- 30 January 2003
- journal article
- research article
- Published by Wiley in Journal of the American Society for Information Science and Technology
- Vol. 54 (5), 379-391
- https://doi.org/10.1002/asi.10217
Abstract
Information retrieval performance evaluation is commonly based on classical recall and precision figures or graphs. However, important information indicating the causes of variation may remain hidden beneath average recall and precision figures. Identifying significant causes of variation can help researchers and developers focus on the opportunities for improvement that underlie the averages. This article presents a case study showing the potential of a statistical repeated measures analysis of variance for testing the significance of factors in retrieval performance variation. The TREC‐9 Query Track performance data are used as the case study, and the factors studied are retrieval method, topic, and their interaction. The results show that retrieval method, topic, and their interaction are all significant. A topic-level analysis is also made to examine how the performance of retrieval methods varies across topics. The observed performance gains of the expansion runs are statistically significant improvements for most of the topics. Analyses of the effect of query expansion on document ranking confirm that expansion affects ranking positively.
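To make the kind of analysis described in the abstract more concrete, the following is a minimal sketch of how score variation can be partitioned into method, topic, and method-by-topic interaction components. It is not the authors' procedure: it uses an ordinary balanced two-way analysis of variance with replication rather than their repeated measures model, and the function name `two_way_anova`, the method and topic labels, and all score values are invented for illustration only.

```python
from itertools import product
from statistics import mean


def two_way_anova(scores):
    """Partition variation for a balanced two-factor design.

    scores maps (method, topic) -> list of replicate scores; every cell
    is assumed to hold the same number of replicates (hypothetical layout).
    """
    methods = sorted({m for m, _ in scores})
    topics = sorted({t for _, t in scores})
    a, b = len(methods), len(topics)
    n = len(next(iter(scores.values())))  # replicates per cell

    grand = mean(x for cell in scores.values() for x in cell)
    m_mean = {m: mean(x for t in topics for x in scores[(m, t)]) for m in methods}
    t_mean = {t: mean(x for m in methods for x in scores[(m, t)]) for t in topics}
    c_mean = {key: mean(cell) for key, cell in scores.items()}

    # Sums of squares for each source of variation.
    ss = {
        "method": b * n * sum((m_mean[m] - grand) ** 2 for m in methods),
        "topic": a * n * sum((t_mean[t] - grand) ** 2 for t in topics),
        "interaction": n * sum(
            (c_mean[(m, t)] - m_mean[m] - t_mean[t] + grand) ** 2
            for m, t in product(methods, topics)
        ),
        "error": sum((x - c_mean[key]) ** 2 for key, cell in scores.items() for x in cell),
    }
    df = {
        "method": a - 1,
        "topic": b - 1,
        "interaction": (a - 1) * (b - 1),
        "error": a * b * (n - 1),
    }
    # F statistic: mean square of each factor over the error mean square.
    ms_error = ss["error"] / df["error"]
    f = {k: (ss[k] / df[k]) / ms_error for k in ("method", "topic", "interaction")}
    return ss, df, f


# Synthetic average-precision-like scores: 2 methods x 3 topics x 2 query variants.
scores = {
    ("base", "t1"): [0.20, 0.24], ("base", "t2"): [0.40, 0.44], ("base", "t3"): [0.10, 0.14],
    ("expanded", "t1"): [0.30, 0.34], ("expanded", "t2"): [0.42, 0.46], ("expanded", "t3"): [0.25, 0.29],
}

ss, df, f = two_way_anova(scores)
for source in ("method", "topic", "interaction"):
    print(source, "df =", df[source], "F =", round(f[source], 2))
```

Each F statistic would then be compared against the F distribution with the corresponding degrees of freedom to decide significance. A large interaction term is what motivates a topic-level follow-up, since it indicates that the benefit of a method is not uniform across topics.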