A network approach to probabilistic information retrieval
- 1 July 1995
- journal article
- Published by Association for Computing Machinery (ACM) in ACM Transactions on Information Systems
- Vol. 13 (3) , 324-353
- https://doi.org/10.1145/203052.203067
Abstract
In this article we show how probabilistic information retrieval based on document components may be implemented as a feedforward (feedbackward) artificial neural network. The network supports adaptation of connection weights as well as the growing of new edges between queries and terms based on user relevance feedback data for training, and it reflects query modification and expansion in information retrieval. A learning rule is applied that can also be viewed as supporting sequential learning using a harmonic sequence learning rate. Experimental results with four standard small collections and a large Wall Street Journal collection (173,219 documents) show that performance of feedback improves substantially over no feedback, and further gains are obtained when queries are expanded with terms from the feedback documents. The effect is much more pronounced in small collections than in the large collection. Query expansion may be considered as a tool for both precision and recall enhancement. In particular, small query expansion levels of about 30 terms can achieve most of the gains at the low-recall high-precision region, while larger expansion levels continue to provide gains at the high-recall low-precision region of a precision recall curve.Keywords
This publication has 24 references indexed in Scilit:
- Experiments with a component theory of probabilistic information retrieval based on single terms as document componentsACM Transactions on Information Systems, 1990
- Indexing by latent semantic analysisJournal of the American Society for Information Science, 1990
- A generalization and clarification of the waller-kraft wish listInformation Processing & Management, 1989
- Optimum polynomial retrieval functions based on the probability ranking principleACM Transactions on Information Systems, 1989
- Experiments with document components for indexing and retrievalInformation Processing & Management, 1988
- Self-Organization and Associative MemoryPublished by Springer Nature ,1988
- RUBRIC: A System for Rule-Based Information RetrievalIEEE Transactions on Software Engineering, 1985
- USING PROBABILISTIC MODELS OF DOCUMENT RETRIEVAL WITHOUT RELEVANCE INFORMATIONJournal of Documentation, 1979
- A decision theoretic foundation for indexingJournal of the American Society for Information Science, 1975
- On Relevance, Probabilistic Indexing and Information RetrievalJournal of the ACM, 1960