Some inconsistencies and misnomers in probabilistic information retrieval

1 September 1991

proceedings article
Published by Association for Computing Machinery (ACM)

p. 57-61
https://doi.org/10.1145/122860.122866

Abstract

The probabilistic theory of information retrieval involves the construction of mathematical models based on statistical assumptions of various sorts. One of the hazards inherent in this kind of theory construction is that the assumptions laid down may be inconsistent with the data to which they are applied. Another hazard is that the stated assumptions may not be the real assumptions on which the derived modelling equations or resulting experiments are actually based. Both kinds of error have been made repeatedly in research on probabilistic information retrieval. One consequence of these lapses is that the statistical character of certain probabilistic IR models, including the so-called ‘binary independence’ model, has been seriously misapprehended.

Keywords

This publication has 9 references indexed in Scilit:

A study of probabilistic information retrieval systems in the case of inconsistent expert judgments
Journal of the American Society for Information Science, 1991
Probabilistic document indexing from relevance feedback data
Published by Association for Computing Machinery (ACM) ,1989
An Inductive Search System: Theory, Design, and Implementation
IEEE Transactions on Systems, Man, and Cybernetics, 1986
Exploiting the maximum entropy principle to increase retrieval effectiveness
Journal of the American Society for Information Science, 1983
AN EVALUATION OF FEEDBACK IN DOCUMENT RETRIEVAL USING CO‐OCCURRENCE DATA
Journal of Documentation, 1978
Relevance weighting of search terms
Journal of the American Society for Information Science, 1976
Precision Weighting—An Effective Automatic Indexing Method
Journal of the ACM, 1976
A PROBABILISTIC SEARCH STRATEGY FORMEDLARS
Journal of Documentation, 1971
On Relevance, Probabilistic Indexing and Information Retrieval
Journal of the ACM, 1960