A probabilistic relational algebra for the integration of information retrieval and database systems
- 1 January 1997
- journal article
- Published by Association for Computing Machinery (ACM) in ACM Transactions on Information Systems
- Vol. 15 (1) , 32-66
- https://doi.org/10.1145/239041.239045
Abstract
We present a probabilistic relational algebra (PRA) which is a generalization of standard relational algebra. In PRA, tuples are assigned probabilistic weights giving the probability that a tuple belongs to a relation. Based on intensional semantics, the tuple weights of the result of a PRA expression always conform to the underlying probabilistic model. We also show for which expressions extensional semantics yields the same results. Furthermore, we discuss complexity issues and indicate possibilities for optimization. With regard to databases, the approach allows for representing imprecise attribute values, whereas for information retrieval, probabilistic document indexing and probabilistic search term weighting can be modeled. We introduce the concept of vague predicates which yield probabilistic weights instead of Boolean values, thus allowing for queries with vague selection conditions. With these features, PRA implements uncertainty and vagueness in combination with the relational model.Keywords
This publication has 33 references indexed in Scilit:
- Searching structured documents with the enhanced retrieval functionality of freeWAIS-sf and SFgateComputer Networks and ISDN Systems, 1995
- Accommodating subjective vagueness through a fuzzy extension to the relational data modelInformation Systems, 1993
- Fuzzy database query languages and their relational completeness theoremIEEE Transactions on Knowledge and Data Engineering, 1993
- The management of probabilistic dataIEEE Transactions on Knowledge and Data Engineering, 1992
- Accommodating imprecision in database systemsACM SIGMOD Record, 1990
- Non-first normal form universal relations: An application to information retrieval systemsInformation Systems, 1987
- Missing information (applicable and inapplicable) in relational databasesACM SIGMOD Record, 1986
- A non-classical logic for information retrievalThe Computer Journal, 1986
- Implementing imprecision in information systemsInformation Sciences, 1985
- Query Optimization in Database SystemsACM Computing Surveys, 1984