Optimizing join queries in distributed databases
- 1 September 1988
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Software Engineering
- Vol. 14 (9) , 1319-1326
- https://doi.org/10.1109/32.6175
Abstract
A reduced cover set of the set of full reducer semijoin programs for an acyclic query graph for a distributed database system is given. An algorithm is presented that determines the minimum cost full reducer program. The computational complexity of finding the optimal full reducer for a single relation is of the same order as that of finding the optimal full reducer for all relations. The optimization algorithm is able to handle query graphs where more than one attribute is common between the relations. A method for determining the optimum profitable semijoin program is presented. A low-cost algorithm which determines a near-optimal profitable semijoin program is outlined. This is done by converting a semijoin program into a partial order graph. This graph also allows one to maximize the concurrent processing of the semijoins. It is shown that the minimum response time is given by the largest cost path of the partial order graph. This reducibility is used as a post optimizer for the SSD-1 query optimization algorithm. It is shown that the least upper bound on the length of any profitable semijoin program is N(N-1) for a query graph of N nodes.<>Keywords
This publication has 8 references indexed in Scilit:
- Optimization of distributed tree queriesJournal of Computer and System Sciences, 1984
- Distributed query processingACM Computing Surveys, 1984
- Optimization Algorithms for Distributed QueriesIEEE Transactions on Software Engineering, 1983
- Query processing in a system for distributed databases (SDD-1)ACM Transactions on Database Systems, 1981
- Using Semi-Joins to Solve Relational QueriesJournal of the ACM, 1981
- The Architectural Features and Implementation Techniques of the Multicell CASSMIEEE Transactions on Computers, 1979
- Implementing a relational database by means of specialzed hardwareACM Transactions on Database Systems, 1979
- Distributed query processing in a relational data base systemPublished by Association for Computing Machinery (ACM) ,1978