Optimization of join operations in horizontally partitioned database systems
- 1 March 1986
- journal article
- Published by Association for Computing Machinery (ACM) in ACM Transactions on Database Systems
- Vol. 11 (1) , 48-80
- https://doi.org/10.1145/5236.5241
Abstract
This paper analyzes the problem of joining two horizontally partitioned relations in a distributed database system. Two types of semijoin strategies are introduced, local and remote. Local semijoins are performed at the site of the restricted relation (or fragment), and remote semijoins can be performed at an arbitrary site. A mathematical model of a semijoin strategy for the case of remote semijoins is developed, and lower bounding and heuristic procedures are proposed. The results of computational experiments are reported. The experiments include an analysis of the heuristics' performance relative to the lower bounds, sensitivity analysis, and error analysis. These results reveal a good performance of the heuristic procedures, and demonstrate the benefit of using semijoin operations to reduce the size of fragments prior to their transmission. The algorithms for the case of remote semijoins were found to be superior to the algorithms for the case of local semijoins. In addition, we found that the estimation accuracy of the selectivity factors has a significant effect on the incurred communication cost.Keywords
This publication has 15 references indexed in Scilit:
- Topological design of centralized computer networks—formulations and algorithmsNetworks, 1982
- Query processing in a system for distributed databases (SDD-1)ACM Transactions on Database Systems, 1981
- Using Semi-Joins to Solve Relational QueriesJournal of the ACM, 1981
- A Dual-Based Procedure for Uncapacitated Facility LocationOperations Research, 1978
- Lagrangean Relaxation Applied to Capacitated Facility Location ProblemsA I I E Transactions, 1978
- Exceptional Paper—Location of Bank Accounts to Optimize Float: An Analytic Study of Exact and Approximate AlgorithmsManagement Science, 1977
- Decomposition—a strategy for query processingACM Transactions on Database Systems, 1976
- Multicommodity Distribution System Design by Benders DecompositionManagement Science, 1974
- Lagrangean relaxation for integer programmingPublished by Springer Nature ,1974
- The Traveling-Salesman Problem and Minimum Spanning TreesOperations Research, 1970