Workload balance and page access scheduling for parallel joins in shared-nothing systems

30 December 2002

conference paper
Published by Institute of Electrical and Electronics Engineers (IEEE)

p. 411-418
https://doi.org/10.1109/icde.1993.344040

Abstract

A methodology to resolve balancing and scheduling issues for parallel join execution in a shared-nothing multiprocessor environment is presented. In the past, research on parallel join methods focused on the design of algorithms for partitioning relations and distributing data buckets as evenly as possible to the processors. Once data are uniformly distributed to the processors, it is assumed that all processors will complete their tasks at about the same time. The authors stress that this is true only if no further information, such as a page-level join index, is available. Otherwise, the join execution can be further optimized, and the workload across the processors may still be unbalanced. The authors study these problems in a shared-nothing environment.

This publication has 12 references indexed in Scilit:

Dynamic and load-balanced task-oriented database query processing in parallel systems
Published by Springer Nature, 2005
Join strategies on KD-tree indexed relations
Published by Institute of Electrical and Electronics Engineers (IEEE), 2003
Processor scheduling for multiprocessor joins
Published by Institute of Electrical and Electronics Engineers (IEEE), 2003
Scheduling of page fetches in join operations using B_c-trees
Published by Institute of Electrical and Electronics Engineers (IEEE), 2003
Spatial join indices
Published by Institute of Electrical and Electronics Engineers (IEEE), 2002
Chained declustering: a new availability strategy for multiprocessor database machines
Published by Institute of Electrical and Electronics Engineers (IEEE), 2002
The Gamma database machine project
IEEE Transactions on Knowledge and Data Engineering, 1990
Prototyping Bubba, a highly parallel database system
IEEE Transactions on Knowledge and Data Engineering, 1990
The Grid File
ACM Transactions on Database Systems, 1984
Parallel algorithms for the execution of relational database operations
ACM Transactions on Database Systems, 1983