Allocating data and operations to nodes in distributed database design
- 1 April 1995
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Knowledge and Data Engineering
- Vol. 7 (2), 305-317
- https://doi.org/10.1109/69.382299
Abstract
The allocation of data and operations to nodes in a computer communications network is a critical issue in distributed database design. An efficient distributed database design must trade off performance and cost among retrieval and update activities at the various nodes. It must consider the concurrency control mechanism used as well as capacity constraints at nodes and on links in the network. It must determine where data will be allocated, the degree of data replication, which copy of the data will be used for each retrieval activity, and where operations such as select, project, join, and union will be performed. We develop a comprehensive mathematical modeling approach for this problem. The approach first generates units of data (file fragments) to be allocated from a logical data model representation and a characterization of retrieval and update activities. Retrieval and update activities are then decomposed into relational operations on these fragments. Both fragments and operations on them are then allocated to nodes using a mathematical modeling approach. The mathematical model considers network communication, local processing, and data storage costs. A genetic algorithm is developed to solve this mathematical formulation.
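To make the kind of problem described in the abstract concrete, the following is a minimal sketch of a genetic algorithm searching over replicated fragment allocations. It is not the authors' formulation: the cost model is a deliberately simplified stand-in (storage cost per copy, reads served by the cheapest copy, updates sent to every copy), it ignores local processing costs, capacity constraints, concurrency control, and operation allocation, and every name and number in it (`STORAGE_COST`, `LINK_COST`, `RETRIEVALS`, `UPDATES`, `allocation_cost`, `genetic_search`) is a hypothetical assumption chosen for illustration.

```python
import random

# Hypothetical toy instance (all values are illustrative assumptions).
NUM_NODES = 3
NUM_FRAGMENTS = 4
STORAGE_COST = [1.0, 1.5, 1.2]                 # cost of one fragment copy per node
LINK_COST = [[0, 4, 6], [4, 0, 3], [6, 3, 0]]  # cost per request between nodes
RETRIEVALS = [[10, 0, 2, 1], [0, 8, 2, 0], [1, 0, 9, 7]]  # [node][fragment] read frequency
UPDATES = [[1, 0, 0, 0], [0, 2, 1, 0], [0, 0, 1, 3]]      # [node][fragment] update frequency


def allocation_cost(alloc):
    """Cost of alloc, where alloc[f][n] is True if node n stores fragment f."""
    total = 0.0
    for f in range(NUM_FRAGMENTS):
        holders = [n for n in range(NUM_NODES) if alloc[f][n]]
        if not holders:
            return float("inf")  # every fragment needs at least one copy
        total += sum(STORAGE_COST[n] for n in holders)
        for src in range(NUM_NODES):
            # Each retrieval uses the cheapest reachable copy.
            total += RETRIEVALS[src][f] * min(LINK_COST[src][h] for h in holders)
            # Each update is propagated to every copy (assumed write-all policy).
            total += UPDATES[src][f] * sum(LINK_COST[src][h] for h in holders)
    return total


def repair(alloc, rng):
    """Give any fragment without a copy one randomly chosen holder."""
    for row in alloc:
        if not any(row):
            row[rng.randrange(NUM_NODES)] = True
    return alloc


def random_alloc(rng):
    alloc = [[rng.random() < 0.5 for _ in range(NUM_NODES)]
             for _ in range(NUM_FRAGMENTS)]
    return repair(alloc, rng)


def crossover(a, b, rng):
    """Uniform crossover: each fragment's placement comes from one parent."""
    return [list(a[f] if rng.random() < 0.5 else b[f])
            for f in range(NUM_FRAGMENTS)]


def mutate(alloc, rng, rate=0.1):
    """Flip each placement bit with probability rate, then repair."""
    for row in alloc:
        for n in range(NUM_NODES):
            if rng.random() < rate:
                row[n] = not row[n]
    return repair(alloc, rng)


def genetic_search(generations=60, pop_size=30, seed=0):
    rng = random.Random(seed)
    population = [random_alloc(rng) for _ in range(pop_size)]
    for _ in range(generations):
        population.sort(key=allocation_cost)
        # Elitism: carry the two best allocations over unchanged.
        next_gen = [[list(row) for row in ind] for ind in population[:2]]
        while len(next_gen) < pop_size:
            # Tournament selection of size 3 for each parent.
            p1 = min(rng.sample(population, 3), key=allocation_cost)
            p2 = min(rng.sample(population, 3), key=allocation_cost)
            next_gen.append(mutate(crossover(p1, p2, rng), rng))
        population = next_gen
    return min(population, key=allocation_cost)


if __name__ == "__main__":
    best = genetic_search()
    print(best, allocation_cost(best))
```

In this sketch the replication trade-off shows up directly in `allocation_cost`: adding a copy can lower retrieval cost for nearby readers but raises storage cost and the cost of every update to that fragment. A fuller model along the lines the abstract describes would add local processing cost terms and node and link capacity constraints, for example as penalty terms in the fitness function.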