Self-adaptive, on-line reclustering of complex object data
- 24 May 1994
- journal article
- Published by Association for Computing Machinery (ACM) in ACM SIGMOD Record
- Vol. 23 (2) , 407-418
- https://doi.org/10.1145/191843.191924
Abstract
A likely trend in the development of future CAD, CASE and office information systems will be the use of object-oriented database systems to manage their internal data stores. The entities that these applications will retrieve, such as electronic parts and their connections or customer service records, are typically large complex objects composed of many interconnected heterogeneous objects, not thousands of tuples. These applications may exhibit widely shifting usage patterns due to their interactive mode of operation. Such a class of applications would demand clustering methods that are appropriate for clustering large complex objects and that can adapt on-line to the shifting usage patterns. While most object-oriented clustering methods allow grouping of heterogeneous objects, they are usually static and can only be changed off-line. We present one possible architecture for performing complex object reclustering in an on-line manner that is adaptive to changing usage patterns. Our architecture involves the decomposition of a clustering method into concurrently operating components that each handle one of the fundamental tasks involved in reclustering, namely statistics collection, cluster analysis, and reorganization. We present the results of an experiment performed to evaluate its behavior. These results show that the average miss rate for object accesses can be effectively reduced using a combination of rules that we have developed for deciding when cluster analyses and reorganizations should be performed.Keywords
This publication has 8 references indexed in Scilit:
- A stochastic approach for clustering in object basesPublished by Association for Computing Machinery (ACM) ,1991
- Effective clustering of complex objects in object-oriented databasesPublished by Association for Computing Machinery (ACM) ,1991
- Cactis: a self-adaptive, concurrent implementation of an object-oriented database management systemACM Transactions on Database Systems, 1989
- Clustering a DAG for CAD databasesIEEE Transactions on Software Engineering, 1988
- The 5 minute rule for trading memory for disc accesses and the 10 byte rule for trading memory for CPU timePublished by Association for Computing Machinery (ACM) ,1987
- Adaptive record clusteringACM Transactions on Database Systems, 1985
- A real-time garbage collector based on the lifetimes of objectsCommunications of the ACM, 1983
- A clustering algorithm for hierarchical structuresACM Transactions on Database Systems, 1977