Analysis of a heuristic for full trie minimization
- 1 September 1981
- journal article
- Published by Association for Computing Machinery (ACM) in ACM Transactions on Database Systems
- Vol. 6 (3) , 513-537
- https://doi.org/10.1145/319587.319618
Abstract
A trie is a distributed-key search tree in which records from a file correspond to leaves in the tree. Retrieval consists of following a path from one root to a leaf, where the choice of edge at each node is determined by attribute values of the key. For full tries, those in which all leaves lie at the same depth, the problem of finding an ordering of attributes which yields a minimum size trie is NP-complete. This paper considers a “greedy” heuristic for constructing low-cost tries. It presents simulation experiments which show that the greedy method tends to produce tries with small size, and analysis leading to a worst case bound on approximations produced by the heuristic. It also shows a class of files for which the greedy method may perform badly, producing tries of high cost.Keywords
This publication has 4 references indexed in Scilit:
- Heuristics for trie index minimizationACM Transactions on Database Systems, 1979
- The Complexity of Trie Index ConstructionJournal of the ACM, 1977
- Storage optimization of tree structured files representing descriptor setsPublished by Association for Computing Machinery (ACM) ,1971
- Trie memoryCommunications of the ACM, 1960