Combinatorial Compression and Partitioning of Large Dictionaries

Open Access

1 November 1983

journal article
Published by Oxford University Press (OUP) in The Computer Journal

Vol. 26 (4) , 336-343
https://doi.org/10.1093/comjnl/26.4.336

Abstract

A method for compressing large dictionaries is proposed, based on transforming words into lexicographically ordered strings of distinct letters, together with permutation indexes. Algorithms to generate such strings are described. Results of applying the method to the dictionaries of two large databases, in Hebrew and English, are presented. The main message is a method of partitioning the dictionary such that the ‘information bearing fraction’ is stored in fast memory, and the bulk in auxiliary memory.

Keywords

MEMORY
COMPRESSION
PARTITIONING
WORDS
ENGLISH
LETTERS
HEBREW
STRINGS
LARGE DICTIONARIES

This publication has 0 references indexed in Scilit: