The Burrows-Wheeler Transform for Block Sorting Text Compression: Principles and Improvements
- 1 January 1996
- journal article
- research article
- Published by Oxford University Press (OUP) in The Computer Journal
- Vol. 39 (9) , 731-740
- https://doi.org/10.1093/comjnl/39.9.731
Abstract
A recent development in text compression is a ‘block sorting’ algorithm which permutes the input text according to a special sort procedure and then processes the permuted text with Move-To-Front (MTF) and a final statistical compressor. The technique combines good speed with excellent compression performance. This paper investigates the fundamental operation of the algorithm and presents some improvements based on that analysis. Although block sorting is clearly related to previous compression techniques, it appears that it is best described by techniques derived from work by Shannon on the prediction and entropy of English text. A simple model is developed which relates the compression to the proportion of zeros after the MTF stage.Keywords
This publication has 0 references indexed in Scilit: