Variable dimension vector quantization of linear predictive coefficients of speech

17 December 2002

conference paper
Published by Institute of Electrical and Electronics Engineers (IEEE)

Vol. i (ISSN 1520-6149), pp. I/505-I/508
https://doi.org/10.1109/icassp.1994.389245

Abstract

We introduce a method for locally optimal variable-to-variable length source coding with distortion, and apply it to coding the linear predictive coefficients of speech. The method is similar to entropy-constrained vector quantization, but it uses a dynamic programming algorithm to encode. The method automatically discovers variable-length source structure, in this case the acoustic-phonetic structure of speech. Using this structure, it is possible to compress the linear predictive coefficients of speech to one-third the rate of entropy-constrained vector quantization of speech, with no increase in spectral distortion. Auditory tests reveal that using this method, the spectral component of speech can be coded naturally and intelligibly to as low as 50 bits per second.
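The encoding step described in the abstract can be illustrated with a minimal sketch. This is not the authors' algorithm. It assumes a fixed codebook whose codewords span different numbers of frames, scalar frames standing in for vectors of linear predictive coefficients, squared-error distortion, and an entropy-constrained style cost of distortion plus a Lagrange multiplier times the code length. The names `encode`, `frames`, `codebook`, `bits`, and `lam` are hypothetical. Dynamic programming then finds the segmentation of the input with the lowest total cost.

```python
from math import inf


def encode(frames, codebook, bits, lam):
    """Segment `frames` into codewords minimizing distortion + lam * rate.

    frames   : list of numbers (assumed stand-in for per-frame LPC vectors)
    codebook : list of codewords, each a list of numbers of length >= 1
    bits     : bits[i] is the code length charged for codebook[i]
    lam      : Lagrange multiplier trading rate against distortion
    """
    n = len(frames)
    best = [inf] * (n + 1)   # best[t]: minimal cost of coding frames[:t]
    back = [None] * (n + 1)  # back[t]: (segment start, codeword index)
    best[0] = 0.0

    for t in range(1, n + 1):
        for i, word in enumerate(codebook):
            start = t - len(word)
            if start < 0 or best[start] == inf:
                continue  # codeword does not fit, or its prefix is uncodable
            # Squared-error distortion between the segment and the codeword.
            dist = sum((x - c) ** 2 for x, c in zip(frames[start:t], word))
            cost = best[start] + dist + lam * bits[i]
            if cost < best[t]:
                best[t] = cost
                back[t] = (start, i)

    if best[n] == inf:
        raise ValueError("no segmentation covers the input")

    # Trace the back-pointers from the end to recover the codeword sequence.
    indices = []
    t = n
    while t > 0:
        t, i = back[t]
        indices.append(i)
    indices.reverse()
    return indices, best[n]


# Toy codebook: two single-frame codewords and two multi-frame codewords.
codebook = [[0.0], [1.0], [0.0, 0.0, 0.0], [1.0, 1.0]]
bits = [2, 2, 2, 2]
indices, cost = encode([0, 0, 0, 1, 1], codebook, bits, lam=1.0)
print(indices, cost)
```

In this toy case the encoder covers five frames with two multi-frame codewords instead of five single-frame ones, because each codeword costs the same number of bits regardless of how many frames it spans. That is the sense in which a variable-length codebook can lower the rate without raising distortion.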
