Variable dimension vector quantization of linear predictive coefficients of speech
- 17 December 2002
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- Vol. i (15206149) , I/505-I/508
- https://doi.org/10.1109/icassp.1994.389245
Abstract
We introduce a method for locally optimal variable-to-variable length source coding with distortion, and apply it to coding the linear predictive coefficients of speech. The method is similar to entropy-constrained vector quantization, but it uses a dynamic programming algorithm to encode. The method automatically discovers variable-length source structure, in this case the acoustic-phonetic structure of speech. Using this structure, it is possible to compress the linear predictive coefficients of speech to one-third the rate of entropy-constrained vector quantization of speech, with no increase in spectral distortion. Auditory tests reveal that using this method, the spectral component of speech can be coded naturally and intelligibly to as low as 50 bits per second.Keywords
This publication has 16 references indexed in Scilit:
- Variable block-size image codingPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2005
- Segment quantization for very-low-rate speech codingPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2005
- Word recognition using whole word and subword modelsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2003
- Frame compression in hidden Markov modelsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2003
- Conditional entropy-constrained vector quantization of linear predictive coefficientsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Efficient quadtree coding of images and videoPublished by Institute of Electrical and Electronics Engineers (IEEE) ,1991
- Optimal pruning with applications to tree-structured source coding and modelingIEEE Transactions on Information Theory, 1989
- A stochastic segment model for phoneme-based continuous speech recognitionIEEE Transactions on Acoustics, Speech, and Signal Processing, 1989
- LPC speech coding based on variable-length segment quantizationIEEE Transactions on Acoustics, Speech, and Signal Processing, 1988
- Continuously variable duration hidden Markov models for automatic speech recognitionComputer Speech & Language, 1986