Segment quantization for very-low-rate speech coding

24 March 2005

conference paper
Published by Institute of Electrical and Electronics Engineers (IEEE)

Vol. 7, 1565-1568
https://doi.org/10.1109/icassp.1982.1171472

Abstract

We introduce a new method for very-low-rate vocoding that the input speech as a sequence of variable-length segments. A segment is a by a spectrum of frames, where each frame is represented by a spectrum, pitch and gain. We use an automatic segmentation algorithm to obtain segments with an average duration comparable to that of a phoneme. A segment is quantized as a single block. The distance measure used for quantization incooporates the appropriate time alignment of two segments. We employ a computationally efficient metric that does not use the usual dynamic programming time warping. Two basic vocoders using the above approach of block quantization have been used to transmit intelligible speech at 200 b/s.

Keywords

This publication has 4 references indexed in Scilit:

A preliminary design of a phonetic vocoder based on a diphone model
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2005
Recent developments in vector quantization for speech processing
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2005
A geometric treatment of the source encoding of a Gaussian random variable
IEEE Transactions on Information Theory, 1968
Similarity Measure for Automatic Speech and Speaker Recognition
The Journal of the Acoustical Society of America, 1968