Segment quantization for very-low-rate speech coding
- 24 March 2005
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- Vol. 7, 1565-1568
- https://doi.org/10.1109/icassp.1982.1171472
Abstract
We introduce a new method for very-low-rate vocoding that the input speech as a sequence of variable-length segments. A segment is a by a spectrum of frames, where each frame is represented by a spectrum, pitch and gain. We use an automatic segmentation algorithm to obtain segments with an average duration comparable to that of a phoneme. A segment is quantized as a single block. The distance measure used for quantization incooporates the appropriate time alignment of two segments. We employ a computationally efficient metric that does not use the usual dynamic programming time warping. Two basic vocoders using the above approach of block quantization have been used to transmit intelligible speech at 200 b/s.Keywords
This publication has 4 references indexed in Scilit:
- A preliminary design of a phonetic vocoder based on a diphone modelPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2005
- Recent developments in vector quantization for speech processingPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2005
- A geometric treatment of the source encoding of a Gaussian random variableIEEE Transactions on Information Theory, 1968
- Similarity Measure for Automatic Speech and Speaker RecognitionThe Journal of the Acoustical Society of America, 1968