A very low bit rate speech coder using HMM-based speech recognition/synthesis techniques

27 November 2002

conference paper
Published by Institute of Electrical and Electronics Engineers (IEEE)

Vol. 2 (15206149) , 609-612 vol.2
https://doi.org/10.1109/icassp.1998.675338

Abstract

This paper presents a very low bit rate speech coder based on HMM (hidden Markov model). The encoder carries out phoneme recognition, and transmits phoneme indexes, state durations and pitch information to the decoder. In the decoder, phoneme HMMs are concatenated according to the phoneme indexes, and a sequence of mel-cepstral coefficient vectors is generated from the concatenated HMM by using an ML-based speech parameter generation technique. Finally we obtain synthetic speech by exciting the MLSA (mel log spectrum approximation) filter, whose coefficients are given by mel-cepstral coefficients, according to the pitch information. A subjective listening test shows that the performance of the proposed coder at about 150 bit/s (for the test data including 26% silence region) is comparable to a VQ-based vocoder at 400 bit/s (=8 bit/frame/spl times/50 frame/s) without pitch quantization for both coders.

Keywords

This publication has 9 references indexed in Scilit:

A segment vocoder at 150 b/s
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2005
A phonetically labeled acoustic segment (PLAS) approach to speech analysis-synthesis
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2003
Speech synthesis using HMMs with dynamic features
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2002
Voice characteristics conversion for HMM-based speech synthesis system
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2002
Speech parameter generation from HMM using dynamic features
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2002
Phonetic vocoding with speaker adaptation
Published by International Speech Communication Association ,1997
An algorithm for speech parameter generation from continuous mixture HMMs with dynamic features
Published by International Speech Communication Association ,1995
An adaptive algorithm for mel-cepstral analysis of speech
Published by Institute of Electrical and Electronics Engineers (IEEE) ,1992
LPC speech coding based on variable-length segment quantization
IEEE Transactions on Acoustics, Speech, and Signal Processing, 1988