Segmental vocoder-going beyond the phonetic approach

27 November 2002

conference paper
Published by Institute of Electrical and Electronics Engineers (IEEE)

Vol. 2 (15206149) , 605-608 vol.2
https://doi.org/10.1109/icassp.1998.675337

Abstract

The problem of very low bit rate segmental speech coding is addressed. The basic units are found automatically in the training database using temporal decomposition, vector quantization and multigrams. They are modelled by HMMs. The coding is based on recognition and synthesis. In single speaker tests, we obtained intelligible and naturally sounding speech at a mean rate of 211.2 b/s. In the end, future extensions of our scheme (diphone-like synthesis and speaker adaptation) as well as possible use of automatically derived units in recognition are discussed.

Keywords

This publication has 5 references indexed in Scilit:

A phonetic vocoder
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2003
Variable dimension vector quantization of linear predictive coefficients of speech
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2002
Phonetic vocoding with speaker adaptation
Published by International Speech Communication Association ,1997
Variable-length sequence matching for phonetic transcription using joint multigrams
Published by International Speech Communication Association ,1995
Variable-length sequence modeling: multigrams
IEEE Signal Processing Letters, 1995