Segmental vocoder-going beyond the phonetic approach
- 27 November 2002
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- Vol. 2 (15206149) , 605-608 vol.2
- https://doi.org/10.1109/icassp.1998.675337
Abstract
The problem of very low bit rate segmental speech coding is addressed. The basic units are found automatically in the training database using temporal decomposition, vector quantization and multigrams. They are modelled by HMMs. The coding is based on recognition and synthesis. In single speaker tests, we obtained intelligible and naturally sounding speech at a mean rate of 211.2 b/s. In the end, future extensions of our scheme (diphone-like synthesis and speaker adaptation) as well as possible use of automatically derived units in recognition are discussed.Keywords
This publication has 5 references indexed in Scilit:
- A phonetic vocoderPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2003
- Variable dimension vector quantization of linear predictive coefficients of speechPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Phonetic vocoding with speaker adaptationPublished by International Speech Communication Association ,1997
- Variable-length sequence matching for phonetic transcription using joint multigramsPublished by International Speech Communication Association ,1995
- Variable-length sequence modeling: multigramsIEEE Signal Processing Letters, 1995