Improvements in 2.4 kbps high-quality speech coding
- 1 January 1992
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- Vol. 2 (15206149) , 145-148 vol.2
- https://doi.org/10.1109/icassp.1992.226099
Abstract
An algorithm for 2.4 kb/s speech coding is described. The main problem addressed is the coding of voiced speech. A way of coding the pitch structure is introduced. Compared with traditional coding schemes, it results in a better compromise between bit allocation for short-term quantization and residual coding. The coder uses vector quantization of the short-term parameters (line spectrum frequencies). The residual is lowpass filtered to obtain the baseband signal. Unvoiced frames are coded by means of a method based on repetition and interpolation of pitch pulses. The method exploits the high correlation between pitch pulses. Harmonic postfiltering is applied to obtain an improved high-frequency regeneration.Keywords
This publication has 8 references indexed in Scilit:
- Experiments with a high quality, low complexity 4800 bps residual excited LPC (RELP) vocoderPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2003
- Improved pitch prediction with fractional delays in CELP codingPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Beyond Multipulse and CELP Towards High Quality Speech at 4 Kb/sPublished by Springer Nature ,1991
- A 2.4 kbps high-quality speech coderPublished by Institute of Electrical and Electronics Engineers (IEEE) ,1991
- Efficient vector quantization of LPC parameters at 24 bits/framePublished by Institute of Electrical and Electronics Engineers (IEEE) ,1991
- Continuous representations in linear predictive codingPublished by Institute of Electrical and Electronics Engineers (IEEE) ,1991
- Speech coding at 4 kb/s and lower using single-pulse and stochastic models of LPC excitationPublished by Institute of Electrical and Electronics Engineers (IEEE) ,1991
- Epoch extraction from linear prediction residual for identification of closed glottis intervalIEEE Transactions on Acoustics, Speech, and Signal Processing, 1979