Continuous representations in linear predictive coding
- 1 January 1991
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- No. 15206149, pp. 201-204, vol. 1
- https://doi.org/10.1109/icassp.1991.150312
Abstract
A major source of audible distortion in current low-bit-rate speech coding algorithms is an inaccurate degree of periodicity of the voiced speech signal. If the correlations between neighboring pitch cycles are accurately reproduced, these audible distortions can be reduced significantly. To this end, a novel method of coding voiced speech is introduced, which transmits an encoded prototype waveform at 20-30 ms intervals. The prototype waveform describes a pitch cycle representative of the interval, and is quantized using analysis-by-synthesis methods. The speech signal is reconstructed by concatenation of interpolated prototype waveforms. The short-term and the long-term correlations between pitch cycles can be controlled explicitly. Unquantized reconstructed speech is virtually indistinguishable from the original signal. The method results in excellent speech quality at rates between 3.0 and 4.0 kb/s.
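The reconstruction step described in the abstract (concatenating interpolated prototype waveforms) can be illustrated with a minimal sketch. This is not the paper's implementation. It assumes each prototype is stored as one pitch cycle of time-domain samples, that the waveform shape and the pitch period are both interpolated linearly between two update points, and that the two prototypes are already aligned in phase. The function names `sample_cycle` and `interpolate_prototypes` are hypothetical.

```python
import math


def sample_cycle(proto, phase):
    """Evaluate a one-cycle prototype at a normalized phase in [0, 1).

    The prototype is treated as cyclic, and values between stored
    samples are obtained by linear interpolation (an assumption made
    for this sketch).
    """
    pos = (phase % 1.0) * len(proto)
    i = int(pos) % len(proto)
    frac = pos - int(pos)
    j = (i + 1) % len(proto)
    return (1.0 - frac) * proto[i] + frac * proto[j]


def interpolate_prototypes(proto_a, proto_b, period_a, period_b, n_samples):
    """Reconstruct n_samples of signal between two update points.

    proto_a / proto_b: prototype pitch cycles at the start and end of
    the interval. period_a / period_b: pitch periods in samples.
    Both the shape and the period are cross-faded linearly (assumed).
    """
    out = []
    phase = 0.0
    for n in range(n_samples):
        w = n / n_samples  # 0 at the first prototype, approaches 1 at the second
        period = (1.0 - w) * period_a + w * period_b
        value = ((1.0 - w) * sample_cycle(proto_a, phase)
                 + w * sample_cycle(proto_b, phase))
        out.append(value)
        # Advance by one sample's worth of the current pitch cycle.
        phase = (phase + 1.0 / period) % 1.0
    return out


if __name__ == "__main__":
    # Hypothetical example: 20 ms at 8 kHz (160 samples), pitch period 40 samples.
    cycle = [math.sin(2.0 * math.pi * k / 40) for k in range(40)]
    signal = interpolate_prototypes(cycle, cycle, 40.0, 40.0, 160)
    print(len(signal))
```

Because consecutive cycles are derived from slowly evolving prototypes, the cycle-to-cycle correlation of the output is set directly by how the prototypes change, which is the kind of explicit control the abstract refers to. For scale, the rates and update intervals quoted in the abstract imply a total budget of roughly 60 bits (3.0 kb/s at 20 ms) to 120 bits (4.0 kb/s at 30 ms) per update interval.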