Selective modeling of the LPC residual during unvoiced frames: White noise or pulse excitation

24 March 2005

conference paper
Published by Institute of Electrical and Electronics Engineers (IEEE)

Vol. 11, 3087-3090
https://doi.org/10.1109/icassp.1986.1168772

Abstract

This paper presents a new method of modeling the LPC residual during unvoiced speech for voice coding at 4.8 kb/s. With this method, speech is synthesized using one of three excitation types: periodic pitch pulses, random noise, or multipulse. By using multipulse excitation it is possible to accurately produce speech which is difficult to model using noise and pitch pulses alone [1]. Since multipulse is only used where appropriate, efficient, sub-optimal methods of calculating the pulse amplitudes and positions are adequate, simplifying the implementation into a real-time system. The synthetic speech may be coded at 4.8 kb/s since multipulse, used only where appropriate, suffers little quality loss when quantized. A method of determining which excitation type is to be used is discussed. Formal listening test results are also presented.
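The three excitation types named in the abstract can be illustrated with a minimal sketch. Everything below is an assumption made for illustration, not the paper's method: the function names, the largest-magnitude pulse picker (a crude stand-in for a sub-optimal multipulse search), the peakiness measure, and the threshold of 1.8 are all hypothetical, since the abstract does not say how the excitation type is chosen or how pulse positions and amplitudes are computed.

```python
import math
import random


def pitch_pulse_excitation(n, period, gain=1.0):
    """Periodic impulse train: one pulse every `period` samples."""
    return [gain if i % period == 0 else 0.0 for i in range(n)]


def noise_excitation(n, gain=1.0, seed=0):
    """Zero-mean Gaussian white noise scaled by `gain`."""
    rng = random.Random(seed)
    return [gain * rng.gauss(0.0, 1.0) for _ in range(n)]


def multipulse_excitation(residual, num_pulses):
    """Keep only the `num_pulses` largest-magnitude residual samples.

    Assumption: a deliberately crude pulse picker, not the
    analysis-by-synthesis search used in classical multipulse coders.
    """
    order = sorted(range(len(residual)),
                   key=lambda i: abs(residual[i]), reverse=True)
    keep = set(order[:num_pulses])
    return [residual[i] if i in keep else 0.0 for i in range(len(residual))]


def peakiness(residual):
    """Ratio of RMS to mean absolute value; larger for spiky signals."""
    n = len(residual)
    rms = math.sqrt(sum(x * x for x in residual) / n)
    mean_abs = sum(abs(x) for x in residual) / n
    return rms / mean_abs if mean_abs > 0 else 0.0


def choose_excitation(voiced, residual, peak_threshold=1.8):
    """Hypothetical selector: both the rule and the threshold are assumptions."""
    if voiced:
        return "pitch"
    return "multipulse" if peakiness(residual) > peak_threshold else "noise"


def lpc_synthesize(excitation, a):
    """All-pole filter 1/A(z), with A(z) = 1 + a[0] z^-1 + ... + a[p-1] z^-p."""
    out = []
    for n, e in enumerate(excitation):
        acc = e
        for k, ak in enumerate(a, start=1):
            if n - k >= 0:
                acc -= ak * out[n - k]  # feedback from past output samples
        out.append(acc)
    return out
```

In this sketch a voiced frame always gets pitch pulses, while an unvoiced frame gets multipulse only when its residual is spiky enough that white noise would model it poorly. The chosen excitation is then passed through `lpc_synthesize` with the frame's predictor coefficients.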

This publication has 5 references indexed in Scilit:

A new model of LPC excitation for producing natural-sounding speech at low bit rates
Published by Institute of Electrical and Electronics Engineers (IEEE), 2005
Fast and accurate pitch detection using pattern recognition and adaptive time-domain analysis
Published by Institute of Electrical and Electronics Engineers (IEEE), 2005
A Subjective Comparison of Selected Digital Codecs for Speech
Bell System Technical Journal, 1978
Linear Prediction of Speech
Published by Springer Nature, 1976
Parallel Processing Techniques for Estimating Pitch Periods of Speech in the Time Domain
The Journal of the Acoustical Society of America, 1969