Selective modeling of the LPC residual during unvoiced frames: White noise or pulse excitation
- 24 March 2005
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- Vol. 11, 3087-3090
- https://doi.org/10.1109/icassp.1986.1168772
Abstract
This paper presents a new method of modeling the LPC residual during unvoiced speech for voice coding at 4.8 kb/s. With this method, speech is synthesized using one of three excitation types: periodic pitch pulses, random noise, or multipulse. By using multipulse excitation it is possible to accurately produce speech which is difficult to model using noise and pitch pulses alone [1]. Since multipulse is only used where appropriate, efficient, sub-optimal methods of calculating the pulse amplitudes and positions are adequate, simplifying the implementation into a real-time system. The synthetic speech may be coded at 4.8 kb/s since multipulse, used only where appropriate, suffers little quality loss when quantized. A method of determining which excitation type is to be used is discussed. Formal listening test results are also presented.
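The three excitation types named in the abstract can be illustrated with a short Python sketch. It is not the paper's implementation: the function names, the fixed pitch period, and in particular `pick_pulses` (which simply keeps the largest-magnitude residual samples) are assumptions chosen for illustration. The paper's own sub-optimal pulse search and its rule for selecting the excitation type are not described in this record.

```python
import random


def pulse_train(n, period, gain=1.0):
    """Periodic pitch pulses: one impulse every `period` samples."""
    return [gain if i % period == 0 else 0.0 for i in range(n)]


def white_noise(n, gain=1.0, seed=0):
    """Gaussian white noise excitation (seeded for repeatability)."""
    rng = random.Random(seed)
    return [gain * rng.gauss(0.0, 1.0) for _ in range(n)]


def multipulse(n, positions, amplitudes):
    """Sparse excitation: a few pulses at arbitrary positions and amplitudes."""
    e = [0.0] * n
    for p, a in zip(positions, amplitudes):
        e[p] += a
    return e


def pick_pulses(residual, k):
    """Hypothetical stand-in for a pulse search: keep the k residual
    samples with the largest magnitude. NOT the method from the paper."""
    idx = sorted(range(len(residual)),
                 key=lambda i: abs(residual[i]), reverse=True)[:k]
    idx.sort()
    return idx, [residual[i] for i in idx]


def lpc_synthesize(excitation, a):
    """All-pole LPC synthesis: s[n] = e[n] + sum_k a[k] * s[n-1-k]."""
    s = []
    for n, e in enumerate(excitation):
        acc = e
        for k, ak in enumerate(a):
            if n - 1 - k >= 0:
                acc += ak * s[n - 1 - k]
        s.append(acc)
    return s


if __name__ == "__main__":
    a = [0.5]  # assumed first-order predictor coefficient, for illustration
    frame = 16
    voiced = lpc_synthesize(pulse_train(frame, period=8), a)
    unvoiced = lpc_synthesize(white_noise(frame, gain=0.1), a)
    pos, amp = pick_pulses([0.1, -3.0, 0.5, 2.0] + [0.0] * 12, k=2)
    mixed = lpc_synthesize(multipulse(frame, pos, amp), a)
    print(len(voiced), len(unvoiced), len(mixed))
```

In a coder of this kind, one excitation would be chosen per frame and passed through the same synthesis filter. Only the excitation generator changes between frame types.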
This publication has 5 references indexed in Scilit:
- A new model of LPC excitation for producing natural-sounding speech at low bit rates. Published by Institute of Electrical and Electronics Engineers (IEEE), 2005
- Fast and accurate pitch detection using pattern recognition and adaptive time-domain analysis. Published by Institute of Electrical and Electronics Engineers (IEEE), 2005
- A Subjective Comparison of Selected Digital Codecs for Speech. Bell System Technical Journal, 1978
- Linear Prediction of Speech. Published by Springer Nature, 1976
- Parallel Processing Techniques for Estimating Pitch Periods of Speech in the Time Domain. The Journal of the Acoustical Society of America, 1969