A new model of LPC excitation for producing natural-sounding speech at low bit rates
Top Cited Papers
- 24 March 2005
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- Vol. 7, 614-617
- https://doi.org/10.1109/icassp.1982.1171649
Abstract
The excitation for LPC speech synthesis usually consists of two separate signals - a delta-function pulse once every pitch period for voiced speech and white noise for unvoiced speech. This manner of representing excitation requires that speech segments be classified accurately into voiced and unvoiced categories and the pitch period of voiced segments be known. It is now well recognized that such a rigid idealization of the vocal excitation is often responsible for the unnatural quality associated with synthesized speech. This paper describes a new approach to the excitation problem that does not require a priori knowledge of either the voiced-unvoiced decision or the pitch period. All classes of sounds are generated by exciting the LPC filter with a sequence of pulses; the amplitudes and locations of the pulses are determined using a non-iterative analysis-by-synthesis procedure. This procedure minimizes a perceptual-distance metric representing subjectively-important differences between the waveforms of the original and the synthetic speech signals. The distance metric takes account of the finite-frequency resolution as well as the differential sensitivity of the human ear to errors in the formant and inter-formant regions of the speech spectrum.Keywords
This publication has 10 references indexed in Scilit:
- Formant excitation before and after glottal closurePublished by Institute of Electrical and Electronics Engineers (IEEE) ,2005
- Optimizing digital speech coders by exploiting masking properties of the human earThe Journal of the Acoustical Society of America, 1979
- Frequency domain coding of speechIEEE Transactions on Acoustics, Speech, and Signal Processing, 1979
- Predictive coding of speech signals and subjective error criteriaIEEE Transactions on Acoustics, Speech, and Signal Processing, 1979
- Linear Prediction of SpeechPublished by Springer Nature ,1976
- Digital coding of speech waveforms: PCM, DPCM, and DM quantizersProceedings of the IEEE, 1974
- A linear prediction vocoder simulation based upon the autocorrelation methodIEEE Transactions on Acoustics, Speech, and Signal Processing, 1974
- Speech Analysis Synthesis and PerceptionPublished by Springer Nature ,1972
- Speech Analysis and Synthesis by Linear Prediction of the Speech WaveThe Journal of the Acoustical Society of America, 1971
- Vocoders: Analysis and synthesis of speechProceedings of the IEEE, 1966