A text-to-speech system for Spanish with a frequency domain based prosodic modification algorithm
- 1 January 1993
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- Vol. 2 (15206149), 183-186 vol.2
- https://doi.org/10.1109/icassp.1993.319264
Abstract
From the input text, the linguistic-prosodic module obtains the phonetic transcription and the prosodic marks, which reflect both the syntactic structure and some rhythmic constraints. The synthesis module is a variation of the MBE (multiband excitation) vocoder with an LPC (linear predictive coding) filter that is very flexible for prosodic modifications. Starting from a parametrized acoustic database, the algorithm decodes the speech units and modifies their prosody in a single process. Because the synthesis algorithm operates in the frequency domain, it allows fine pitch modification without spectral envelope distortion. The prosody modeling is done with the acoustic module, using a close-copy stylization method.
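The claim that frequency-domain synthesis permits pitch changes without distorting the spectral envelope can be illustrated with a minimal sketch. This is not the paper's MBE/LPC algorithm: it assumes a fully voiced frame, represents the envelope as hypothetical piecewise-linear (frequency, amplitude) points, and uses illustrative function names throughout. The idea shown is only that harmonics are placed at multiples of the new pitch and take their amplitudes from the unchanged envelope.

```python
import math

def envelope_at(freq_hz, env_points):
    """Linearly interpolate an envelope given as sorted (freq_hz, amplitude) pairs.

    The piecewise-linear representation is an assumption made for this sketch.
    """
    if freq_hz <= env_points[0][0]:
        return env_points[0][1]
    for (f_lo, a_lo), (f_hi, a_hi) in zip(env_points, env_points[1:]):
        if freq_hz <= f_hi:
            t = (freq_hz - f_lo) / (f_hi - f_lo)
            return a_lo + t * (a_hi - a_lo)
    return env_points[-1][1]

def harmonic_amplitudes(pitch_hz, env_points, max_freq_hz):
    """Return (frequency, amplitude) for each harmonic of pitch_hz up to max_freq_hz.

    Changing pitch_hz moves the harmonics; the envelope they sample stays fixed.
    """
    count = int(max_freq_hz // pitch_hz)
    return [(k * pitch_hz, envelope_at(k * pitch_hz, env_points))
            for k in range(1, count + 1)]

def synthesize(pitch_hz, env_points, sample_rate, num_samples, max_freq_hz):
    """Sum zero-phase cosines at the harmonic frequencies (voiced frame only)."""
    harmonics = harmonic_amplitudes(pitch_hz, env_points, max_freq_hz)
    return [sum(a * math.cos(2.0 * math.pi * f * n / sample_rate)
                for f, a in harmonics)
            for n in range(num_samples)]

# Hypothetical envelope: amplitude falls from 1.0 at 0 Hz to 0.0 at 4000 Hz.
env = [(0.0, 1.0), (1000.0, 0.5), (4000.0, 0.0)]
low = harmonic_amplitudes(100.0, env, 4000.0)   # original pitch
high = harmonic_amplitudes(200.0, env, 4000.0)  # pitch raised by an octave
frame = synthesize(200.0, env, 8000, 16, 4000.0)
```

Both harmonic sets sample the same envelope, so a harmonic that falls on 1000 Hz has the same amplitude at either pitch; only the spacing of the harmonics changes. A real system would also need phase continuity between frames and voiced/unvoiced decisions per band, which this sketch omits.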
This publication has 2 references indexed in Scilit:
- Multi-band vector excitation coding of speech at 4.8 kbps. Published by Institute of Electrical and Electronics Engineers (IEEE), 2002
- Multiband excitation vocoder. IEEE Transactions on Acoustics, Speech, and Signal Processing, 1988