A text-to-speech system for Spanish with a frequency domain based prosodic modification algorithm

1 January 1993

conference paper
Published by Institute of Electrical and Electronics Engineers (IEEE)

Vol. 2 (ISSN 1520-6149), pp. 183-186
https://doi.org/10.1109/icassp.1993.319264

Abstract

From the input text, the linguistic-prosodic module obtains the phonetic transcription and prosodic marks that reflect both the syntactic structure and some rhythmical constraints. The synthesis module is a variant of the multiband excitation (MBE) vocoder with a linear predictive coding (LPC) filter that is very flexible for prosodic modification. Working from a parametrized acoustic database, the algorithm decodes the speech units and modifies their prosody in a single process. Because the synthesis algorithm operates in the frequency domain, it allows fine pitch modification without distorting the spectral envelope. Prosody is modeled with the acoustic module using a close-copy stylization method.
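
The claim that frequency-domain synthesis can change pitch without distorting the spectral envelope can be illustrated with a minimal sketch. It is not the paper's algorithm: it assumes a purely voiced frame, an all-pole LPC envelope, and a plain sum-of-cosines synthesizer, and it omits the band-wise voiced/unvoiced decisions of an MBE vocoder. The function names (`lpc_envelope`, `harmonic_amplitudes`, `synthesize`) and the example filter coefficients are hypothetical, chosen only for illustration.

```python
import cmath
import math


def lpc_envelope(a, gain, freq_hz, fs):
    """Magnitude of the all-pole filter gain / A(z) at freq_hz,
    with A(z) = 1 + a[0] z^-1 + a[1] z^-2 + ... (assumed convention)."""
    w = 2.0 * math.pi * freq_hz / fs
    denom = 1.0 + sum(ak * cmath.exp(-1j * w * (k + 1)) for k, ak in enumerate(a))
    return gain / abs(denom)


def harmonic_amplitudes(a, gain, f0, fs):
    """Sample the envelope at every multiple of f0 below the Nyquist frequency.
    Changing f0 moves the sampling points, not the envelope itself."""
    max_h = int(fs / 2 / f0)
    freqs = [h * f0 for h in range(1, max_h + 1) if h * f0 < fs / 2]
    return [(f, lpc_envelope(a, gain, f, fs)) for f in freqs]


def synthesize(a, gain, f0, fs, n_samples):
    """Sum-of-cosines synthesis of one voiced frame (zero phases assumed)."""
    harmonics = harmonic_amplitudes(a, gain, f0, fs)
    return [
        sum(amp * math.cos(2.0 * math.pi * f * n / fs) for f, amp in harmonics)
        for n in range(n_samples)
    ]


# Hypothetical envelope: one resonance near 500 Hz at an 8 kHz sampling rate.
FS = 8000
R, THETA = 0.95, 2.0 * math.pi * 500 / FS
A = [-2.0 * R * math.cos(THETA), R * R]

low_pitch = harmonic_amplitudes(A, 1.0, 100, FS)   # f0 = 100 Hz
high_pitch = harmonic_amplitudes(A, 1.0, 200, FS)  # f0 = 200 Hz
```

In this sketch, raising the fundamental from 100 Hz to 200 Hz halves the number of harmonics, but any frequency present in both sets (for example 200 Hz) keeps the same amplitude, because both sets are read from the same envelope. A time-domain approach that resamples the waveform would instead shift the resonance along with the pitch.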

This publication has 2 references indexed in Scilit:

Multi-band vector excitation coding of speech at 4.8 kbps
Published by Institute of Electrical and Electronics Engineers (IEEE), 2002
Multiband excitation vocoder
IEEE Transactions on Acoustics, Speech, and Signal Processing, 1988