Multilingual PSOLA text-to-speech system

1 January 1993

conference paper
Published by Institute of Electrical and Electronics Engineers (IEEE)

Vol. 2, 187-190 vol.2
https://doi.org/10.1109/icassp.1993.319265

Abstract

Work done at CNET on developing multilingual concatenation-based PSOLA TTS (text-to-speech) systems well adapted to their use in interactive voice services is reviewed. A new system architecture SYC (synthesis control) has been specifically designed for assuring real-time, multichannel, and interactive operation. The data control and transfer methods within the system are now fully independent of the content of the modules, allowing for an easy adaptation of the system to multilingual operation. A complete version has been developed for German, and mixed systems have been elaborated for Italian and English in collaboration with other laboratories, using the same PSOLA synthesizer as for French. The overall quality of the French version has been improved both in naturalness (optimization of the linguistico-prosodic processings) and in articulation accuracy (use of longer units to be concatenated). An automatic segmentation procedure has been developed for the rapid building of new repertories of speech from recordings by new speakers, i.e., for the creation of new synthetic voices.<>

Keywords

This publication has 6 references indexed in Scilit:

A diphone synthesis system based on time-domain prosodic modifications of speech
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2003
Automatic generation of synthesis units based on context oriented clustering
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2003
A real-time French text-to-speech system generating high-quality synthetic speech
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2002
Concatenative speech synthesis by minimum distortion criteria
Published by Institute of Electrical and Electronics Engineers (IEEE) ,1992
Use of the magnitude estimation technique for assessing the performance of text-to-speech synthesis systems
The Journal of the Acoustical Society of America, 1990
Linguistic and prosodic processing for a text-to-speech synthesis system
Published by International Speech Communication Association ,1989