Diphone synthesis of French: Vocal response unit and automatic prosody from the text.

24 March 2005

conference paper
Published by Institute of Electrical and Electronics Engineers (IEEE)

Vol. 2, 560-563
https://doi.org/10.1109/icassp.1977.1170209

Abstract

Diphone synthesis has been first introduced in France by LEIPP and al. in 1967 in relation with a perceptive theory describing the speech structures. An intelligible, monotonous voice was synthesized by means of a 44 oscillator device named ICOPHONE, from a lexicon of some 600 normalized diphones. As a result, the autonomous vocal response unit ICOPHONE 5, operational since 1974, produces fluent French in real-time from the text written in orthographic or phonetic form. Vocal response, including prosody, should be entirely automatic, and work even with non-grammatical sentences. An algorithm has been written with respect to these considerations : pitch and duration are deduced from the text, without any syntax analysis or manual marking. Present results confirm the idea that syntax is not the essential factor governing prosodic contours.

Keywords

This publication has 5 references indexed in Scilit:

Acoustic-phonetic recognition of connected speech using transient information
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2005
Speech synthesis by dyads and automatic intonation processing
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2005
Terminal analog synthesis of continuous speech using the diphone method of segment assembly
IEEE Transactions on Audio and Electroacoustics, 1968
Segmentation Techniques in Speech Synthesis
The Journal of the Acoustical Society of America, 1958
The Interconversion of Audible and Visible Patterns as a Basis for Research in the Perception of Speech
Proceedings of the National Academy of Sciences, 1951