Intonation in text-to-speech synthesis: Evaluation of algorithms

1 June 1985

journal article
Published by Acoustical Society of America (ASA) in The Journal of the Acoustical Society of America

Vol. 77 (6) , 2157-2165
https://doi.org/10.1121/1.391739

Abstract

Two algorithms, termed schematic and naturalistic, for generating intonation contours in an English text-to-speech system are compared by eliciting preference judgments from a total of 21 subjects. The major problem for both algorithms, but especially for the schematic algorithm, has to do with accent assignment and with the determination of the intonation phrase rather than with the phonetic realization of accent through manipulation of F0. Due to parser errors, phrase boundaries are incorrectly identified in 30% of the sentences used in the three experiments. Moreover, the naturalistic algorithm uses a grammatical part-of-speech hierarchy which ranks nouns higher than verbs. Therefore, incorrect classification of verbs as nouns (the major classification error) results in an unintended accent. The results indicate that accent assignment and phrase determination are the primary areas requiring improvement in order to further increase the naturalness of synthetic speech intonation.

This publication has 0 references indexed in Scilit: