Effects of rate and pitch variations on the intelligibility of synthesized speech
- 1 January 1991
- journal article
- research article
- Published by Taylor & Francis in Augmentative and Alternative Communication
- Vol. 7 (4), 284-289
- https://doi.org/10.1080/07434619112331276023
Abstract
The present study investigated the effect of pitch and rate variations on the intelligibility of Echo-II produced speech. The results show that slowing down the rate of text-to-speech synthesized speech from the default value of about 201 syllables per minute to 139 syllables per minute enhances word intelligibility by more than 10 percent and message intelligibility by about 14 percent. High (194 Hz), medium (111 Hz), and low (82 Hz) pitch levels produced roughly equivalent intelligibility scores. Augmentative and alternative communication and educational applications of text-to-speech synthesis need to permit users the flexibility of varying rate of speech to enhance intelligibility. Clinicians and teachers also need to note that message intelligibility is still only 58 percent with a slow rate of speech.
This publication has 5 references indexed in Scilit:
- Speaking Rates of Young Children. Language, Speech, and Hearing Services in Schools, 1989
- The Intelligibility of Synthesized Speech. Journal of Speech, Language, and Hearing Research, 1987
- A comparison of speech synthesis intelligibility with listeners from three age groups. Augmentative and Alternative Communication, 1987
- Frequency of Word Occurrence in Communication Samples Produced by Adult Communication Aid Users. Journal of Speech and Hearing Disorders, 1984
- Statistical principles in experimental design. Published by American Psychological Association (APA), 1962