Analysis and automatic recognition of false starts in spontaneous speech
- 1 January 1993
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- Vol. 2, 724-727 vol.2
- https://doi.org/10.1109/icassp.1993.319414
Abstract
The author describes the extent of prosodic phenomena in speech restarts in a multispeaker database of spontaneous, continuous speech, and gives intuitive explanations for them, based on a theory of using prosodics to cue semantic information to a listener. He also gives details (based on the acoustic data) on how to attempt to recognize these phenomena in the context of an automatic speech recognizer. It is shown that simple restarts (i.e., those without inserted or substituted words) could be distinguished acoustically, via an analysis of duration, F0, and spectral detail in the neighborhood of a pause. Including some current refinements, the author expects to be able to identify automatically such restarts with an accuracy exceeding 80%, while keeping false alarms to below 15%. Restarts with changed words may be distinguishable, but the required analysis will need to be much more complex, and beyond the scope of the present work.Keywords
This publication has 5 references indexed in Scilit:
- Automatic detection and correction of repairs in human-computer dialogPublished by Association for Computational Linguistics (ACL) ,1992
- Understanding spontaneous speech: the Phoenix systemPublished by Institute of Electrical and Electronics Engineers (IEEE) ,1991
- A Content-Processing View of Hesitation PhenomenaLanguage and Speech, 1981
- Analyse des variables temporelles du français spontanéPhonetica, 1973
- The SIFT algorithm for fundamental frequency estimationIEEE Transactions on Audio and Electroacoustics, 1972