Synthesis-based recognition of continuous speech

1 April 1982

journal article
research article
Published by Acoustical Society of America (ASA) in The Journal of the Acoustical Society of America

Vol. 71 (4) , 1016-1024
https://doi.org/10.1121/1.387653

Abstract

An acoustic phonemic recognition system for continuous [human] speech is presented. The system uses both steady state and transition segments of the speech signal to achieve recognition. The information contained in formant transitions is used by the system by using a synthesis-based recognition approach. This improves the performance of the system considerably. Recognition of continuous speech is accomplished here in 3 stages: segmentation, steady-state recognition and synthesis-based recognition. The system was tried out on 40 test utterances, each 3-4 s in duration, spoken by a single male speaker and the following results are obtained: 5.4% missed segment error, 8.3% extra segment error, 52.3% correct recognition using only steady-state segments and 62.0% correct recognition using both steady-state and transition segments.

This publication has 1 reference indexed in Scilit:

On the Rôle of Formant Transitions in Vowel Recognition
The Journal of the Acoustical Society of America, 1967