Continuous speech recognition based on high plausibility regions

1 January 1991

conference paper
Published by Institute of Electrical and Electronics Engineers (IEEE)

No. 15206149, pp. 725-728, vol. 1
https://doi.org/10.1109/icassp.1991.150442

Abstract

The authors propose an approach to phoneme-based continuous speech recognition when a time function of the plausibility of observing each phoneme (spotting result) is given. They introduce a criterion for the best sentence, based on the sum of plausibilities of individual symbols composing the sentence. Based on the idea of making use of high plausibility regions to reduce the computational load while maintaining optimality, the method finds the most plausible sentences relating to the input speech. Two optimization procedures are defined to deal with the following embedded search processes: (1) finding the best path connecting peaks of the plausibility functions of two successive symbols, and (2) finding the best time transition slot index for two given peaks. Experimental results show that the method gives better recognition precision while requiring about 1/20 of the computing time of the traditional DP-based methods. The experimental system obtained a 95% sentence recognition rate on a multispeaker test.
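
The two embedded searches named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not give the exact procedures, so the function names, the threshold-based peak picking, and the choice to score only the slots between consecutive peaks are all assumptions made for illustration. The only element taken from the abstract is the criterion itself, a sum of per-slot plausibilities of the symbols in the sentence.

```python
from itertools import accumulate


def prefix_sums(values):
    """Return S with S[k] = sum(values[:k]); a window sum is S[j] - S[i]."""
    return [0.0] + list(accumulate(values))


def find_peaks(plaus, threshold):
    """Hypothetical peak picker: local maxima at or above `threshold`.
    Stands in for the 'high plausibility regions' of the abstract."""
    last = len(plaus) - 1
    return [t for t, p in enumerate(plaus)
            if p >= threshold
            and (t == 0 or p >= plaus[t - 1])
            and (t == last or p > plaus[t + 1])]


def best_transition(plaus_a, plaus_b, peak_a, peak_b):
    """Inner search (2): choose the slot tau in (peak_a, peak_b] at which
    symbol a hands over to symbol b, maximising the summed plausibility.
    Returns (score, tau)."""
    if not peak_a < peak_b:
        raise ValueError("peak_a must precede peak_b")
    sa, sb = prefix_sums(plaus_a), prefix_sums(plaus_b)
    best = None
    for tau in range(peak_a + 1, peak_b + 1):
        # Slots peak_a..tau-1 are scored by a, slots tau..peak_b-1 by b.
        score = (sa[tau] - sa[peak_a]) + (sb[peak_b] - sb[tau])
        if best is None or score > best[0]:
            best = (score, tau)
    return best


def best_peak_path(plaus_seq, threshold=0.5):
    """Outer search (1): pick one peak per symbol, in time order, so that
    the summed segment scores are maximal. `plaus_seq` holds one
    plausibility function per symbol of a candidate sentence.
    Returns (score, peaks, transitions) or None if no ordered path exists."""
    peaks = [find_peaks(p, threshold) for p in plaus_seq]
    # layer maps an end peak to the best (score, peaks, transitions) so far.
    layer = {pk: (0.0, [pk], []) for pk in peaks[0]}
    for i in range(1, len(plaus_seq)):
        nxt = {}
        for pb in peaks[i]:
            for pa, (score, path, cuts) in layer.items():
                if pa >= pb:
                    continue  # peaks must advance in time
                seg, tau = best_transition(plaus_seq[i - 1], plaus_seq[i], pa, pb)
                cand = (score + seg, path + [pb], cuts + [tau])
                if pb not in nxt or cand[0] > nxt[pb][0]:
                    nxt[pb] = cand
        layer = nxt
    return max(layer.values(), key=lambda c: c[0]) if layer else None


# Made-up spotting outputs for two successive symbols over six time slots.
plaus_a = [0.1, 0.9, 0.8, 0.2, 0.1, 0.0]
plaus_b = [0.0, 0.1, 0.2, 0.7, 0.9, 0.3]
result = best_peak_path([plaus_a, plaus_b])
```

Because the search only visits peak pairs and the slots between them, its cost grows with the number of peaks rather than with the full time-by-symbol grid, which is one plausible reading of how restricting attention to high plausibility regions reduces computation.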
