Continuous speech recognition based on high plausibility regions
- 1 January 1991
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- No. 15206149,p. 725-728 vol.1
- https://doi.org/10.1109/icassp.1991.150442
Abstract
The authors propose an approach to phoneme-based continuous speech recognition when a time function of the plausibility of observing each phoneme (spotting result) is given. They introduce a criterion for the best sentence, based on the sum of plausibilities of individual symbols composing the sentence. Based on the idea of making use of high plausibility regions to reduce the computational load while maintaining optimality, the method finds the most plausible sentences relating to the input speech. Two optimization procedures are defined to deal with the following embedded search processes: (1) finding the best path connecting peaks of the plausibility functions of two successive symbols, and (2) finding the best time transition slot index for two given peaks. Experimental results show that the method gives better recognition precision while requiring about 1/20 of the computing time of the traditional DP-based methods. The experimental system obtained a 95% sentence recognition rate on a multispeaker test.<>Keywords
This publication has 8 references indexed in Scilit:
- Spotting Japanese CV-syllables and phonemes using the time-delay neural networksPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2003
- Connectionist Viterbi training: a new hybrid method for continuous speech recognitionPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Large vocabulary recognition using linked predictive neural networksPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Continuous speech recognition using multilayer perceptrons with hidden Markov modelsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Signal-to-string conversion based on high likelihood regions using embedded dynamic programmingPublished by Institute of Electrical and Electronics Engineers (IEEE) ,1991
- Neural network coupled with IIR sequential adapter for phoneme recognition in continuous speechPublished by Institute of Electrical and Electronics Engineers (IEEE) ,1991
- Non-linear vector interpolation by neural network for phoneme identification in continuous speechPublished by Institute of Electrical and Electronics Engineers (IEEE) ,1991
- The viterbi algorithmProceedings of the IEEE, 1973