Performance improvement in a dynamic-programming-based isolated word recognition system for the alpha-digit task
- 24 March 2005
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- Vol. 7, 558-561
- https://doi.org/10.1109/icassp.1982.1171769
Abstract
In isolated word recognition, the alpha-digit vocabulary has been recognized as one of the most difficult due to the acoustic similarities of the lexical entries. The purpose of this study is two-fold: a) To see how a dynamic programming approach can be augmented with phonetic information to improve recognition accuracy of the alpha-digits; and b) To minimize computational requirements for the recognition task. Performance improvement is accomplished by dividing the vocabulary into subsets based on the syllabic patterns of the words and by emphasizing the consonant-vowel transitional regions of the words. This algorithm has been tested on 10 speakers, 5 male and 5 female. The division of the vocabulary results in a substantial savings in computation with essentially no decrease in recognition accuracy. In addition to computational savings, emphasizing the transitional portions of the word in some cases results in accuracy improvement. Discussion of the results and suggestions for further improvement are presented.Keywords
This publication has 6 references indexed in Scilit:
- Isolated word recognition using a two-pass pattern recognition approachPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2005
- Using zero crossing counts to provide discriminative information in isolated word recognitionThe Journal of the Acoustical Society of America, 1981
- An improved endpoint detector for isolated word recognitionIEEE Transactions on Acoustics, Speech, and Signal Processing, 1981
- Computational cost of DP algorithms in speech recognitionThe Journal of the Acoustical Society of America, 1981
- Considerations in dynamic time warping algorithms for discrete word recognitionIEEE Transactions on Acoustics, Speech, and Signal Processing, 1978
- Minimum prediction residual principle applied to speech recognitionIEEE Transactions on Acoustics, Speech, and Signal Processing, 1975