An improved search algorithm using incremental knowledge for continuous speech recognition
- 1 January 1993
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- Vol. 2, 307-310 vol.2
- https://doi.org/10.1109/icassp.1993.319298
Abstract
A search algorithm that incrementally makes effective use of detailed sources of knowledge is proposed. The algorithm incrementally applies all available acoustic and linguistic information in three search phases. Phase one is a left-to-right Viterbi beam search that produces word end times and scores using right context between-word models with a bigram language model. Phase two, guided by results from phase one, is a right-to-left Viterbi beam search that produces word begin times and scores based on left context between-word models. Phase three is an A* search that combines the results of phases one and two with a long-distance language model. The objective is to maximize the recognition accuracy with a minimal increase in computational cost. With the decomposed, incremental, search algorithm, it is shown that early use of detailed acoustic models can significantly reduce the recognition error rate with a negligible increase in computational cost. It is demonstrated that the early use of detailed knowledge can improve the word error bound by at least 22% for large-vocabulary, speaker-independent, continuous speech recognition.Keywords
This publication has 8 references indexed in Scilit:
- Obtaining candidate words by polling in a large vocabulary speech recognition systemPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2003
- The N-best algorithms: an efficient and exact procedure for finding the N most likely sentence hypothesesPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- The SPHINX-II speech recognition system: an overviewComputer Speech & Language, 1993
- Phoneme classification using semicontinuous hidden Markov modelsIEEE Transactions on Signal Processing, 1992
- An Efficient A* Stack Decoder Algorithm for Continuous Speech Recognition with a Stochastic Language Model.Published by Defense Technical Information Center (DTIC) ,1991
- Recent progress on the VOYAGER systemPublished by Association for Computational Linguistics (ACL) ,1990
- A Maximum Likelihood Approach to Continuous Speech RecognitionPublished by Institute of Electrical and Electronics Engineers (IEEE) ,1983
- Error bounds for convolutional codes and an asymptotically optimum decoding algorithmIEEE Transactions on Information Theory, 1967