An improved search algorithm using incremental knowledge for continuous speech recognition

1 January 1993

conference paper
Published by Institute of Electrical and Electronics Engineers (IEEE)

Vol. 2, 307-310 vol.2
https://doi.org/10.1109/icassp.1993.319298

Abstract

A search algorithm that incrementally makes effective use of detailed sources of knowledge is proposed. The algorithm incrementally applies all available acoustic and linguistic information in three search phases. Phase one is a left-to-right Viterbi beam search that produces word end times and scores using right context between-word models with a bigram language model. Phase two, guided by results from phase one, is a right-to-left Viterbi beam search that produces word begin times and scores based on left context between-word models. Phase three is an A* search that combines the results of phases one and two with a long-distance language model. The objective is to maximize the recognition accuracy with a minimal increase in computational cost. With the decomposed, incremental, search algorithm, it is shown that early use of detailed acoustic models can significantly reduce the recognition error rate with a negligible increase in computational cost. It is demonstrated that the early use of detailed knowledge can improve the word error bound by at least 22% for large-vocabulary, speaker-independent, continuous speech recognition.

Keywords

This publication has 8 references indexed in Scilit:

Obtaining candidate words by polling in a large vocabulary speech recognition system
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2003
The N-best algorithms: an efficient and exact procedure for finding the N most likely sentence hypotheses
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2002
The SPHINX-II speech recognition system: an overview
Computer Speech & Language, 1993
Phoneme classification using semicontinuous hidden Markov models
IEEE Transactions on Signal Processing, 1992
An Efficient A* Stack Decoder Algorithm for Continuous Speech Recognition with a Stochastic Language Model.
Published by Defense Technical Information Center (DTIC) ,1991
Recent progress on the VOYAGER system
Published by Association for Computational Linguistics (ACL) ,1990
A Maximum Likelihood Approach to Continuous Speech Recognition
Published by Institute of Electrical and Electronics Engineers (IEEE) ,1983
Error bounds for convolutional codes and an asymptotically optimum decoding algorithm
IEEE Transactions on Information Theory, 1967