Efficient decoding and training procedures for utterance verification in continuous speech recognition
- 24 December 2002
- proceedings article
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- Vol. 1, 507-510
- https://doi.org/10.1109/icassp.1996.541144
Abstract
It is often necessary in speech recognition to include a mechanism for verifying decoded utterances in order to account for incorrectly decoded vocabulary words and utterances corresponding to words or sounds that are not included in a prespecified lexicon. This paper describes an utterance verification procedure for hidden Markov model (HMM) based continuous speech recognition that is based on a likelihood ratio (LR) criterion. There are two important contributions. The first is a search algorithm which directly optimizes a likelihood ratio criterion. This search algorithm is important because it allows decoding to be performed in speech recognition according to the same measure of confidence that is used in hypothesis testing. The second contribution is a corresponding training procedure for estimating model parameters which also directly optimizes the same likelihood ratio criterion. These techniques are applied to spontaneous spoken queries in the context of a "movie locator" dialog system.
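As a rough illustration of the hypothesis test that a likelihood ratio criterion implies, the sketch below accepts or rejects a decoded segment by comparing a duration-normalized log-likelihood ratio against a threshold. It is not the decoding or training procedure of the paper, which the abstract does not spell out. The function names, the per-frame normalization, and the threshold value are all assumptions made for illustration, and the per-frame log-likelihoods are taken as given rather than computed from HMMs.

```python
from dataclasses import dataclass
from typing import Sequence


@dataclass
class VerificationResult:
    score: float    # average per-frame log-likelihood ratio
    accepted: bool  # True if the decoded hypothesis is kept


def log_likelihood_ratio(target_loglik: Sequence[float],
                         alternate_loglik: Sequence[float]) -> float:
    """Average per-frame log-likelihood ratio for one decoded segment.

    target_loglik[t]    : log p(frame t | model of the decoded word)
    alternate_loglik[t] : log p(frame t | alternate hypothesis model)
    Both sequences are hypothetical inputs; in practice they would come
    from the recognizer's acoustic models.
    """
    if len(target_loglik) != len(alternate_loglik):
        raise ValueError("both models must score the same frames")
    if not target_loglik:
        raise ValueError("segment must contain at least one frame")
    total = sum(t - a for t, a in zip(target_loglik, alternate_loglik))
    # Dividing by the frame count (an assumed normalization) keeps scores
    # comparable across segments of different duration.
    return total / len(target_loglik)


def verify(target_loglik: Sequence[float],
           alternate_loglik: Sequence[float],
           threshold: float = 0.0) -> VerificationResult:
    """Accept the decoded hypothesis when the ratio reaches the threshold."""
    score = log_likelihood_ratio(target_loglik, alternate_loglik)
    return VerificationResult(score=score, accepted=score >= threshold)


if __name__ == "__main__":
    # Target model fits better than the alternate model: accepted.
    print(verify([-1.0, -2.0], [-2.0, -4.0]))
    # Alternate model fits better: rejected.
    print(verify([-3.0], [-1.0]))
```

In a deployed system the threshold would be tuned on held-out data to trade false acceptances against false rejections; the default of 0.0 here is only a placeholder.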
This publication has 2 references indexed in Scilit:
- Continuous Speech Interface for a Movie Locator Service. Proceedings of the Human Factors and Ergonomics Society Annual Meeting, 1995
- A vocabulary independent discriminatively trained method for rejection of non-keywords in sub word based speech recognition. Published by International Speech Communication Association, 1995