Phonetic training and language modeling for word spotting

1 January 1993

conference paper
Published by Institute of Electrical and Electronics Engineers (IEEE)

Vol. 2, pp. 459-462
https://doi.org/10.1109/icassp.1993.319340

Abstract

The authors present a view of HMM (hidden Markov model)-based word spotting systems as described by three main components: the HMM acoustic model; the overall HMM structure, including nonkeyword modeling; and the keyword scoring method. They investigate and present comparative results for various approaches to each of these components and show that design choices for these components can be addressed separately. They also present a novel approach to word spotting that combines phonetic training, large vocabulary modeling, and statistical language modeling with a posterior probability approach to keyword scoring. They perform word spotting experiments using telephone quality conversational speech from the Switchboard corpus to examine the effect of different design choices for the three components and demonstrate that the proposed approach provides superior performance to previously used techniques.

References

Continuous hidden Markov modeling for speaker-independent word spotting
Published by Institute of Electrical and Electronics Engineers (IEEE), 2003
Continuous speech recognition results of the BYBLOS system on the DARPA 1000-word resource management database
Published by Institute of Electrical and Electronics Engineers (IEEE), 2003
A hidden Markov model based keyword recognition system
Published by Institute of Electrical and Electronics Engineers (IEEE), 2002
SWITCHBOARD: telephone speech corpus for research and development
Published by Institute of Electrical and Electronics Engineers (IEEE), 1992
The forward-backward search algorithm
Published by Institute of Electrical and Electronics Engineers (IEEE), 1991
Automatic recognition of keywords in unconstrained speech using hidden Markov models
IEEE Transactions on Acoustics, Speech, and Signal Processing, 1990