Phonetic training and language modeling for word spotting
- 1 January 1993
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- Vol. 2, 459-462 vol.2
- https://doi.org/10.1109/icassp.1993.319340
Abstract
The authors present a view of HMM (hidden Markov model)-based word spotting systems as described by three main components: the HMM acoustic model; the overall HMM structure, including nonkeyword modeling; and the keyword scoring method. They investigate and present comparative results for various approaches to each of these components and show that design choices for these components can be addressed separately. They also present a novel approach to word spotting that combines phonetic training, large vocabulary modeling, and statistical language modeling with a posterior probability approach to keyword scoring. They perform word spotting experiments using telephone quality conversational speech from the Switchboard corpus to examine the effect of different design choices for the three components and demonstrate that the proposed approach provides superior performance to previously used techniques.<>Keywords
This publication has 6 references indexed in Scilit:
- Continuous hidden Markov modeling for speaker-independent word spottingPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2003
- Continuous speech recognition results of the BYBLOS system on the DARPA 1000-word resource management databasePublished by Institute of Electrical and Electronics Engineers (IEEE) ,2003
- A hidden Markov model based keyword recognition systemPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- SWITCHBOARD: telephone speech corpus for research and developmentPublished by Institute of Electrical and Electronics Engineers (IEEE) ,1992
- The forward-backward search algorithmPublished by Institute of Electrical and Electronics Engineers (IEEE) ,1991
- Automatic recognition of keywords in unconstrained speech using hidden Markov modelsIEEE Transactions on Acoustics, Speech, and Signal Processing, 1990