Training and search algorithms for an interactive wordspotting system

1 January 1992

conference paper
Published by Institute of Electrical and Electronics Engineers (IEEE)

Vol. 2, pp. 97-100 (ISSN 1520-6149)
https://doi.org/10.1109/icassp.1992.226111

Abstract

Algorithms for a speaker-dependent wordspotting system based on hidden Markov models (HMMs) are described. The system allows a user to specify keywords dynamically and to train the associated HMMs via a single repetition of a keyword. Nonkeyword speech is modeled using an HMM trained from a prerecorded sample of continuous speech. The wordspotter is intended for interactive applications, such as the editing of voice mail or mixed-media documents, and for keyword indexing in audio or video recordings. The forward-backward search algorithm used in the wordspotter is compared with the Viterbi decoder on the basis of speed and accuracy. In addition, an algorithm for speaker adaptation is described which allows indexing by a user into another talker's speech.
