Training and search algorithms for an interactive wordspotting system
- 1 January 1992
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- Vol. 2 (15206149), 97-100 vol.2
- https://doi.org/10.1109/icassp.1992.226111
Abstract
Algorithms for a speaker-dependent wordspotting system based on hidden Markov models (HMMs) are described. The system allows a user to specify keywords dynamically and to train the associated HMMs via a single repetition of a keyword. Nonkeyword speech is modeled using an HMM trained from a prerecorded sample of continuous speech. The wordspotter is intended for interactive applications, such as the editing of voice mail or mixed-media documents, and for keyword indexing in audio or video recordings. The forward-backward search algorithm used in the wordspotter is compared with the Viterbi decoder on the basis of speed and accuracy. In addition, an algorithm for speaker adaptation is described which allows indexing by a user into another talker's speech.
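The abstract contrasts a forward-backward search with a Viterbi decoder. As a rough illustration of the underlying difference, the sketch below scores one observation sequence on a toy discrete HMM in two ways: the forward pass sums probability over all state paths, while Viterbi keeps only the single best path. This is a generic textbook formulation, not the search algorithm of the paper; the two-state model, its probabilities, and all function and variable names are assumptions made up for illustration.

```python
# Toy comparison of forward (sum over paths) and Viterbi (best path) scoring.
# All model parameters below are hypothetical and chosen only for illustration.

def forward_likelihood(init, trans, emit, obs):
    """Return P(obs), summed over all state paths (forward pass)."""
    n = len(init)
    alpha = [init[s] * emit[s][obs[0]] for s in range(n)]
    for o in obs[1:]:
        alpha = [
            sum(alpha[r] * trans[r][s] for r in range(n)) * emit[s][o]
            for s in range(n)
        ]
    return sum(alpha)


def viterbi(init, trans, emit, obs):
    """Return the most probable state path and its joint probability."""
    n = len(init)
    delta = [init[s] * emit[s][obs[0]] for s in range(n)]
    backptrs = []
    for o in obs[1:]:
        prev = delta
        # Best predecessor for each state at this time step.
        ptr = [max(range(n), key=lambda r: prev[r] * trans[r][s]) for s in range(n)]
        delta = [prev[ptr[s]] * trans[ptr[s]][s] * emit[s][o] for s in range(n)]
        backptrs.append(ptr)
    # Trace back from the best final state.
    last = max(range(n), key=lambda s: delta[s])
    path = [last]
    for ptr in reversed(backptrs):
        path.append(ptr[path[-1]])
    path.reverse()
    return path, delta[last]


# Hypothetical two-state model: state 0 ~ "background", state 1 ~ "keyword".
init = [0.6, 0.4]
trans = [[0.7, 0.3], [0.4, 0.6]]
emit = [[0.9, 0.1], [0.2, 0.8]]  # emit[state][symbol]
obs = [0, 1, 1]

total = forward_likelihood(init, trans, emit, obs)
path, best_p = viterbi(init, trans, emit, obs)
print("forward likelihood:", total)
print("Viterbi path:", path, "probability:", best_p)
```

The Viterbi probability can never exceed the forward likelihood, since the best path is one term of the sum. Practical systems typically work with log probabilities or scaling to avoid underflow on long sequences; that is omitted here for brevity.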
This publication has 12 references indexed in Scilit:
- Partial traceback and dynamic programming. Published by Institute of Electrical and Electronics Engineers (IEEE), 2005
- Unsupervised speaker adaptation method based on hierarchical spectral clustering. Published by Institute of Electrical and Electronics Engineers (IEEE), 2003
- Continuous hidden Markov modeling for speaker-independent word spotting. Published by Institute of Electrical and Electronics Engineers (IEEE), 2003
- The DARPA 1000-word resource management database for continuous speech recognition. Published by Institute of Electrical and Electronics Engineers (IEEE), 2003
- Acoustic Markov models used in the Tangora speech recognition system. Published by Institute of Electrical and Electronics Engineers (IEEE), 2003
- A hidden Markov model based keyword recognition system. Published by Institute of Electrical and Electronics Engineers (IEEE), 2002
- Automatic recognition of keywords in unconstrained speech using hidden Markov models. IEEE Transactions on Acoustics, Speech, and Signal Processing, 1990
- A tutorial on hidden Markov models and selected applications in speech recognition. Proceedings of the IEEE, 1989
- Vector quantization. IEEE ASSP Magazine, 1984
- Optimal Fuzzy Partitions: A Heuristic for Estimating the Parameters in a Mixture of Normal Distributions. IEEE Transactions on Computers, 1975