A hybrid neural network, dynamic programming word spotter
- 1 January 1992
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- Vol. 2 (15206149) , 77-80 vol.2
- https://doi.org/10.1109/icassp.1992.226116
Abstract
A novel keyword-spotting system that combines both neural network and dynamic programming techniques is presented. This system makes use of the strengths of time delay neural networks (TDNNs), which include strong generalization ability, potential for parallel implementations, robustness to noise, and time shift invariant learning. Dynamic programming models are used by this system because they have the useful capability of time warping input speech patterns. This system was trained and tested on the Stonehenge Road Rally database, which is a 20-keyword-vocabulary, speaker-independent, continuous-speech corpus. Currently, this system performs at a figure of merit (FOM) rate of 82.5%. FOM is the detection rate averaged from 0 to 10 false alarms per keyword hour. This measure is explained in detail.Keywords
This publication has 4 references indexed in Scilit:
- Consonant recognition by modular construction of large phonemic time-delay neural networksPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2003
- A hidden Markov model based keyword recognition systemPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Multiple neural network topologies applied to keyword spottingPublished by Institute of Electrical and Electronics Engineers (IEEE) ,1991
- Improvements and applications for key word recognition using hidden Markov modeling techniquesPublished by Institute of Electrical and Electronics Engineers (IEEE) ,1991