Multiple neural network topologies applied to keyword spotting
- 1 January 1991
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- No. 15206149, p. 313-316, vol. 1
- https://doi.org/10.1109/icassp.1991.150339
Abstract
The authors describe several experiments in which the use of artificial neural networks (ANNs) for the continuous speech speaker-independent keyword recognition problem was investigated. They discuss methodologies for reducing a primary keyword spotting system's susceptibility to false alarms while maintaining recognition accuracy. The keyword spotter uses a conventional dynamic time warping algorithm to detect the start- and end-point of each potential keyword. The ANNs serve as a secondary processing stage for this segmented utterance. The ANNs attempt to classify this utterance by formulating the recognition problem as a pattern matching problem. In the hybrid network experiments, the utterance was processed into features derived from the activation at the hidden layer of a back-propagation trained network. Hybrid representations were grouped with two other feature representations in a multiple neural network system. A recognition accuracy of 78% on the Stonehenge X database was obtained while rejecting 72% of the false alarms which were detected by the primary keyword spotting system.
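The two-stage arrangement described in the abstract (a dynamic time warping front end that proposes keyword segments, followed by a secondary classifier that may reject them) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `dtw_distance` and `spot_and_verify`, the Euclidean frame distance, the threshold, and the `verifier` callable standing in for the ANN stage are all assumptions made for illustration. The sketch also compares an already segmented utterance against one template, whereas the paper's spotter locates start and end points in continuous speech.

```python
def dtw_distance(a, b):
    """Textbook DTW cost between two sequences of feature vectors.

    `a` and `b` are lists of equal-dimension float lists (one per frame).
    The Euclidean local distance is an assumption, not taken from the paper.
    """
    inf = float("inf")
    n, m = len(a), len(b)
    # cost[i][j] = minimal accumulated distance aligning a[:i] with b[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = sum((x - y) ** 2 for x, y in zip(a[i - 1], b[j - 1])) ** 0.5
            # Allow insertion, deletion, or a diagonal match step.
            cost[i][j] = d + min(cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1])
    return cost[n][m]


def spot_and_verify(segment, template, dtw_threshold, verifier):
    """Hypothetical two-stage decision: DTW proposes, a second stage confirms.

    `verifier` is any callable returning True or False for a segment; here it
    stands in for the secondary neural network stage.
    """
    if dtw_distance(segment, template) > dtw_threshold:
        return False  # primary stage found no putative keyword
    return bool(verifier(segment))  # secondary stage may reject a false alarm
```

With this structure, a segment that the primary stage accepts is reported as a keyword only if the secondary stage also accepts it, which is how false alarms from the first stage can be filtered out at some cost in recognition accuracy.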
This publication has 6 references indexed in Scilit:
- On the use of neural networks for speaker independent isolated word recognition. Published by Institute of Electrical and Electronics Engineers (IEEE), 2003
- A keyword spotter which incorporates neural networks for secondary processing. Published by Institute of Electrical and Electronics Engineers (IEEE), 2002
- Neural Networks and Speech Processing. Published by Springer Nature, 1991
- Learning representations by back-propagating errors. Nature, 1986
- A level building dynamic time warping algorithm for connected word recognition. IEEE Transactions on Acoustics, Speech, and Signal Processing, 1981
- Two-level DP-matching--A dynamic programming-based pattern matching algorithm for connected word recognition. IEEE Transactions on Acoustics, Speech, and Signal Processing, 1979