Multiple neural network topologies applied to keyword spotting

1 January 1991

conference paper
Published by Institute of Electrical and Electronics Engineers (IEEE)

No. 15206149, pp. 313-316, vol. 1
https://doi.org/10.1109/icassp.1991.150339

Abstract

The authors describe several experiments investigating the use of artificial neural networks (ANNs) for speaker-independent keyword recognition in continuous speech. They discuss methods for reducing a primary keyword spotting system's susceptibility to false alarms while maintaining recognition accuracy. The primary keyword spotter uses a conventional dynamic time warping algorithm to detect the start and end points of each potential keyword. The ANNs serve as a secondary processing stage for each segmented utterance, which they attempt to classify by treating recognition as a pattern matching problem. In the hybrid network experiments, the utterance was converted into features derived from the hidden-layer activations of a network trained with back-propagation. Hybrid representations were combined with two other feature representations in a multiple neural network system. The system achieved a recognition accuracy of 78% on the Stonehenge X database while rejecting 72% of the false alarms detected by the primary keyword spotting system.
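
The two-stage arrangement described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. The function names, the threshold, the single sigmoid hidden layer, and the use of Euclidean frame distance are all assumptions made for illustration. The DTW here scores a pre-cut segment against a template, which is simpler than detecting keyword endpoints in continuous speech.

```python
import math

def dtw_distance(template, segment):
    """Textbook DTW cost between two sequences of feature vectors.

    Assumption: Euclidean distance between frames and the basic
    (insert, delete, match) step pattern; the paper's exact variant
    is not specified in the abstract.
    """
    n, m = len(template), len(segment)
    inf = float("inf")
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = math.dist(template[i - 1], segment[j - 1])
            cost[i][j] = d + min(cost[i - 1][j],      # template frame repeated
                                 cost[i][j - 1],      # segment frame repeated
                                 cost[i - 1][j - 1])  # both advance
    return cost[n][m]

def hidden_features(frame, weights, biases):
    """Sigmoid activations of one hidden layer, used as derived features.

    Hypothetical layout: `weights` holds one row of input weights per
    hidden unit, `biases` one bias per hidden unit.
    """
    feats = []
    for row, b in zip(weights, biases):
        z = sum(w * x for w, x in zip(row, frame)) + b
        feats.append(1.0 / (1.0 + math.exp(-z)))
    return feats

def spot_and_verify(template, segment, dtw_threshold, verifier):
    """Stage 1 (DTW) proposes a keyword; stage 2 (verifier) may reject it.

    `verifier` stands in for the secondary ANN classifier: any callable
    that takes the segment and returns True (keyword) or False (false alarm).
    """
    if dtw_distance(template, segment) > dtw_threshold:
        return False  # the primary spotter never fired
    return bool(verifier(segment))
```

In this sketch a false alarm from the first stage is rejected only if the verifier returns False, which mirrors the role the abstract assigns to the secondary ANN stage.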
