Speech discrimination in adverse conditions using acoustic knowledge and selectively trained neural networks

1 January 1993

conference paper
Published by Institute of Electrical and Electronics Engineers (IEEE)

Vol. 2 (15206149) , 279-282 vol.2
https://doi.org/10.1109/icassp.1993.319290

Abstract

It is demonstrated that the STNN (selectively trained neural network) method improves confusable work discrimination. Tests conducted on clean and Lombard-noisy speech show that using only a small part (two frames) of the work where useful information for discrimination is located is more efficient than taking into account the whole word. Recognition scores obtained with a continuous-density HMM (hidden Markov model) are lower than those obtained with the proposed method. The present results show an increase in recognition accuracy for the tests on Lombard-noisy speech when the training is done on clean, Lombard, and Lombard-noisy speech. Furthermore, if the same noise is used for the training and the test, the STNN performances improve far more than those of the HMM. The STNN method does not need any precise detection of word boundaries. This influences the robustness of the method, especially in noisy conditions.

Keywords

This publication has 5 references indexed in Scilit:

Towards a general signal interpretation system-signal-to-symbol conversion level
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2002
Speaker-independent vowel recognition: spectrograms versus cochleagrams
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2002
A time-delay neural network architecture for isolated word recognition
Neural Networks, 1990
Cepstral domain talker stress compensation for robust speech recognition
IEEE Transactions on Acoustics, Speech, and Signal Processing, 1988
Low-dimensional representation of vowels based on all-pole modeling in the psychophysical domain
Speech Communication, 1985