Speech discrimination in adverse conditions using acoustic knowledge and selectively trained neural networks
- 1 January 1993
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- Vol. 2 (15206149) , 279-282 vol.2
- https://doi.org/10.1109/icassp.1993.319290
Abstract
It is demonstrated that the STNN (selectively trained neural network) method improves confusable work discrimination. Tests conducted on clean and Lombard-noisy speech show that using only a small part (two frames) of the work where useful information for discrimination is located is more efficient than taking into account the whole word. Recognition scores obtained with a continuous-density HMM (hidden Markov model) are lower than those obtained with the proposed method. The present results show an increase in recognition accuracy for the tests on Lombard-noisy speech when the training is done on clean, Lombard, and Lombard-noisy speech. Furthermore, if the same noise is used for the training and the test, the STNN performances improve far more than those of the HMM. The STNN method does not need any precise detection of word boundaries. This influences the robustness of the method, especially in noisy conditions.Keywords
This publication has 5 references indexed in Scilit:
- Towards a general signal interpretation system-signal-to-symbol conversion levelPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Speaker-independent vowel recognition: spectrograms versus cochleagramsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- A time-delay neural network architecture for isolated word recognitionNeural Networks, 1990
- Cepstral domain talker stress compensation for robust speech recognitionIEEE Transactions on Acoustics, Speech, and Signal Processing, 1988
- Low-dimensional representation of vowels based on all-pole modeling in the psychophysical domainSpeech Communication, 1985