Environmental robustness in automatic speech recognition
- 4 December 2002
- proceedings article
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- Vol. 27, 849-852
- https://doi.org/10.1109/icassp.1990.115971
Abstract
In this paper we report our initial efforts to make SPHINX, the CMU continuous-speech speaker-independent recognition system, robust to changes in the environment. To deal with differences in noise level and spectral tilt between closc-tcking atid desk-top microphones, we propose two novel methods based on additive corrections in the cepstral domain. In the first algorithm, the additive correction depends on the instantaneous SNR of the signal. In the second technique, EM techniques are used to bes~ match the cepstral vectors of the input utter.mces to the ensemble of codebook entries representing a standard acoustical ambience. Use of the proposed algorithms dramatically improves recognition accuracy when the system is tested on a microphone other than the one on which it was trained. plicitly. In this paper we present two algorithms for speech normalization based on additive corrections in the cepstral domain. We have chosen the cepstral domain rather than the frequency domain so that we work directly with the parameters that SPHINX uses, and because speech can be characterized with a smaller number of parameters in the cepstral domain than in the frequency domain. The first algorithm, SNR-deperidenf cepstral normalization (SDCN) is simple and effective, but it cannot be applied to new microphones without microphone-specific training. The second algorithm, codeword-deperident cepstral norntalizution (CDCN) computes an ML estimate for the noise and spectral tilt, and then an MMSE estimate for the speech cepstrum. These algorithms are evaluated using an alphanumeric database in which utterances were recorded simultaneously with two different microphones.Keywords
This publication has 7 references indexed in Scilit:
- Optimal estimators for spectral restoration of noisy speechPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2005
- The SPHINX speech recognition systemPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2003
- Noise adaptation in a hidden Markov model speech recognition systemComputer Speech & Language, 1989
- Acoustical pre-processing for robust speech recognitionPublished by Association for Computational Linguistics (ACL) ,1989
- Spectral estimation for noise robust speech recognitionPublished by Association for Computational Linguistics (ACL) ,1989
- Suppression of acoustic noise in speech using spectral subtractionIEEE Transactions on Acoustics, Speech, and Signal Processing, 1979
- Blind deconvolution through digital signal processingProceedings of the IEEE, 1975