Robust speech recognition based on stochastic matching
- 19 November 2002
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- Vol. 1 (ISSN 1520-6149), pp. 121-124
- https://doi.org/10.1109/icassp.1995.479288
Abstract
We present a maximum likelihood (ML) stochastic matching approach to decrease the acoustic mismatch between a test utterance $Y$ and a given set of speech hidden Markov models $\Lambda_X$ so as to reduce the recognition performance degradation caused by possible distortions in the test utterance. This mismatch may be reduced in two ways: (1) by an inverse distortion function $F_\nu(\cdot)$ that maps $Y$ into an utterance $X$ which matches better with the models $\Lambda_X$, and (2) by a model transformation function $G_\eta(\cdot)$ that maps $\Lambda_X$ to the transformed model $\Lambda_Y$ which matches better with the utterance $Y$. The functional form of the transformations depends upon our prior knowledge about the mismatch, and the parameters are estimated along with the recognized string in a maximum likelihood manner using the EM algorithm. Experimental results verify the efficacy of the approach in improving the performance of a continuous speech recognition system in the presence of mismatch due to different transducers and transmission channels.
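As an illustration of approach (1), the sketch below estimates the parameters of an inverse distortion function with EM under strong simplifying assumptions that are not taken from the abstract: the distortion is a single additive bias vector `b` (so that X = Y - b), and the speech model is a diagonal-covariance Gaussian mixture instead of a full set of HMMs with a jointly decoded word string. The function name `estimate_bias` and all of its parameters are hypothetical.

```python
import numpy as np

def estimate_bias(Y, weights, means, variances, n_iter=20):
    """EM estimate of an additive bias b such that Y - b best fits a GMM.

    Y:         (T, D) observed (distorted) feature frames
    weights:   (K,)   mixture weights of the clean-speech model
    means:     (K, D) component means
    variances: (K, D) diagonal component variances
    """
    b = np.zeros(Y.shape[1])
    inv_var = 1.0 / variances
    for _ in range(n_iter):
        # E-step: component posteriors for the compensated frames Y - b.
        diff = (Y - b)[:, None, :] - means[None, :, :]          # (T, K, D)
        log_p = (np.log(weights)[None, :]
                 - 0.5 * np.sum(np.log(2.0 * np.pi * variances), axis=1)[None, :]
                 - 0.5 * np.sum(diff ** 2 * inv_var[None, :, :], axis=2))
        log_p -= log_p.max(axis=1, keepdims=True)               # numerical stability
        gamma = np.exp(log_p)
        gamma /= gamma.sum(axis=1, keepdims=True)               # (T, K)

        # M-step: closed-form bias update, a precision-weighted average
        # of the residuals between observed frames and component means.
        resid = (Y[:, None, :] - means[None, :, :]) * inv_var[None, :, :]
        num = np.einsum('tk,tkd->d', gamma, resid)
        den = np.einsum('tk,kd->d', gamma, inv_var)
        b = num / den
    return b
```

Subtracting the returned `b` from every frame of `Y` gives the compensated utterance. A model-space variant in the sense of approach (2) would apply the estimated transformation to the model parameters and leave the features unchanged.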
This publication has 11 references indexed in Scilit:
- Unsupervised speaker adaptation by probabilistic spectrum fitting. Institute of Electrical and Electronics Engineers (IEEE), 2003
- Signal bias removal for robust telephone based speech recognition in adverse environments. Institute of Electrical and Electronics Engineers (IEEE), 2002
- A maximum-likelihood approach to stochastic matching for robust speech recognition. IEEE Transactions on Speech and Audio Processing, 1996
- Stochastic matching for robust speech recognition. IEEE Signal Processing Letters, 1994
- Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains. IEEE Transactions on Speech and Audio Processing, 1994
- Integrated models of signal and background with application to speaker identification in noise. IEEE Transactions on Speech and Audio Processing, 1994
- A new speaker adaptation technique using very short calibration speech. Institute of Electrical and Electronics Engineers (IEEE), 1993
- Speech recognition in adverse environments. Computer Speech & Language, 1991
- Acoustic modeling for large vocabulary speech recognition. Computer Speech & Language, 1990
- Maximum-Likelihood Estimation for Mixture Multivariate Stochastic Observations of Markov Chains. AT&T Technical Journal, 1985