Robust speech recognition based on stochastic matching

Abstract
We present a maximum likelihood (ML) stochastic matching approach to decrease the acoustic mismatch between a test utterance Y and a given set of speech hidden Markov models Λ_X, so as to reduce the recognition performance degradation caused by possible distortions in the test utterance. This mismatch may be reduced in two ways: (1) by an inverse distortion function F_ν(·) that maps Y into an utterance X which matches better with the models Λ_X, and (2) by a model transformation function G_η(·) that maps Λ_X to the transformed models Λ_Y which match better with the utterance Y. The functional form of the transformations depends upon our prior knowledge about the mismatch, and the parameters are estimated along with the recognized string in a maximum likelihood manner using the EM algorithm. Experimental results verify the efficacy of the approach in improving the performance of a continuous speech recognition system in the presence of mismatch due to different transducers and transmission channels.
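As a minimal sketch of the feature-space variant, suppose the distortion is an additive cepstral bias ν, so the inverse distortion is F_ν(y) = y − ν, and suppose (as a simplifying assumption not made in the paper, which uses HMMs) that Λ_X is a Gaussian mixture with known means, variances, and weights. EM then alternates a posterior (E) step with a closed-form, variance-weighted update of the bias (M step). All names and the single-mixture setup below are illustrative, not the authors' code:

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical clean-speech "model" Lambda_X: a 3-component diagonal Gaussian
# mixture over 1-D features (stand-in for the paper's HMM state densities).
means = np.array([[-2.0], [1.0], [4.0]])      # K x D component means
variances = np.array([[0.5], [0.5], [0.5]])   # K x D diagonal variances
weights = np.array([0.3, 0.4, 0.3])           # mixture weights

# Simulate a distorted utterance: clean frames X plus an unknown channel
# bias nu, so Y = X + nu and the inverse distortion is F_nu(y) = y - nu.
true_bias = np.array([0.8])
comps = rng.choice(3, size=500, p=weights)
X = means[comps] + rng.normal(0.0, np.sqrt(variances[comps]))
Y = X + true_bias

def estimate_bias(Y, means, variances, weights, n_iter=20):
    """ML bias estimate via EM: alternate component posteriors (E-step)
    with a closed-form variance-weighted bias update (M-step)."""
    b = np.zeros(Y.shape[1])
    for _ in range(n_iter):
        Xhat = Y - b  # apply the current inverse distortion F_nu
        # E-step: log p(k | x_t) up to a constant, then normalize.
        log_p = (np.log(weights)
                 - 0.5 * np.sum(np.log(2 * np.pi * variances), axis=1)
                 - 0.5 * np.sum((Xhat[:, None, :] - means) ** 2 / variances,
                                axis=2))
        log_p -= log_p.max(axis=1, keepdims=True)
        gamma = np.exp(log_p)
        gamma /= gamma.sum(axis=1, keepdims=True)
        # M-step: b = sum_tk gamma_tk (y_t - mu_k)/var_k / sum_tk gamma_tk/var_k
        num = np.einsum('tk,tkd->d', gamma, (Y[:, None, :] - means) / variances)
        den = np.einsum('tk,kd->d', gamma, 1.0 / variances)
        b = num / den
    return b

b_hat = estimate_bias(Y, means, variances, weights)
print(b_hat)  # should land near true_bias
```

In the full method the posteriors come from the HMM forward-backward pass over the current best string hypothesis, and the bias update is interleaved with re-decoding; the fixed-mixture sketch above only isolates the EM bias estimation step.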