Robust speech recognition based on joint model and feature space optimization of hidden Markov models
- 1 March 1997
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Neural Networks
- Vol. 8 (2) , 194-204
- https://doi.org/10.1109/72.557656
Abstract
The hidden Markov model (HMM) inversion algorithm, based on either the gradient search or the Baum-Welch reestimation of input speech features, is proposed and applied to the robust speech recognition tasks under general types of mismatch conditions. This algorithm stems from the gradient-based inversion algorithm of an artificial neural network (ANN) by viewing an HMM as a special type of ANN. Given input speech features s, the forward training of an HMM finds the model parameters /spl lambda/ subject to an optimization criterion. On the other hand, the inversion of an HMM finds speech features, s, subject to an optimization criterion with given model parameters /spl lambda/. The gradient-based HMM inversion and the Baum-Welch HMM inversion algorithms can be successfully integrated with the model space optimization techniques, such as the robust MINIMAX technique, to compensate the mismatch in the joint model and feature space. The joint space mismatch compensation technique achieves better performance than the single space, i.e. either the model space or the feature space alone, mismatch compensation techniques. It is also demonstrated that approximately 10-dB signal-to-noise ratio (SNR) gain is obtained in the low SNR environments when the joint model and feature space mismatch compensation technique is used.Keywords
This publication has 26 references indexed in Scilit:
- Noise reduction using connectionist modelsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2003
- Interactive query learning for isolated speech recognitionPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2003
- Nonlinear Control SystemsPublished by Springer Nature ,1995
- Discriminative learning for minimum error classification (pattern recognition)IEEE Transactions on Signal Processing, 1992
- Hidden Markov models with first-order equalization for noisy speech recognitionIEEE Transactions on Signal Processing, 1992
- Speech recognition in adverse environmentsComputer Speech & Language, 1991
- Query-based learning applied to partially trained multilayer perceptronsIEEE Transactions on Neural Networks, 1991
- Learning in Artificial Neural Networks: A Statistical PerspectiveNeural Computation, 1989
- A systolic neural network architecture for hidden Markov modelsIEEE Transactions on Acoustics, Speech, and Signal Processing, 1989
- Maximum-Likelihood Estimation for Mixture Multivariate Stochastic Observations of Markov ChainsAT&T Technical Journal, 1985