Hidden Markov models with first-order equalization for noisy speech recognition
- 1 January 1992
- journal article
- research article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Signal Processing
- Vol. 40 (9), 2136-2143
- https://doi.org/10.1109/78.157214
Abstract
Speech recognizers often experience serious performance degradation when deployed in an unknown acoustic (particularly, noise-contaminated) environment. To combat this problem, we proposed in a previous study a family of new distortion measures that were shown to be able to withstand additive white noise without requiring 1) explicit knowledge of the noise, 2) noise reduction provisions, or 3) reference template retraining. One particularly effective distortion measure in the family is the one that takes into account the norm shrinkage bias in the noisy cepstrum. In this paper, we incorporate a first-order equalization mechanism, specifically aimed at avoiding the norm shrinkage problem, in a hidden Markov model (HMM) framework to model the speech cepstral sequence. Such a modeling technique requires special care, as the formulation inevitably involves parameter estimation from a set of data with singular dispersion. We provide solutions to this HMM stochastic modeling problem and give algorithms for estimating the necessary model parameters. We experimentally show that incorporation of the first-order mean equalization model makes the HMM-based speech recognizer robust to noise. With respect to a conventional HMM recognizer, this leads to an improvement in recognition performance which is equivalent to about 15-20 dB gain in signal-to-noise ratio.
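The norm shrinkage idea in the abstract can be illustrated with a small sketch. It assumes, for illustration only, that the equalization is a single scalar factor applied to a reference mean vector and chosen by least squares; the paper's actual formulation inside the HMM is not given in this record and may differ. The function names, the example vectors, and the 40% shrink factor are all hypothetical.

```python
def euclidean_sq(y, mu):
    """Plain squared Euclidean distance between observation y and mean mu."""
    return sum((a - b) ** 2 for a, b in zip(y, mu))


def equalized_distance(y, mu):
    """Squared distance from y to the best scalar multiple of mu.

    Assumption (not taken from the paper): the scale factor is the
    least-squares value lam = <y, mu> / <mu, mu>.
    Returns (lam, residual).
    """
    dot_ymu = sum(a * b for a, b in zip(y, mu))
    dot_mumu = sum(b * b for b in mu)
    lam = dot_ymu / dot_mumu
    resid = sum((a - lam * b) ** 2 for a, b in zip(y, mu))
    return lam, resid


# Hypothetical clean reference mean (cepstral coefficients).
mu = [1.0, -0.5, 0.25, 0.1]
# Hypothetical noisy observation: same direction, norm shrunk to 40%.
y = [0.4 * m for m in mu]

lam, d_eq = equalized_distance(y, mu)
d_plain = euclidean_sq(y, mu)
print(f"scale={lam:.3f} equalized={d_eq:.6f} plain={d_plain:.4f}")
```

In this toy case the plain distance penalizes the shrunken vector even though its direction matches the reference, while the scaled comparison does not. That is the intuition behind compensating for norm shrinkage, not a reproduction of the reported results.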
This publication has 7 references indexed in Scilit:
- An Introduction to Hidden Markov Models. Current Protocols in Bioinformatics, 2007
- Recognition of noisy speech using cumulant-based linear prediction analysis. Published by Institute of Electrical and Electronics Engineers (IEEE), 1991
- A family of distortion measures based upon projection operation for robust speech recognition. IEEE Transactions on Acoustics, Speech, and Signal Processing, 1989
- On the use of bandpass liftering in speech recognition. IEEE Transactions on Acoustics, Speech, and Signal Processing, 1987
- An introduction to hidden Markov models. IEEE ASSP Magazine, 1986
- On the effects of varying filter bank parameters on isolated word recognition. IEEE Transactions on Acoustics, Speech, and Signal Processing, 1983
- A Maximization Technique Occurring in the Statistical Analysis of Probabilistic Functions of Markov Chains. The Annals of Mathematical Statistics, 1970