Modeling state durations in hidden Markov models for automatic speech recognition

1 January 1992

conference paper
Published by Institute of Electrical and Electronics Engineers (IEEE)

Vol. 1 (15206149) , 381-384 vol.1
https://doi.org/10.1109/icassp.1992.225892

Abstract

Hidden Markov modeling (HMM) techniques have been used successfully for connected speech recognition in the last several years. In the traditional HMM algorithms, the probability of duration of a state decreases exponentially with time which is not appropriate for representing the temporal structure of speech. Non-parametric modeling of duration using semi-Markov chains does accomplish the task with a large increase in the computational complexity. Applying a postprocessing state duration penalty after Viterbi decoding adds very little computation but does not affect the forward recognition path. The authors present a way of modeling state durations in HMM using time-dependent state transitions. This inhomogeneous HMM (IHMM) does increase the computation by a small amount but reduces recognition error rates by 14-25%. Also, a suboptimal implementation of this scheme that requires no more computation than the traditional HMM is presented which also has reduced errors by 14-22% on a variety of databases.

Keywords

This publication has 4 references indexed in Scilit:

Some statistical issues in the comparison of speech recognition algorithms
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2003
A tutorial on hidden Markov models and selected applications in speech recognition
Proceedings of the IEEE, 1989
Continuously variable duration hidden Markov models for automatic speech recognition
Computer Speech & Language, 1986
Explicit modelling of state occupancy in hidden Markov models for automatic speech recognition
Published by Institute of Electrical and Electronics Engineers (IEEE) ,1985