Training hidden Markov models with multiple observations-a combinatorial method
- 1 April 2000
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- Vol. 22 (4) , 371-377
- https://doi.org/10.1109/34.845379
Abstract
Hidden Markov models (HMM) are stochastic models capable of statistical learning and classification. They have been applied in speech recognition and handwriting recognition because of their great adaptability and versatility in handling sequential signals. On the other hand, as these models have a complex structure and also because the involved data sets usually contain uncertainty, it is difficult to analyze the multiple observation training problem without certain assumptions. For many years researchers have used the training equations of Levinson (1983) in speech and handwriting applications, simply assuming that all observations are independent of each other. This paper presents a formal treatment of HMM multiple observation training without imposing the above assumption. In this treatment, the multiple observation probability is expressed as a combination of individual observation probabilities without losing generality. This combinatorial method gives one more freedom in making different dependence-independence assumptions. By generalizing Baum's auxiliary function into this framework and building up an associated objective function using the Lagrange multiplier method, it is proven that the derived training equations guarantee the maximization of the objective function. Furthermore, we show that Levinson's training equations can be easily derived as a special case in this treatment.Keywords
This publication has 20 references indexed in Scilit:
- An HMM/MLP Architecture for Sequence RecognitionNeural Computation, 1995
- Robust estimation of HMM parameters using fuzzy vector quantization and Parzen's windowPattern Recognition, 1995
- Hidden Markov models applied to on-line handwritten isolated character recognitionIEEE Transactions on Image Processing, 1994
- Recognition of handwritten word: First and second order hidden Markov model based approachPattern Recognition, 1989
- A speaker-independent, syntax-directed, connected word recognition system based on hidden Markov models and level buildingIEEE Transactions on Acoustics, Speech, and Signal Processing, 1985
- An Introduction to the Application of the Theory of Probabilistic Functions of a Markov Process to Automatic Speech RecognitionBell System Technical Journal, 1983
- On the Convergence Properties of the EM AlgorithmThe Annals of Statistics, 1983
- A Maximum Likelihood Approach to Continuous Speech RecognitionPublished by Institute of Electrical and Electronics Engineers (IEEE) ,1983
- A Maximization Technique Occurring in the Statistical Analysis of Probabilistic Functions of Markov ChainsThe Annals of Mathematical Statistics, 1970
- Growth transformations for functions on manifoldsPacific Journal of Mathematics, 1968