Variational Learning for Switching State-Space Models
- 1 April 2000
- journal article
- research article
- Published by MIT Press in Neural Computation
- Vol. 12 (4) , 831-864
- https://doi.org/10.1162/089976600300015619
Abstract
We introduce a new statistical model for time series that iteratively segments data into regimes with approximately linear dynamics and learns the parameters of each of these linear regimes. This model combines and generalizes two of the most widely used stochastic time-series models—hidden Markov models and linear dynamical systems—and is closely related to models that are widely used in the control and econometrics literatures. It can also be derived by extending the mixture of experts neural network (Jacobs, Jordan, Nowlan, & Hinton, 1991) to its fully dynamical version, in which both expert and gating networks are recurrent. Inferring the posterior probabilities of the hidden states of this model is computationally intractable, and therefore the exact expectation maximization (EM) algorithm cannot be applied. However, we present a variational approximation that maximizes a lower bound on the log-likelihood and makes use of both the forward and backward recursions for hidden Markov models and the Kalman filter recursions for linear dynamical systems. We tested the algorithm on artificial data sets and a natural data set of respiration force from a patient with sleep apnea. The results suggest that variational approximations are a viable method for inference and learning in switching state-space models.Keywords
This publication has 29 references indexed in Scilit:
- Time-Series Segmentation Using Predictive Modular Neural NetworksNeural Computation, 1997
- A mixture-of-experts framework for adaptive Kalman filteringIEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics), 1997
- Probabilistic Independence Networks for Hidden Markov Probability ModelsNeural Computation, 1997
- Modeling the manifolds of images of handwritten digitsIEEE Transactions on Neural Networks, 1997
- Annealed Competition of Experts for a Segmentation and Classification of Switching DynamicsNeural Computation, 1996
- Hidden Markov models for fault detection in dynamic systemsPattern Recognition, 1994
- A stochastic model of speech incorporating hierarchical nonstationarityIEEE Transactions on Speech and Audio Processing, 1993
- Hidden Markov Models for Speech RecognitionTechnometrics, 1991
- Adaptive Mixtures of Local ExpertsNeural Computation, 1991
- New Results in Linear Filtering and Prediction TheoryJournal of Basic Engineering, 1961