Variational Learning for Switching State-Space Models

1 April 2000

journal article
research article
Published by MIT Press in Neural Computation

Vol. 12 (4) , 831-864
https://doi.org/10.1162/089976600300015619

Abstract

We introduce a new statistical model for time series that iteratively segments data into regimes with approximately linear dynamics and learns the parameters of each of these linear regimes. This model combines and generalizes two of the most widely used stochastic time-series models—hidden Markov models and linear dynamical systems—and is closely related to models that are widely used in the control and econometrics literatures. It can also be derived by extending the mixture of experts neural network (Jacobs, Jordan, Nowlan, & Hinton, 1991) to its fully dynamical version, in which both expert and gating networks are recurrent. Inferring the posterior probabilities of the hidden states of this model is computationally intractable, and therefore the exact expectation maximization (EM) algorithm cannot be applied. However, we present a variational approximation that maximizes a lower bound on the log-likelihood and makes use of both the forward and backward recursions for hidden Markov models and the Kalman filter recursions for linear dynamical systems. We tested the algorithm on artificial data sets and a natural data set of respiration force from a patient with sleep apnea. The results suggest that variational approximations are a viable method for inference and learning in switching state-space models.

Keywords

This publication has 29 references indexed in Scilit:

Time-Series Segmentation Using Predictive Modular Neural Networks
Neural Computation, 1997
A mixture-of-experts framework for adaptive Kalman filtering
IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics), 1997
Probabilistic Independence Networks for Hidden Markov Probability Models
Neural Computation, 1997
Modeling the manifolds of images of handwritten digits
IEEE Transactions on Neural Networks, 1997
Annealed Competition of Experts for a Segmentation and Classification of Switching Dynamics
Neural Computation, 1996
Hidden Markov models for fault detection in dynamic systems
Pattern Recognition, 1994
A stochastic model of speech incorporating hierarchical nonstationarity
IEEE Transactions on Speech and Audio Processing, 1993
Hidden Markov Models for Speech Recognition
Technometrics, 1991
Adaptive Mixtures of Local Experts
Neural Computation, 1991
New Results in Linear Filtering and Prediction Theory
Journal of Basic Engineering, 1961