Sequential Monte Carlo Methods to Train Neural Network Models

1 April 2000

journal article
Published by MIT Press in Neural Computation

Vol. 12 (4) , 955-993
https://doi.org/10.1162/089976600300015664

Abstract

We discuss a novel strategy for training neural networks using sequential Monte Carlo algorithms and propose a new hybrid gradient descent/sampling importance resampling algorithm (HySIR). In terms of computational time and accuracy, the hybrid SIR is a clear improvement over conventional sequential Monte Carlo techniques. The new algorithm may be viewed as a global optimization strategy that allows us to learn the probability distributions of the network weights and outputs in a sequential framework. It is well suited to applications involving on-line, nonlinear, and nongaussian signal processing. We show how the new algorithm outperforms extended Kalman filter training on several problems. In particular, we address the problem of pricing option contracts, traded in financial markets. In this context, we are able to estimate the one-step-ahead probability density functions of the options prices.

Keywords

This publication has 35 references indexed in Scilit:

Dynamic Conditional Independence Models and Markov Chain Monte Carlo Methods
Journal of the American Statistical Association, 1997
A fast-weighted Bayesian bootstrap filter for nonlinear model state estimation
IEEE Transactions on Aerospace and Electronic Systems, 1997
Stochastic simulation Bayesian approach to multitarget tracking
IEE Proceedings - Radar, Sonar and Navigation, 1995
Exact adaptive filters for Markov chains observed in Gaussian noise
Automatica, 1994
On-line identification of hidden Markov models via recursive prediction error techniques
IEEE Transactions on Signal Processing, 1994
Hybrid Monte Carlo simulations theory and initial comparison with molecular dynamics
Biopolymers, 1993
Approximation by superpositions of a sigmoidal function
Mathematics of Control, Signals, and Systems, 1989
Bayesian Inference in Econometric Models Using Monte Carlo Integration
Econometrica, 1989
Hybrid Monte Carlo
Physics Letters B, 1987
Random sampling approach to state estimation in switching environments
Automatica, 1977