On Fokker-Planck approximations of on-line learning processes

7 August 1994

journal article
Published by IOP Publishing in Journal of Physics A: General Physics

Vol. 27 (15) , 5145-5160
https://doi.org/10.1088/0305-4470/27/15/015

Abstract

There are several ways to describe on-line learning in neural networks. The two major ones are a continuous-time master equation and a discrete-time random-walk equation. The random-walk equation is obtained when the time intervals between subsequent learning steps are fixed; the master equation results when these intervals are drawn from a Poisson distribution. Following Van Kampen (1992), we give a rigorous expansion of both the master and the random-walk equation in the limit of small learning parameters. The results explain the difference between the Fokker-Planck approaches proposed by Radons et al. (1990) and Hansen et al. (1993). Furthermore, we find that the mathematical validity of these approaches is restricted to local properties of the learning process. Yet Fokker-Planck approaches are often suggested as models to study global properties, such as mean first passage times and stationary solutions. To check their accuracy and usefulness in these situations, we compare simulations of two learning procedures with exactly the same drift vector and diffusion matrix, the only moments that are considered in a Fokker-Planck approximation. The simulations show that the mean first passage times for these two learning procedures diverge rather than converge for small learning parameters. We conclude that Fokker-Planck approaches are not accurate enough to compute global properties of on-line learning processes.
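The kind of comparison described in the abstract can be illustrated with a minimal sketch. It is not the authors' experiment: the one-dimensional weight w, the drift function, the two noise distributions, the escape threshold, and all function names below are assumptions chosen for illustration. Both update rules have the form w <- w + eta * (drift(w) + noise), where the noise has zero mean and unit variance, so the first two moments of the update (drift and diffusion) coincide while the higher moments differ.

```python
import math
import random


def drift(w):
    # Assumed drift: negative gradient of U(w) = -cos(pi*w) / (2*pi).
    # w = 0 is a stable point; w = +1 and w = -1 are the barrier tops.
    return -0.5 * math.sin(math.pi * w)


def binary_noise(rng):
    # Zero mean, unit variance, bounded: +1 or -1 with equal probability.
    return rng.choice((-1.0, 1.0))


def gaussian_noise(rng):
    # Zero mean, unit variance, unbounded tails.
    return rng.gauss(0.0, 1.0)


def first_passage_steps(noise, eta, rng, w_start=0.0, barrier=1.0,
                        max_steps=1_000_000):
    # Count learning steps until |w| first reaches the barrier.
    # Returns max_steps if the barrier is not reached (censored run).
    w = w_start
    for step in range(1, max_steps + 1):
        w += eta * (drift(w) + noise(rng))
        if abs(w) >= barrier:
            return step
    return max_steps


def mean_first_passage(noise, eta, n_runs=200, seed=0):
    # Average the first passage time over independent runs.
    rng = random.Random(seed)
    total = sum(first_passage_steps(noise, eta, rng) for _ in range(n_runs))
    return total / n_runs


if __name__ == "__main__":
    # Compare the two procedures for a few (assumed) learning parameters.
    for eta in (0.4, 0.3, 0.2):
        t_bin = mean_first_passage(binary_noise, eta)
        t_gau = mean_first_passage(gaussian_noise, eta)
        print(f"eta={eta}: binary={t_bin:.1f} steps, gaussian={t_gau:.1f} steps")
```

Because a Fokker-Planck approximation retains only the drift and diffusion terms, it cannot distinguish these two procedures; any systematic gap between their simulated mean first passage times therefore comes from the moments the approximation discards. Smaller values of eta make escapes exponentially rarer, so run times grow quickly.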

This publication has 12 references indexed in Scilit:

On stochastic dynamics of supervised learning
Journal of Physics A: General Physics, 1993
Stochastic dynamics of supervised learning
Journal of Physics A: General Physics, 1993
Learning in neural networks with local minima
Physical Review A, 1992
Learning processes in neural networks
Physical Review A, 1991
Convergence properties of Kohonen's topology conserving maps: fluctuations, stability, and dimension selection
Biological Cybernetics, 1988
Learning representations by back-propagating errors
Nature, 1986
Robustness and Approximation of Escape Times and Large Deviations Estimates for Systems with Small Noise Effects
SIAM Journal on Applied Mathematics, 1984
Self-organized formation of topologically correct feature maps
Biological Cybernetics, 1982
The validity of nonlinear Langevin equations
Journal of Statistical Physics, 1981
On the Relation between Master Equations and Random Walks and Their Solutions
Journal of Mathematical Physics, 1971