On Fokker-Planck approximations of on-line learning processes
- 7 August 1994
- journal article
- Published by IOP Publishing in Journal of Physics A: General Physics
- Vol. 27 (15), 5145-5160
- https://doi.org/10.1088/0305-4470/27/15/015
Abstract
There are several ways to describe on-line learning in neural networks. The two major ones are a continuous-time master equation and a discrete-time random-walk equation. The random-walk equation is obtained when the time intervals between subsequent learning steps are fixed; the master equation results when the time intervals are drawn from a Poisson distribution. Following Van Kampen (1992), we give a rigorous expansion of both the master equation and the random-walk equation in the limit of small learning parameters. The results explain the difference between the Fokker-Planck approaches proposed by Radons et al. (1990) and Hansen et al. (1993). Furthermore, we find that the mathematical validity of these approaches is restricted to local properties of the learning process. Yet Fokker-Planck approaches are often suggested as models for studying global properties, such as mean first passage times and stationary solutions. To check their accuracy and usefulness in these situations, we compare simulations of two learning procedures with exactly the same drift vector and diffusion matrix, the only moments that are considered in a Fokker-Planck approximation. The simulations show that the mean first passage times for these two learning procedures diverge rather than converge for small learning parameters. We conclude that Fokker-Planck approaches are not accurate enough to compute global properties of on-line learning processes.
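The comparison described in the abstract rests on two learning procedures that share their first two jump moments. The following Python sketch illustrates that idea only; it is not the pair of procedures simulated in the paper. The one-dimensional update `w + eta * (-w + xi)`, the two noise laws, and the names `step_binary`, `step_gaussian`, and `jump_moments` are all assumptions made for illustration. Both rules have the same drift and the same diffusion, so a Fokker-Planck approximation cannot tell them apart, yet their fourth moments differ.

```python
import random


def step_binary(w, eta, rng):
    # Hypothetical rule A: noise is +1 or -1 with equal probability
    # (mean 0, variance 1, fourth moment 1).
    xi = rng.choice((-1.0, 1.0))
    return w + eta * (-w + xi)


def step_gaussian(w, eta, rng):
    # Hypothetical rule B: standard Gaussian noise
    # (mean 0, variance 1, fourth moment 3).
    xi = rng.gauss(0.0, 1.0)
    return w + eta * (-w + xi)


def jump_moments(step, w, eta, n=200_000, seed=0):
    """Estimate drift, diffusion, and fourth central moment of the
    rescaled jump (w_new - w) / eta at a fixed weight w."""
    rng = random.Random(seed)
    jumps = [(step(w, eta, rng) - w) / eta for _ in range(n)]
    drift = sum(jumps) / n
    centered = [j - drift for j in jumps]
    diffusion = sum(c * c for c in centered) / n
    fourth = sum(c ** 4 for c in centered) / n
    return drift, diffusion, fourth


if __name__ == "__main__":
    for name, step in (("binary", step_binary), ("gaussian", step_gaussian)):
        drift, diffusion, fourth = jump_moments(step, w=0.5, eta=0.01)
        print(f"{name}: drift={drift:.3f} diffusion={diffusion:.3f} "
              f"fourth={fourth:.3f}")
```

In this toy setting the estimated drift and diffusion agree between the two rules up to sampling error, while the fourth moment does not. Any quantity that depends on moments beyond the second, which the abstract argues includes global properties such as mean first passage times, can therefore differ between the two rules even though their Fokker-Planck descriptions coincide.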
This publication has 12 references indexed in Scilit:
- On stochastic dynamics of supervised learning. Journal of Physics A: General Physics, 1993
- Stochastic dynamics of supervised learning. Journal of Physics A: General Physics, 1993
- Learning in neural networks with local minima. Physical Review A, 1992
- Learning processes in neural networks. Physical Review A, 1991
- Convergence properties of Kohonen's topology conserving maps: fluctuations, stability, and dimension selection. Biological Cybernetics, 1988
- Learning representations by back-propagating errors. Nature, 1986
- Robustness and Approximation of Escape Times and Large Deviations Estimates for Systems with Small Noise Effects. SIAM Journal on Applied Mathematics, 1984
- Self-organized formation of topologically correct feature maps. Biological Cybernetics, 1982
- The validity of nonlinear Langevin equations. Journal of Statistical Physics, 1981
- On the Relation between Master Equations and Random Walks and Their Solutions. Journal of Mathematical Physics, 1971