On-line learning in soft committee machines

1 October 1995

journal article
research article
Published by American Physical Society (APS) in Physical Review E

Vol. 52 (4) , 4225-4243
https://doi.org/10.1103/physreve.52.4225

Abstract

The problem of on-line learning in two-layer neural networks is studied within the framework of statistical mechanics. A fully connected committee machine with K hidden units is trained by gradient descent to perform a task defined by a teacher committee machine with M hidden units acting on randomly drawn inputs. The approach, based on a direct averaging over the activation of the hidden units, results in a set of first-order differential equations that describes the dynamical evolution of the overlaps among the various hidden units and allows for a computation of the generalization error. The equations of motion are obtained analytically for general K and M and provide a powerful tool used here to study a variety of realizable, over-realizable, and unrealizable learning scenarios and to analyze the role of the learning rate in controlling the evolution and convergence of the learning process.

Keywords

This publication has 14 references indexed in Scilit:

On-Line Learning with a Perceptron
Europhysics Letters, 1994
Perfect loss of generalization due to noise in K=2 parity machines
Journal of Physics A: General Physics, 1994
Learning a rule in a multilayer neural network
Journal of Physics A: General Physics, 1993
The statistical mechanics of learning a rule
Reviews of Modern Physics, 1993
Optimal generalization in perceptions
Journal of Physics A: General Physics, 1992
Statistical mechanics of learning from examples
Physical Review A, 1992
Learning processes in neural networks
Physical Review A, 1991
Approximation by superpositions of a sigmoidal function
Mathematics of Control, Signals, and Systems, 1989
Multilayer feedforward networks are universal approximators
Neural Networks, 1989
Infinite Number of Order Parameters for Spin-Glasses
Physical Review Letters, 1979