Backpropagation separates when perceptrons do

1 January 1989

conference paper
Published by Institute of Electrical and Electronics Engineers (IEEE)

pp. 639-642, vol. 1
https://doi.org/10.1109/ijcnn.1989.118644

Abstract

This paper examines the least-squares problem that arises in training a feedforward net with no hidden neurons and monotonic nonlinear output units. Assuming that the training set is separable, that is, that some set of achievable outputs gives zero error, the authors show that the error function has no nonglobal minima. More precisely, they assume an error of the threshold least-mean-square (LMS) type, one that is zero whenever an output lies beyond its target value. The proof also yields a stronger result: under the continuous gradient adjustment procedure, a separating set of weights is reached in finite time from any initial weight configuration. This is a precise analog of the perceptron learning theorem. The authors contrast their results with the more classical pattern recognition problem of threshold LMS with linear output units.
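The setting described in the abstract can be illustrated with a small numerical sketch. It is not taken from the paper: the logistic output unit, the targets 0.9 and 0.1, the step size, the AND training set, and all function names are assumptions chosen for illustration. It also uses discrete gradient steps as a stand-in for the continuous gradient adjustment procedure the abstract refers to.

```python
import math

# Hypothetical separable training set (logical AND): (inputs, class label).
AND_SAMPLES = [((0.0, 0.0), 0), ((0.0, 1.0), 0), ((1.0, 0.0), 0), ((1.0, 1.0), 1)]


def sigmoid(z):
    """Monotonic nonlinear output unit (logistic function, an assumed choice)."""
    return 1.0 / (1.0 + math.exp(-z))


def threshold_lms_error(y, label, hi=0.9, lo=0.1):
    """Squared error that is zero once the output is beyond its target."""
    if label == 1:
        return max(0.0, hi - y) ** 2
    return max(0.0, y - lo) ** 2


def net_input(w, x):
    """Weighted sum of the inputs plus a bias (last entry of w)."""
    return sum(wi * xi for wi, xi in zip(w, x)) + w[-1]


def separates(w, samples):
    """True if every sample falls on the correct side of the hyperplane."""
    return all(
        (net_input(w, x) > 0.0) if label == 1 else (net_input(w, x) < 0.0)
        for x, label in samples
    )


def train(samples, lr=0.5, max_steps=100000, hi=0.9, lo=0.1):
    """Discrete gradient descent on the summed threshold LMS error.

    Returns the weights and the number of steps taken before the
    weights first separate the samples (or max_steps if they never do).
    """
    n = len(samples[0][0])
    w = [0.0] * (n + 1)  # n input weights followed by the bias
    for step in range(max_steps):
        if separates(w, samples):
            return w, step
        grad = [0.0] * (n + 1)
        for x, label in samples:
            y = sigmoid(net_input(w, x))
            # dE/dy for the threshold error; zero beyond the target.
            if label == 1:
                d_err = -2.0 * max(0.0, hi - y)
            else:
                d_err = 2.0 * max(0.0, y - lo)
            coeff = d_err * y * (1.0 - y)  # chain rule through the sigmoid
            for i, xi in enumerate(x):
                grad[i] += coeff * xi
            grad[n] += coeff  # bias input is the constant 1
        w = [wi - lr * gi for wi, gi in zip(w, grad)]
    return w, max_steps


if __name__ == "__main__":
    weights, steps = train(AND_SAMPLES)
    print("separating:", separates(weights, AND_SAMPLES), "after", steps, "steps")
```

The loop stops as soon as the weights separate the samples rather than when the error reaches zero, since separation is the event the abstract says occurs in finite time.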

This publication has 3 references indexed in Scilit:

Gradient descent fails to separate. Published by Institute of Electrical and Electronics Engineers (IEEE), 1988.
Parallel Distributed Processing. Published by MIT Press, 1986.
The Stability of Dynamical Systems. Published by Society for Industrial & Applied Mathematics (SIAM), 1976.