On-line versus off-line learning in the linear perceptron: A comparative study

Abstract
The spherical perceptron with $N$ inputs and a linear output does not achieve optimal generalization when trained by minimizing the standard quadratic cost function $E=\frac{1}{2}\sum_{\mu=1}^{\alpha N}(b_\mu-h_\mu)^2$, where $b_\mu$ and $h_\mu$ are the outputs of the rule (teacher) and hypothesis (student) networks for example $\mu$ and there are $\alpha N$ examples. We derive an optimal algorithm for on-line learning of examples which outperforms the standard iterative (off-line) algorithm for $\alpha$ up to 0.71. The optimized on-line algorithm suggests a class of cost functions for off-line learning, which we then proceed to study using the replica method. The optimized cost function within that class has the suggestive form $E_N=\Gamma\,\frac{1}{\alpha N}\sum_{\mu=1}^{\alpha N}\bigl[-\ln P(b_\mu\mid h_\mu)\bigr]-\Gamma\ln Z$, where $Z$ is a normalization constant, $P(b_\mu\mid h_\mu)$ is the conditional probability of the output datum $b_\mu$ given the hypothesis output $h_\mu$, and $\Gamma$ is a learning parameter, analogous to a temperature, which decreases in a well-defined manner along the learning process.
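To make the setting concrete, the following NumPy sketch pits an off-line least-squares minimization of the quadratic cost above against a single-pass on-line rule whose shrinking step size loosely plays the role of the annealed parameter $\Gamma$; the annealing schedule, dimensions, and normalizations here are illustrative assumptions, not the paper's derived optimal algorithm. (For a Gaussian likelihood $P(b_\mu\mid h_\mu)\propto\exp[-(b_\mu-h_\mu)^2/2\Gamma]$, the term $-\ln P$ reduces to the quadratic cost up to $\Gamma$-dependent constants, which is how the two cost functions connect.)

    import numpy as np

    # Toy teacher-student setup for the spherical linear perceptron
    # (an illustrative sketch, not the paper's actual algorithm).
    rng = np.random.default_rng(0)
    N = 200                    # number of inputs
    alpha = 0.5                # examples per weight, so P = alpha * N
    P = int(alpha * N)

    # Teacher (rule) and student (hypothesis) weights on the sphere |w|^2 = N.
    B = rng.standard_normal(N); B *= np.sqrt(N) / np.linalg.norm(B)
    J = rng.standard_normal(N); J *= np.sqrt(N) / np.linalg.norm(J)

    X = rng.standard_normal((P, N))   # the alpha*N example inputs
    b = X @ B / np.sqrt(N)            # teacher outputs b_mu

    # Off-line: minimize E = (1/2) sum_mu (b_mu - h_mu)^2 exactly by least squares.
    J_off, *_ = np.linalg.lstsq(X / np.sqrt(N), b, rcond=None)

    # On-line: one example per step, with an assumed 1/(1 + t/N) annealing
    # schedule standing in for the decreasing parameter Gamma.
    for mu in range(P):
        h = X[mu] @ J / np.sqrt(N)            # student output h_mu
        eta = 1.0 / (1.0 + mu / N)
        J += eta * (b[mu] - h) * X[mu] / np.sqrt(N)
        J *= np.sqrt(N) / np.linalg.norm(J)   # restore the spherical constraint

    # Teacher-student overlap R (R -> 1 means perfect generalization).
    for name, w in (("off-line", J_off), ("on-line", J)):
        R = w @ B / (np.linalg.norm(w) * np.linalg.norm(B))
        print(f"{name:8s} overlap R = {R:.3f}")

Sweeping $\alpha$ in this toy gives an informal feel for the small-$\alpha$ regime (up to $\alpha\approx 0.71$) in which the abstract reports the on-line algorithm outperforming the off-line one.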
