On-line versus off-line learning in the linear perceptron: A comparative study
- 1 September 1995
- journal article
- research article
- Published by American Physical Society (APS) in Physical Review E
- Vol. 52 (3) , 2878-2886
- https://doi.org/10.1103/physreve.52.2878
Abstract
The spherical perceptron with N inputs and a linear output does not present optimal generalization if trained by minimization of the standard quadratic cost function E=1/2 (- , where and are the outputs from the rule (teacher) and hypothesis (student) networks for the example μ and there are αN examples. We derive an optimal algorithm for on-line learning of examples which outperforms the iterative (off-line) standard algorithm for α up to 0.71. The on-line optimized algorithm suggests a class of cost functions for off-line learning, which we then proceed to study using the replica method. The optimized cost function within that class has the suggestive form E=αN[Γ(1/αN) [-lnP(‖)]-Γ lnZ], where Z is a normalization constant, P(‖) is the conditional probability of the output data given the hypothesis output , and Γ is a learning parameter analogous to a temperature which decreases in a well defined manner along the learning process.
Keywords
This publication has 17 references indexed in Scilit:
- Equilibrium properties of the linear perceptronJournal of Physics A: General Physics, 1993
- Learning and generalization in a linear perceptron stochastically trained with noisy dataJournal of Physics A: General Physics, 1993
- Learning drifting concepts with neural networksJournal of Physics A: General Physics, 1993
- The statistical mechanics of learning a ruleReviews of Modern Physics, 1993
- Generalization ability of perceptrons with continuous outputsPhysical Review E, 1993
- Optimal generalization in perceptionsJournal of Physics A: General Physics, 1992
- Biased learning in Boolean perceptronsPhysica A: Statistical Mechanics and its Applications, 1992
- Statistical mechanics of learning from examplesPhysical Review A, 1992
- Generalization in a linear perceptron in the presence of noiseJournal of Physics A: General Physics, 1992
- Improving a Network Generalization Ability by Selecting ExamplesEurophysics Letters, 1990