Learning and Generalization in Radial Basis Function Networks
- 1 September 1995
- journal article
- Published by MIT Press in Neural Computation
- Vol. 7 (5) , 1000-1020
- https://doi.org/10.1162/neco.1995.7.5.1000
Abstract
The two-layer radial basis function network, with fixed centers of the basis functions, is analyzed within a stochastic training paradigm. Various definitions of generalization error are considered, and two such definitions are employed in deriving generic learning curves and generalization properties, both with and without a weight decay term. The generalization error is shown analytically to be related to the evidence and, via the evidence, to the prediction error and free energy. The generalization behavior is explored; the generic learning curve is found to be inversely proportional to the number of training pairs presented. Optimization of training is considered by minimizing the generalization error with respect to the free parameters of the training algorithms. Finally, the effect of the joint activations between hidden-layer units is examined and shown to speed training.Keywords
This publication has 12 references indexed in Scilit:
- On Langevin Updating in Multilayer PerceptronsNeural Computation, 1994
- Statistical mechanics of hypothesis evaluationJournal of Physics A: General Physics, 1994
- Approximation and estimation bounds for artificial neural networksMachine Learning, 1994
- Stochastic linear learning: Exact test and training error averagesNeural Networks, 1993
- Learning a rule in a multilayer neural networkJournal of Physics A: General Physics, 1993
- Universal approximation bounds for superpositions of a sigmoidal functionIEEE Transactions on Information Theory, 1993
- On the training of radial basis function classifiersNeural Networks, 1992
- Bayesian InterpolationNeural Computation, 1992
- Neural networks and radial basis functions in classifying static speech patternsComputer Speech & Language, 1990
- Layered Neural Networks with Gaussian Hidden Units as Universal ApproximationsNeural Computation, 1990