Learning and Generalization in Radial Basis Function Networks

1 September 1995

journal article
Published by MIT Press in Neural Computation

Vol. 7 (5) , 1000-1020
https://doi.org/10.1162/neco.1995.7.5.1000

Abstract

The two-layer radial basis function network, with fixed centers of the basis functions, is analyzed within a stochastic training paradigm. Various definitions of generalization error are considered, and two such definitions are employed in deriving generic learning curves and generalization properties, both with and without a weight decay term. The generalization error is shown analytically to be related to the evidence and, via the evidence, to the prediction error and free energy. The generalization behavior is explored; the generic learning curve is found to be inversely proportional to the number of training pairs presented. Optimization of training is considered by minimizing the generalization error with respect to the free parameters of the training algorithms. Finally, the effect of the joint activations between hidden-layer units is examined and shown to speed training.

Keywords

This publication has 12 references indexed in Scilit:

On Langevin Updating in Multilayer Perceptrons
Neural Computation, 1994
Statistical mechanics of hypothesis evaluation
Journal of Physics A: General Physics, 1994
Approximation and estimation bounds for artificial neural networks
Machine Learning, 1994
Stochastic linear learning: Exact test and training error averages
Neural Networks, 1993
Learning a rule in a multilayer neural network
Journal of Physics A: General Physics, 1993
Universal approximation bounds for superpositions of a sigmoidal function
IEEE Transactions on Information Theory, 1993
On the training of radial basis function classifiers
Neural Networks, 1992
Bayesian Interpolation
Neural Computation, 1992
Neural networks and radial basis functions in classifying static speech patterns
Computer Speech & Language, 1990
Layered Neural Networks with Gaussian Hidden Units as Universal Approximations
Neural Computation, 1990