Improving the Convergence of the Backpropagation Algorithm Using Learning Rate Adaptation Methods

1 October 1999

journal article
Published by MIT Press in Neural Computation

Vol. 11 (7), 1769-1796
https://doi.org/10.1162/089976699300016223

Abstract

This article focuses on gradient-based backpropagation algorithms that use either a common adaptive learning rate for all weights or an individual adaptive learning rate for each weight and apply the Goldstein/Armijo line search. The learning-rate adaptation is based on descent techniques and estimates of the local Lipschitz constant that are obtained without additional error function and gradient evaluations. The proposed algorithms improve the backpropagation training in terms of both convergence rate and convergence characteristics, such as stable learning and robustness to oscillations. Simulations are conducted to compare and evaluate the convergence behavior of these gradient-based training algorithms with several popular training methods.
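
The idea summarized in the abstract can be illustrated with a minimal sketch. It is not the authors' algorithm as published: it assumes a single common learning rate set to 1/(2Λ), where Λ is a local Lipschitz estimate formed from two successive weight vectors and their gradients, and it uses simple step halving to enforce an Armijo-type sufficient-decrease condition. The function name `lipschitz_gd` and the parameters `eta0`, `sigma`, `max_iter`, and `tol` are hypothetical and chosen for illustration only.

```python
import numpy as np

def lipschitz_gd(f, grad, w0, eta0=0.1, sigma=1e-4, max_iter=500, tol=1e-8):
    """Gradient descent with a common learning rate adapted from a local
    Lipschitz estimate (illustrative sketch, not the published method)."""
    w = np.asarray(w0, dtype=float)
    g = grad(w)
    eta = eta0  # initial learning rate (assumed value)
    for _ in range(max_iter):
        if np.linalg.norm(g) < tol:
            break
        fw = f(w)
        step = eta
        # Armijo-type condition: halve the step until the error decreases
        # by at least sigma * step * ||g||^2.
        while f(w - step * g) > fw - sigma * step * g.dot(g) and step > 1e-12:
            step *= 0.5
        w_new = w - step * g
        g_new = grad(w_new)
        # Local Lipschitz estimate from quantities already computed:
        # Lambda = ||g_new - g|| / ||w_new - w||.
        dw = np.linalg.norm(w_new - w)
        dg = np.linalg.norm(g_new - g)
        if dw > 0 and dg > 0:
            eta = 0.5 * dw / dg  # assumed rule: eta = 1 / (2 * Lambda)
        w, g = w_new, g_new
    return w

# Usage on a toy quadratic error function (hypothetical example data).
c = np.array([1.0, -2.0])
error = lambda w: 0.5 * np.sum((w - c) ** 2)
error_grad = lambda w: w - c
w_star = lipschitz_gd(error, error_grad, np.zeros(2))
```

In this sketch the Lipschitz estimate reuses the gradient that the next iteration needs anyway, so the estimate itself costs no extra gradient evaluations; only the step-halving check adds error function evaluations. An individual learning rate per weight could be sketched the same way by forming the ratio componentwise, which is likewise an assumption rather than something stated in the abstract.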
