Improving the Convergence of the Backpropagation Algorithm Using Learning Rate Adaptation Methods
- 1 October 1999
- journal article
- Published by MIT Press in Neural Computation
- Vol. 11 (7) , 1769-1796
- https://doi.org/10.1162/089976699300016223
Abstract
This article focuses on gradient-based backpropagation algorithms that use either a common adaptive learning rate for all weights or an individual adaptive learning rate for each weight and apply the Goldstein/Armijo line search. The learning-rate adaptation is based on descent techniques and estimates of the local Lipschitz constant that are obtained without additional error function and gradient evaluations. The proposed algorithms improve the backpropagation training in terms of both convergence rate and convergence characteristics, such as stable learning and robustness to oscillations. Simulations are conducted to compare and evaluate the convergence behavior of these gradient-based training algorithms with several popular training methods.Keywords
This publication has 23 references indexed in Scilit:
- An adaptive training algorithm for back-propagation neural networksIEEE Transactions on Systems, Man, and Cybernetics, 1995
- Learning with limited numerical precision using the cascade-correlation algorithmIEEE Transactions on Neural Networks, 1992
- First- and Second-Order Methods for Learning: Between Steepest Descent and Newton's MethodNeural Computation, 1992
- On the problem of local minima in backpropagationPublished by Institute of Electrical and Electronics Engineers (IEEE) ,1992
- Back-propagation algorithm which varies the number of hidden unitsNeural Networks, 1991
- An adaptive training algorithm for back propagation networksComputer Speech & Language, 1987
- Quasi-Newton Methods, Motivation and TheorySIAM Review, 1977
- Minimization of functions having Lipschitz continuous first partial derivativesPacific Journal of Mathematics, 1966
- Cauchy's method of minimizationNumerische Mathematik, 1962
- AN APPLICATION OF THE METHOD OF STEEPEST DESCENTS TO THE SOLUTION OF SYSTEMS OF NON-LINEAR SIMULTANEOUS EQUATIONSThe Quarterly Journal of Mechanics and Applied Mathematics, 1949