Adaptive dynamic programming

Top Cited Papers

journal article
Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Systems, Man and Cybernetics, Part C (Applications and Reviews)

Vol. 32 (2) , 140-153
https://doi.org/10.1109/tsmcc.2002.801727

Abstract

Unlike the many soft computing applications where it suffices to achieve a "good approximation most of the time," a control system must be stable all of the time. As such, if one desires to learn a control law in real-time, a fusion of soft computing techniques to learn the appropriate control law with hard computing techniques to maintain the stability constraint and guarantee convergence is required. The objective of the paper is to describe an adaptive dynamic programming algorithm (ADPA) which fuses soft computing techniques to learn the optimal cost (or return) functional for a stabilizable nonlinear system with unknown dynamics and hard computing techniques to verify the stability and convergence of the algorithm. Specifically, the algorithm is initialized with a (stabilizing) cost functional and the system is run with the corresponding control law (defined by the Hamilton-Jacobi-Bellman equation), with the resultant state trajectories used to update the cost functional in a soft computing mode. Hard computing techniques are then used to show that this process is globally convergent with stepwise stability to the optimal cost functional/control law pair for an (unknown) input affine system with an input quadratic performance measure (modulo the appropriate technical conditions). Three specific implementations of the ADPA are developed for 1) the linear case, 2) for the nonlinear case using a locally quadratic approximation to the cost functional, and 3) the nonlinear case using a radial basis function approximation of the cost functional; illustrated by applications to flight control.

Keywords

This publication has 17 references indexed in Scilit:

A neurocontroller for robotic applications
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2003
Adaptive critic design in learning to play game of Go
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2002
Development and Flight Test of the X-43A-LS Hypersonic Configuration UAV
Published by American Institute of Aeronautics and Astronautics (AIAA) ,2002
Adaptive neural network control of nonlinear systems by state and output feedback
IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics), 1999
Variable neural networks for adaptive control of nonlinear systems
IEEE Transactions on Systems, Man and Cybernetics, Part C (Applications and Reviews), 1999
LoFLYTE - A neurocontrols testbed
Published by American Institute of Aeronautics and Astronautics (AIAA) ,1997
Fuzzy control stabilization with applications to motorcycle control
IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics), 1996
Direct Adaptive and Neural Control of Wing-Rock Motion of Slender Delta Wings
Journal of Guidance, Control, and Dynamics, 1995
Stable adaptive fuzzy control of nonlinear systems
IEEE Transactions on Fuzzy Systems, 1993
Asymptotic Estimates for Solutions of Linear Systems of Ordinary Differential Equations Having Multiple Characteristic Roots
Indiana University Mathematics Journal, 1972