Differential dynamic programming methods for solving bang-bang control problems
- 1 December 1968
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Automatic Control
- Vol. 13 (6) , 661-675
- https://doi.org/10.1109/tac.1968.1099026
Abstract
Differential dynamic programming is a technique, based on dynamic programming rather than the calculus of variations, for determining the optimal control function of a nonlinear system. Unlike conventional dynamic programming where the optimal cost function is considered globally, differential dynamic programming applies the principle of optimality in the neighborhood of a nominal, possibly nonoptimal, trajectory. This allows the coefficients of a linear or quadratic expansion of the cost function to be computed in reverse time along the trajectory: these coefficients may then be used to yield a new improved trajectory (i.e., the algorithms are of the "successive sweep" type). A class of nonlinear control problems, linear in the control variables, is studied using differential dynamic programming. It is shown that for the free-end-point problem, the first partial derivatives of the optimal cost function are continuous throughout the state space, and the second partial derivatives experience jumps at switch points of the control function. A control problem that has an aualytic solution is used to illustrate these points. The fixed-end-point problem is converted into an equivalent free-end-point problem by adjoining the end-point constraints to the cost functional using Lagrange multipliers: a useful interpretation for Pontryagin's adjoint variables for this type of problem emerges from this treatment. The above results are used to devise new second- and first-order algorithms for determining the optimal bang-bang control by successively improving a nominal guessed control function. The usefulness of the proposed algorithms is illustrated by the computation of a number of control problem examples.Keywords
This publication has 13 references indexed in Scilit:
- On optimal control problems with discontinuitiesJournal of Mathematical Analysis and Applications, 1968
- A second-order feedback method for optimal control computationsIEEE Transactions on Automatic Control, 1967
- The successive sweep method and dynamic programmingJournal of Mathematical Analysis and Applications, 1967
- Successive approximation methods for the solution of optimal control problemsAutomatica, 1966
- An iterative procedure for computing time-optimal controlsIEEE Transactions on Automatic Control, 1964
- A TRAJECTORY OPTIMIZATION TECHNIQUE BASED UPON THE THEORY OF THE SECOND VARIATIONPublished by American Institute of Aeronautics and Astronautics (AIAA) ,1963
- Time Optimal Control of Linear Systems with Bounded ControlsPublished by Elsevier ,1963
- Variational methods in problems of control and programmingJournal of Mathematical Analysis and Applications, 1961
- Synthesizing time optimal control systemsJournal of Mathematical Analysis and Applications, 1960
- On the “bang-bang” control problemQuarterly of Applied Mathematics, 1956