A distributed asynchronous algorithm for expected average cost dynamic programming
- 1 January 1990
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- p. 1394-1395 vol.3
- https://doi.org/10.1109/cdc.1990.203839
Abstract
A distributed asynchronous implementation of the value-iteration algorithm in dynamic programming is presented. The iteration step is carried out by a number of processors, each iterating on a subset of the value function vector. Each processor transmits its computed coordinates to other processors. The algorithm converges when the different processors iterate at different speeds. The information received by a processor regarding other coordinates may be outdated, and there may be an unpredictable delay in receiving information from other processors.<>Keywords
This publication has 1 reference indexed in Scilit:
- Adaptive control of Markov chains with local updatesSystems & Control Letters, 1990