A Method for Approximate Solutions to Stochastic Dynamic Programming Problems Using Expectations
- 1 April 1968
- journal article
- Published by Institute for Operations Research and the Management Sciences (INFORMS) in Operations Research
- Vol. 16 (2) , 296-306
- https://doi.org/10.1287/opre.16.2.296
Abstract
This note describes and illustrates a computational technique to obtain approximate solutions to stochastic dynamic programming problems. The technique is to replace probability distributions by their corresponding expectations, and to use the values of the states in the corresponding deterministic system under its optimal policy to determine an approximate policy in the stochastic system through a single application of Howard's policy improvement operation. Two examples are given.Keywords
This publication has 0 references indexed in Scilit: