Learning Algorithms for Two-Person Zero-Sum Stochastic Games with Incomplete Information
- 1 August 1981
- journal article
- Published by Institute for Operations Research and the Management Sciences (INFORMS) in Mathematics of Operations Research
- Vol. 6 (3) , 379-386
- https://doi.org/10.1287/moor.6.3.379
Abstract
This paper investigates conditions under which two learning algorithms playing a zero-sum sequential stochastic game would arrive at optimal pure strategies. Neither player has knowledge of either the pay-off matrix or the choice of strategies available to the other and both players update their own strategies at every stage entirely on the basis of the random outcome at that stage. The proposed learning algorithms are shown to converge to the optimal pure strategies when they exist with probabilities as close to 1 as desired.Keywords
This publication has 0 references indexed in Scilit: