Learning Algorithms for Two-Person Zero-Sum Stochastic Games with Incomplete Information

1 August 1981

journal article
Published by Institute for Operations Research and the Management Sciences (INFORMS) in Mathematics of Operations Research

Vol. 6 (3), 379-386
https://doi.org/10.1287/moor.6.3.379

Abstract

This paper investigates conditions under which two learning algorithms playing a zero-sum sequential stochastic game arrive at optimal pure strategies. Neither player knows the pay-off matrix or the strategies available to the other, and each player updates its own strategy at every stage solely on the basis of the random outcome at that stage. When optimal pure strategies exist, the proposed learning algorithms are shown to converge to them with probability as close to 1 as desired.
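The setting described in the abstract can be illustrated with a small simulation. The abstract does not state the update rule, so the sketch below assumes a linear reward-inaction scheme as a stand-in: a player shifts probability toward the action it just played when the stage outcome favours it, and leaves its strategy unchanged otherwise. The function names, the `win_prob` matrix, and the `step` parameter are all hypothetical and chosen only for illustration.

```python
import random


def lri_update(p, chosen, rewarded, step):
    """Assumed linear reward-inaction rule: on reward, move p toward the
    chosen action by a fraction `step`; on no reward, return p unchanged."""
    if not rewarded:
        return list(p)
    return [pi + step * ((1.0 if i == chosen else 0.0) - pi)
            for i, pi in enumerate(p)]


def sample(p, rng):
    """Draw an action index from the probability vector p."""
    u, acc = rng.random(), 0.0
    for i, pi in enumerate(p):
        acc += pi
        if u < acc:
            return i
    return len(p) - 1  # guard against floating-point round-off


def play(win_prob, stages, step, seed=0):
    """Simulate repeated play. win_prob[i][j] is the (hypothetical)
    probability that the row player wins when row i meets column j;
    the column player wins otherwise, which makes the game zero-sum.
    Neither player reads win_prob or the opponent's action."""
    rng = random.Random(seed)
    p = [1.0 / len(win_prob)] * len(win_prob)        # row player's mixed strategy
    q = [1.0 / len(win_prob[0])] * len(win_prob[0])  # column player's mixed strategy
    for _ in range(stages):
        i, j = sample(p, rng), sample(q, rng)
        row_wins = rng.random() < win_prob[i][j]     # random outcome of this stage
        p = lri_update(p, i, row_wins, step)
        q = lri_update(q, j, not row_wins, step)
    return p, q


if __name__ == "__main__":
    # Illustrative matrix with a saddle point at (row 0, column 0).
    demo = [[0.6, 0.8], [0.35, 0.9]]
    print(play(demo, stages=20000, step=0.01))
```

Each player sees only whether it won the stage, which matches the information restriction in the abstract. In schemes of this kind a smaller `step` typically slows learning but makes ending at the wrong pure strategy less likely, which is one plausible reading of "as close to 1 as desired".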

Keywords

OPTIMAL
SUM
STOCHASTIC
GAME
LEARNING ALGORITHMS
CONVERGE
MATRIX
INCOMPLETE
SEQUENTIAL
