The asymptotic optimality of discretized linear reward-inaction learning automata

1 May 1984

journal article
Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Systems, Man, and Cybernetics

Vol. SMC-14 (3) , 542-545
https://doi.org/10.1109/tsmc.1984.6313256

Abstract

The automata considered have a variable structure and hence are completely described by action probability updating functions. The action probabilities can take only a finite number of prespecified values. These values linearly increase and the interval [0, 1] is divided into a number of equal length subintervals. The probability is updated by the automata only if the environment responds with a reward and hence they are called discretized linear reward-inaction automata. The asymptotic optimality of this family of automata is proved for all environments.

Keywords

LEARNING AUTOMATA
AUTOMATA
TIN
ACCURACY
GOLD
CONVERGENCE

This publication has 0 references indexed in Scilit: