The asymptotic optimality of discretized linear reward-inaction learning automata
- 1 May 1984
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Systems, Man, and Cybernetics
- Vol. SMC-14 (3), 542-545
- https://doi.org/10.1109/tsmc.1984.6313256
Abstract
The automata considered have a variable structure and are therefore completely described by their action probability updating functions. The action probabilities can take only a finite number of prespecified values: the values increase linearly, so that the interval [0, 1] is divided into a number of equal-length subintervals. The automata update the probabilities only when the environment responds with a reward, and are hence called discretized linear reward-inaction automata. The asymptotic optimality of this family of automata is proved for all environments.
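
The updating scheme described in the abstract can be illustrated with a minimal sketch. The abstract does not give the update rule itself, so everything below is an assumption: a two-action automaton, a resolution parameter `n` (the number of equal-length subintervals of [0, 1]), a step of exactly one grid point per reward, and a stationary environment that rewards action `i` with probability `reward_probs[i]`. The names `dlri_step` and `simulate` are hypothetical and do not come from the paper.

```python
import random


def dlri_step(k, n, action, rewarded):
    """One update of an assumed two-action discretized reward-inaction automaton.

    The probability of choosing action 0 is k / n, so it always lies on the
    grid {0, 1/n, ..., 1}. The step size of one grid point is an assumption.
    """
    if not rewarded:
        return k  # inaction: a penalty leaves the probability unchanged
    if action == 0:
        return min(k + 1, n)  # reward for action 0: move one grid point up
    return max(k - 1, 0)  # reward for action 1: move one grid point down


def simulate(n, reward_probs, steps, seed=0):
    """Run the automaton against an assumed stationary random environment."""
    rng = random.Random(seed)
    k = n // 2  # start near the middle of the grid
    for _ in range(steps):
        action = 0 if rng.random() < k / n else 1
        rewarded = rng.random() < reward_probs[action]
        k = dlri_step(k, n, action, rewarded)
    return k / n  # final probability of choosing action 0


if __name__ == "__main__":
    # Action 0 is rewarded more often, so it is the better action here.
    print(simulate(n=20, reward_probs=(0.8, 0.4), steps=5000))
```

In this sketch the grid points 0 and 1 are absorbing: once the probability of an action reaches 1, only that action is chosen and a reward cannot move the state any further. Under these assumptions, a finer grid (larger `n`) would be expected to make absorption at the better action more likely, which is the kind of behaviour the asymptotic optimality result concerns.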