Strong consistency of a modified maximum likelihood estimator for controlled Markov chains
- 1 September 1980
- journal article
- Published by Cambridge University Press (CUP) in Journal of Applied Probability
- Vol. 17 (3) , 726-734
- https://doi.org/10.2307/3212966
Abstract
A controlled Markov chain with finite state space has transition probabilities which depend on an unknown parameter α lying in a known finite set A. For each α, a stationary control law ϕ α is given. This paper develops a control scheme whereby at each stage t a parameter α t is chosen at random from among those parameters which nearly maximize the log likelihood function, and the control ut is chosen according to the control law ϕ αt. It is proved that this algorithm leads to identification of the true α under conditions weaker than any previously considered.Keywords
This publication has 2 references indexed in Scilit:
- Adaptive control of Markov chains, I: Finite parameter setIEEE Transactions on Automatic Control, 1979
- Estimation and control in Markov chainsAdvances in Applied Probability, 1974