Dynamic programming and gambling models
- 1 September 1974
- journal article
- Published by Cambridge University Press (CUP) in Advances in Applied Probability
- Vol. 6 (3) , 593-606
- https://doi.org/10.2307/1426236
Abstract
Dynamic programming is used to solve some simple gambling models. In particular we consider the situation where an individual may bet any integral amount not greater than his fortune and he will win this amount with probability p or lose it with probability 1 — p. It is shown that if p ≧ ½ then the timid strategy (always bet one dollar) both maximizes the probability of ever reaching any preassigned fortune, and also stochastically maximizes the time until the bettor becomes broke. Also, if p ≦ ½ then the timid strategy while not stochastically maximizing the playing time does maximize the expected playing time. We also consider the same model but with the additional structure that the bettor need not gamble but may instead elect to work for some period of time. His goal is to minimize the expected time until his fortune reaches some preassigned goal. We show that if p ≦ ½ then (i) always working is optimal, and (ii) among those strategies that only allow working when the bettor is broke it is the bold strategy that is optimalKeywords
This publication has 7 references indexed in Scilit:
- On Stationary PoliciesJournal of the Royal Statistical Society. Series A (General), 1970
- On the existence of stationary optimal strategiesProceedings of the American Mathematical Society, 1969
- Optimal Gambling Systems for Favorable GamesRevue de l'Institut International de Statistique / Review of the International Statistical Institute, 1969
- Timid Play is OptimalThe Annals of Mathematical Statistics, 1967
- How to Survive a Fixed Number of Fair BetsThe Annals of Mathematical Statistics, 1967
- Negative Dynamic ProgrammingThe Annals of Mathematical Statistics, 1966
- A Note on Memoryless Rules for Controlling Sequential Control ProcessesThe Annals of Mathematical Statistics, 1966