A Semigroup Representation of the Maximum Expected Reward Vector in Continuous Parameter Markov Decision Theory
- 1 November 1975
- journal article
- Published by Society for Industrial & Applied Mathematics (SIAM) in SIAM Journal on Control
- Vol. 13 (6), 1115-1129
- https://doi.org/10.1137/0313069
Abstract
The maximum expected reward vector that arises in continuous parameter Markov decision problems is frequently characterized as the unique solution of a certain Cauchy problem. This paper generalizes this characterization by viewing the maximum expected reward vector as a nonlinear semigroup in an appropriate Banach space. This perspective has several advantages. First, the semigroup may exist even though the corresponding Cauchy problem does not have a solution. Second, this approach is often useful in showing when the Cauchy problem does have a solution. Third, these methods are useful in the study of the method of successive approximations. Finally, these methods appear likely to unify some diverse results in Markov decision theory. The results in this paper are very general. First, sufficient conditions are given for the semigroup to exist. The discounted reward case is studied next; a certain operator is shown to have a unique singular point that is the strong limit of the semigroup as the parameter ...