|
Teoriya Veroyatnostei i ee Primeneniya, 1980, Volume 25, Issue 1, Pages 71–82
(Mi tvp986)
|
|
|
|
This article is cited in 16 scientific papers (total in 16 papers)
An $\varepsilon$-optimal control of finite Markov chain with average
reward criterion
E. A. Feĭnberg Moscow
Abstract:
Discrete time Markov decition chain with average reward criterion is considered.
It is proved that if the state space is finite and the sets of actions are measurable subsets
of Polish space, then there exist non-randomized Markov $\varepsilon$-optimal policies. An example
showing that there exists a Markov decition chain with countable state space and finite
sets of actions such that randomized Markov $\varepsilon$-optimal policies for this chain don't
exist is constructed.
Received: 24.08.1978
Citation:
E. A. Feǐnberg, “An $\varepsilon$-optimal control of finite Markov chain with average
reward criterion”, Teor. Veroyatnost. i Primenen., 25:1 (1980), 71–82; Theory Probab. Appl., 25:1 (1980), 70–81
Linking options:
https://www.mathnet.ru/eng/tvp986 https://www.mathnet.ru/eng/tvp/v25/i1/p71
|
Statistics & downloads: |
Abstract page: | 206 | Full-text PDF : | 96 |
|