|
Teoriya Veroyatnostei i ee Primeneniya, 1970, Volume 15, Issue 4, Pages 740–745
(Mi tvp1941)
|
|
|
|
This article is cited in 9 scientific papers (total in 9 papers)
Short Communications
On a problem of D. Blackwell from the theory of dynamic programming
E. B. Frid Moscow
Abstract:
In this paper the positive case of a dynamic programming problem is considered. We prove that, for any probability $p$ on the set of states $S$ and $\lambda<1$, there exists a stationary policy $\pi^*$ such that
$$
p\{I^{\pi^*}\ge\lambda\sup_\pi I^\pi\}=1,
$$
where $I^\pi$ is the mean reward.
Received: 03.06.1969
Citation:
E. B. Frid, “On a problem of D. Blackwell from the theory of dynamic programming”, Teor. Veroyatnost. i Primenen., 15:4 (1970), 740–745; Theory Probab. Appl., 15:4 (1970), 719–722
Linking options:
https://www.mathnet.ru/eng/tvp1941 https://www.mathnet.ru/eng/tvp/v15/i4/p740
|
Statistics & downloads: |
Abstract page: | 347 | Full-text PDF : | 83 |
|