|
Teoriya Veroyatnostei i ee Primeneniya, 1969, Volume 14, Issue 2, Pages 284–291
(Mi tvp1171)
|
|
|
|
This article is cited in 1 scientific paper (total in 1 paper)
The optimal policy in games with unbounded sequence of moves
Yu. I. Kifer Moscow
Abstract:
Two players gamble a game on a finite set $X$. For each $x\in X$, a subset $\Gamma_x\subseteq X$ and a number $\varepsilon(x)$ equal to 1 or to $-1$ are defined. Player I may move from a point $x\in\{x\colon\varepsilon(x)=1\}$ to any point $y\in\Gamma_x$; this move takes a time $t(xy)$ and gives him a payoff $g(xy)$ payed by player II. The same situation exists if to replace player I by player II and points $x\in\{x\colon\varepsilon(x)=1\}$ by $x\in\{x\colon\varepsilon(x)=-1\}$. Player I tries to maximize the average income per unit time
$$
\varphi(x_0)=\varlimsup_{n\to\infty}\frac{\sum_{k=1}^n\varepsilon(x_{k-1})g(x_{k-1}x_k)}{\sum_{k=1}^nt(x+{k-1}x_k)}
$$
Along with this game the finite game is considered which terminates when the players get to a point for the second time. The income of the 1st player is defined as the average payoff per cycle. The existence of optimal stationary policies is proved. These policies coincide with stationary ones in the infinite game. It enables to construct optimal policies by the usual linear programming method for finite games.
Received: 24.03.1967
Citation:
Yu. I. Kifer, “The optimal policy in games with unbounded sequence of moves”, Teor. Veroyatnost. i Primenen., 14:2 (1969), 284–291; Theory Probab. Appl., 14:2 (1969), 279–286
Linking options:
https://www.mathnet.ru/eng/tvp1171 https://www.mathnet.ru/eng/tvp/v14/i2/p284
|
|