|
Problemy Peredachi Informatsii, 1973, Volume 9, Issue 4, Pages 58–65
(Mi ppi923)
|
|
|
|
Automata Theory
A Model of Optimal Behavior in an Unknown Medium
A. V. Vasil'ev, A. V. Koganov
Abstract:
A model of adaptation to an unknown medium in which the learning system has several actions is analyzed, where the medium yields a fixed payoff after each action. The system has a finite learning time and must maximize the sum of the payoffs in that time. The medium is not known beforehand, but a class of media is specified. The given model can be treated as a game in which the strategies are behavior algorithms for the system (which remembers the previously obtained responses of the medium) and the specification, prior to the initiation of operation of the system, of a certain medium from the admissible class of media; the latter strategy is identified with “Nature” as a player. The minimax and maximin points are investigated for the game, and a saddle point is found for a class of mixed strategies.
Received: 20.03.1972
Citation:
A. V. Vasil'ev, A. V. Koganov, “A Model of Optimal Behavior in an Unknown Medium”, Probl. Peredachi Inf., 9:4 (1973), 58–65; Problems Inform. Transmission, 9:4 (1973), 314–320
Linking options:
https://www.mathnet.ru/eng/ppi923 https://www.mathnet.ru/eng/ppi/v9/i4/p58
|
Statistics & downloads: |
Abstract page: | 236 | Full-text PDF : | 86 |
|