|
Problemy Peredachi Informatsii, 2015, Volume 51, Issue 2, Pages 99–113
(Mi ppi2173)
|
|
|
|
This article is cited in 8 scientific papers (total in 8 papers)
Large Systems
One-armed bandit problem for parallel data processing systems
A. V. Kolnogorov Applied Mathematics and Information Science Department, Yaroslav-the-Wise Novgorod State University, Yaroslav, Russia
Abstract:
We consider the minimax setting for the one-armed bandit problem, i.e., for the two-armed bandit problem with a known distribution function of incomes corresponding to the first action. Incomes that correspond to the second action have normal distribution functions with unit variance and an unknown mathematical expectation. According to the main theorem of game theory, the minimax strategy and minimax risk are sought for as Bayesian, corresponding to the worst-case prior distribution. Results can be applied to parallel data processing systems if there are two processing methods available with an a priori known efficiency of the first.
Received: 02.09.2014 Revised: 25.02.2015
Citation:
A. V. Kolnogorov, “One-armed bandit problem for parallel data processing systems”, Probl. Peredachi Inf., 51:2 (2015), 99–113; Problems Inform. Transmission, 51:2 (2015), 177–191
Linking options:
https://www.mathnet.ru/eng/ppi2173 https://www.mathnet.ru/eng/ppi/v51/i2/p99
|
Statistics & downloads: |
Abstract page: | 410 | Full-text PDF : | 81 | References: | 37 | First page: | 15 |
|