|
Problemy Peredachi Informatsii, 2018, Volume 54, Issue 1, Pages 93–111
(Mi ppi2262)
|
|
|
|
This article is cited in 18 scientific papers (total in 18 papers)
Large Systems
Gaussian two-armed bandit and optimization of batch data processing
A. V. Kolnogorov Department of Applied Mathematics and Information Science,
Yaroslav-the-Wise Novgorod State University, Novgorod the Great, Russia
Abstract:
We consider the minimax setting for the two-armed bandit problem with normally distributed incomes having a priori unknown mathematical expectations and variances. This setting naturally arises in optimization of batch data processing where two alternative processing methods are available with different a priori unknown efficiencies. During the control process, it is required to determine the most efficient method and ensure its predominant application. We use the main theorem of game theory to search for minimax strategy and minimax risk as Bayesian ones corresponding to the worst-case prior distribution. To find them, a recursive integro-difference equation is obtained. We show that batch data processing almost does not increase the minimax risk if the number of batches is large enough.
Received: 11.09.2017 Revised: 24.11.2017
Citation:
A. V. Kolnogorov, “Gaussian two-armed bandit and optimization of batch data processing”, Probl. Peredachi Inf., 54:1 (2018), 93–111; Problems Inform. Transmission, 54:1 (2018), 84–100
Linking options:
https://www.mathnet.ru/eng/ppi2262 https://www.mathnet.ru/eng/ppi/v54/i1/p93
|
Statistics & downloads: |
Abstract page: | 371 | Full-text PDF : | 49 | References: | 40 | First page: | 16 |
|