|
Matematicheskaya Teoriya Igr i Ee Prilozheniya, 2017, Volume 9, Issue 4, Pages 69–87
(Mi mgta209)
|
|
|
|
This article is cited in 3 scientific papers (total in 3 papers)
Decision-making model under presence of experts as a modified multi-armed bandit problem
Dmitriy S. Smirnov, Ekaterina V. Gromova Saint-Petersburg State University, Faculty of
Applied Mathematics and Control Process
Abstract:
The modified multi-armed bandit problem is formulated in the paper which allows the player to use so-called expert hints in the decision making process. As a player in this problem is meant some automated system that uses a certain strategy (algorithm) for making a decision under conditions of uncertainty. The approach is developed for the case of $m$ experts. A modification of the well-known UCB1 algorithm is proposed to solve the multi-armed bandit problem. The results of a numerical experiment are given in order to show influence of expert hints on the player's payoff.
Keywords:
multi-armed bandit problem, decision making, optimization methods, machine learning algorithms.
Citation:
Dmitriy S. Smirnov, Ekaterina V. Gromova, “Decision-making model under presence of experts as a modified multi-armed bandit problem”, Mat. Teor. Igr Pril., 9:4 (2017), 69–87
Linking options:
https://www.mathnet.ru/eng/mgta209 https://www.mathnet.ru/eng/mgta/v9/i4/p69
|
|