Sergey V. Garbar, Alexander V. Kolnogorov, “Customization of J. Bather UCB strategy for a Gaussian multi-armed bandit”, Mat. Teor. Igr Pril., 14:2 (2022), 3–30

Matematicheskaya Teoriya Igr i Ee Prilozheniya

Matematicheskaya Teoriya Igr i Ee Prilozheniya, 2022, Volume 14, Issue 2, Pages 3–30 (Mi mgta299)

Customization of J. Bather UCB strategy for a Gaussian multi-armed bandit

Sergey V. Garbar, Alexander V. Kolnogorov

Yaroslav-the-Wise Novgorod State University


Abstract: We consider the customization of the UCB strategy, first proposed by J. Bather for the Bernoulli two-armed bandit, to the case of a Gaussian multi-armed bandit describing batch data processing. This optimal control problem has a classical interpretation as a game with nature, in which the player's payoff function is the expected loss of total income caused by incomplete information. The goal is stated in a minimax setting. For the considered game with nature, we present an invariant description of the control with a horizon equal to one, which allows computations to be performed in two ways: by Monte-Carlo simulations and analytically by the dynamic programming technique. For various configurations of the considered game with nature, we have found saddle points, which characterize the optimal control and the worst-case distribution of the parameters of the multi-armed bandit.

Keywords: multi-armed bandit problem, Gaussian multi-armed bandit, minimax approach, UCB rule, invariant description, Monte-Carlo simulations, dynamic programming.
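
The Monte-Carlo side of the abstract can be illustrated with a minimal sketch. It is not the authors' customized strategy: the index used here (sample mean plus `a * sqrt(variance * ln(n) / n_l)`), the function names `ucb_regret` and `monte_carlo_regret`, and all parameter values are assumptions chosen for illustration. The sketch plays a generic UCB rule on a Gaussian multi-armed bandit and averages the expected loss of income over repeated runs.

```python
import math
import random


def ucb_regret(means, horizon, a=1.0, variance=1.0, rng=None):
    """Play one run of a generic UCB rule and return the loss of expected income.

    The index form (sample mean + a * sqrt(variance * ln n / n_l)) is an
    illustrative assumption, not the rule studied in the paper.
    """
    rng = rng or random.Random()
    k = len(means)
    counts = [0] * k      # number of times each arm was played
    sums = [0.0] * k      # cumulative income from each arm
    sigma = math.sqrt(variance)
    for n in range(1, horizon + 1):
        if n <= k:
            arm = n - 1   # initial stage: play each arm once
        else:
            arm = max(
                range(k),
                key=lambda l: sums[l] / counts[l]
                + a * math.sqrt(variance * math.log(n) / counts[l]),
            )
        sums[arm] += rng.gauss(means[arm], sigma)
        counts[arm] += 1
    # Loss relative to always playing the best arm, given the arms chosen.
    best = max(means)
    return sum((best - m) * c for m, c in zip(means, counts))


def monte_carlo_regret(means, horizon, runs=200, a=1.0, seed=0):
    """Average the loss over independent simulated runs."""
    rng = random.Random(seed)
    return sum(ucb_regret(means, horizon, a, rng=rng) for _ in range(runs)) / runs


if __name__ == "__main__":
    # Hypothetical two-armed configuration with a gap of 0.5 between the means.
    print(monte_carlo_regret([0.0, 0.5], horizon=500))
```

In a minimax study one would repeat such an estimate over a grid of mean configurations and values of `a`, looking for the parameter that minimizes the largest estimated loss. The sketch only estimates the loss for one fixed configuration.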

Funding agency	Grant number
Russian Foundation for Basic Research	20-01-00062

Received: 10.10.2021
Revised: 03.03.2022
Accepted: 16.05.2022

Document Type: Article

UDC: 519.832, 519.245

BBC: 22.18

Language: Russian

Citation: Sergey V. Garbar, Alexander V. Kolnogorov, “Customization of J. Bather UCB strategy for a Gaussian multi-armed bandit”, Mat. Teor. Igr Pril., 14:2 (2022), 3–30

Citation in AMSBIB format:

\Bibitem{GarKol22}
\by Sergey~V.~Garbar, Alexander~V.~Kolnogorov
\paper Customization of J.~Bather UCB strategy for a Gaussian multi-armed bandit
\jour Mat. Teor. Igr Pril.
\yr 2022
\vol 14
\issue 2
\pages 3--30
\mathnet{http://mi.mathnet.ru/mgta299}
\mathscinet{http://mathscinet.ams.org/mathscinet-getitem?mr=4459156}

Linking options:

https://www.mathnet.ru/eng/mgta299

https://www.mathnet.ru/eng/mgta/v14/i2/p3
