A. V. Kolnogorov, “Invariant description of control in a Gaussian one-armed bandit problem”, Vestnik YuUrGU. Ser. Mat. Model. Progr., 17:1 (2024), 27–36

Vestnik Yuzhno-Ural'skogo Universiteta. Seriya Matematicheskoe Modelirovanie i Programmirovanie

Vestnik Yuzhno-Ural'skogo Universiteta. Seriya Matematicheskoe Modelirovanie i Programmirovanie, 2024, Volume 17, Issue 1, Pages 27–36
DOI: https://doi.org/10.14529/mmp240103 (Mi vyuru709)

Mathematical Modelling

Invariant description of control in a Gaussian one-armed bandit problem

A. V. Kolnogorov

Yaroslav-the-Wise Novgorod State University, Veliky Novgorod, Russian Federation

Abstract: We consider the one-armed bandit problem as applied to batch data processing, where there are two alternative processing methods with different efficiencies and the efficiency of the second method is a priori unknown. During processing, it is necessary to determine the more effective method and ensure its preferential use. Data are processed in batches, so the distributions of incomes are Gaussian. We consider the case where both the mathematical expectation and the variance of the income corresponding to the second action are a priori unknown. This case describes a situation in which the batches themselves and their number have moderate or small sizes. We obtain recursive equations for computing the Bayesian risk and regret, which we then present in an invariant form with a control horizon equal to one. This makes it possible to obtain estimates of the Bayesian and minimax risks that are valid for all control horizons that are multiples of the number of processed batches.
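As a rough illustration of the setting described in the abstract (and not of the paper's recursive Bayesian equations), the following Python sketch simulates batch processing with a known first action and a Gaussian second action whose mean and variance are hidden from the strategy. The stopping rule, the function names, and every parameter (`threshold`, `batch_size`, and so on) are assumptions introduced here for illustration only. The rule abandons the second action once an upper confidence bound built from its sample mean and sample variance falls below the known mean of the first action.

```python
import math
import random


def simulate_regret(m1, m2, sigma2, horizon, batch_size, threshold, rng):
    """One run of a heuristic batch strategy; returns its expected-income loss.

    Action 1 has the known mean m1. Action 2 has mean m2 and standard
    deviation sigma2, both unknown to the strategy (hypothetical setup).
    """
    n_batches = horizon // batch_size
    total = 0.0      # sum of incomes observed for action 2
    total_sq = 0.0   # sum of squared incomes observed for action 2
    n_obs = 0        # number of items processed with action 2
    for _ in range(n_batches):
        if n_obs >= 2:
            mean = total / n_obs
            # Unbiased sample variance, clipped at zero against rounding error.
            var = max((total_sq - n_obs * mean * mean) / (n_obs - 1), 0.0)
            if mean + threshold * math.sqrt(var / n_obs) < m1:
                # Action 1 yields no information about action 2,
                # so the switch is permanent.
                break
        for _ in range(batch_size):
            x = rng.gauss(m2, sigma2)
            total += x
            total_sq += x * x
        n_obs += batch_size
    n_first = n_batches * batch_size - n_obs  # items processed with action 1
    best = max(m1, m2)
    return n_obs * (best - m2) + n_first * (best - m1)


def average_regret(m1, m2, sigma2, horizon, batch_size, threshold, runs, seed=0):
    """Monte Carlo estimate of the regret for fixed (m2, sigma2)."""
    rng = random.Random(seed)
    total = sum(
        simulate_regret(m1, m2, sigma2, horizon, batch_size, threshold, rng)
        for _ in range(runs)
    )
    return total / runs


if __name__ == "__main__":
    # Scan a few hypothetical values of the unknown mean m2.
    for m2 in (-1.0, -0.2, 0.2, 1.0):
        print(m2, average_regret(0.0, m2, 1.0, 1000, 50, 2.0, runs=200))
```

Scanning the estimate over a grid of `m2` and `sigma2` values and taking the maximum gives a crude numerical picture of the worst-case (minimax) regret of this particular heuristic. It says nothing about the optimal strategies studied in the paper.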

Keywords: one-armed bandit, batch processing, Bayesian and minimax approaches, invariant description.

Funding agency	Grant number
Russian Science Foundation	23-21-00447
The research was supported by the Russian Science Foundation, project number 23-21-00447, https://rscf.ru/en/project/23-21-00447/.

Received: 22.11.2023

Document Type: Article

UDC: 519.244, 519.83

MSC: 62C10, 62L05, 91A35

Language: English

Citation: A. V. Kolnogorov, “Invariant description of control in a Gaussian one-armed bandit problem”, Vestnik YuUrGU. Ser. Mat. Model. Progr., 17:1 (2024), 27–36

Citation in AMSBIB format

\Bibitem{Kol24}
\by A.~V.~Kolnogorov
\paper Invariant description of control in a Gaussian one-armed bandit problem
\jour Vestnik YuUrGU. Ser. Mat. Model. Progr.
\yr 2024
\vol 17
\issue 1
\pages 27--36
\mathnet{http://mi.mathnet.ru/vyuru709}
\crossref{https://doi.org/10.14529/mmp240103}

Linking options:

https://www.mathnet.ru/eng/vyuru709

https://www.mathnet.ru/eng/vyuru/v17/i1/p27
