Vestnik Yuzhno-Ural'skogo Universiteta. Seriya Matematicheskoe Modelirovanie i Programmirovanie
RUS  ENG    JOURNALS   PEOPLE   ORGANISATIONS   CONFERENCES   SEMINARS   VIDEO LIBRARY   PACKAGE AMSBIB  
General information
Latest issue
Archive
Submit a manuscript

Search papers
Search references

RSS
Latest issue
Current issues
Archive issues
What is RSS



Vestnik YuUrGU. Ser. Mat. Model. Progr.:
Year:
Volume:
Issue:
Page:
Find






Personal entry:
Login:
Password:
Save password
Enter
Forgotten password?
Register


Vestnik Yuzhno-Ural'skogo Universiteta. Seriya Matematicheskoe Modelirovanie i Programmirovanie, 2024, Volume 17, Issue 1, Pages 27–36
DOI: https://doi.org/10.14529/mmp240103
(Mi vyuru709)
 

Mathematical Modelling

Invariant description of control in a Gaussian one-armed bandit problem

A. V. Kolnogorov

Yaroslav-the-Wise Novgorod State University, Veliky Novgorod, Russian Federation
References:
Abstract: We consider the one-armed bandit problem in application to batch data processing if there are two alternative processing methods with different efficiencies and the efficiency of the second method is a priori unknown. During the processing, it is necessary to determine the most effective method and ensure its preferential use. Processing is performed in batches, so the distributions of incomes are Gaussian. We consider the case of a priori unknown mathematical expectation and the variance of income corresponding to the second action. This case describes a situation when the batches themselves and their number have moderate or small volumes. We obtain recursive equations for computing the Bayesian risk and regret, which we then present in an invariant form with a control horizon equal to one. This makes it possible to obtain the estimates of Bayesian and minimax risk that are valid for all control horizons multiples to the number of processed batches.
Keywords: one-armed bandit, batch processing, Bayesian and minimax approaches, invariant description.
Funding agency Grant number
Russian Science Foundation 23-21-00447
The research was supported by Russian Science Foundation, project number 23-21-00447, https://rscf.ru/en/project/23-21-00447/.
Received: 22.11.2023
Document Type: Article
UDC: 519.244, 519.83
MSC: 62C10, 62L05, 91A35
Language: English
Citation: A. V. Kolnogorov, “Invariant description of control in a Gaussian one-armed bandit problem”, Vestnik YuUrGU. Ser. Mat. Model. Progr., 17:1 (2024), 27–36
Citation in format AMSBIB
\Bibitem{Kol24}
\by A.~V.~Kolnogorov
\paper Invariant description of control in a Gaussian one-armed bandit problem
\jour Vestnik YuUrGU. Ser. Mat. Model. Progr.
\yr 2024
\vol 17
\issue 1
\pages 27--36
\mathnet{http://mi.mathnet.ru/vyuru709}
\crossref{https://doi.org/10.14529/mmp240103}
Linking options:
  • https://www.mathnet.ru/eng/vyuru709
  • https://www.mathnet.ru/eng/vyuru/v17/i1/p27
  • Citing articles in Google Scholar: Russian citations, English citations
    Related articles in Google Scholar: Russian articles, English articles
    Statistics & downloads:
    Abstract page:53
    Full-text PDF :22
    References:14
     
      Contact us:
     Terms of Use  Registration to the website  Logotypes © Steklov Mathematical Institute RAS, 2024