Avtomatika i Telemekhanika
RUS  ENG    JOURNALS   PEOPLE   ORGANISATIONS   CONFERENCES   SEMINARS   VIDEO LIBRARY   PACKAGE AMSBIB  
General information
Latest issue
Archive
Impact factor
Guidelines for authors
Submit a manuscript

Search papers
Search references

RSS
Latest issue
Current issues
Archive issues
What is RSS



Avtomat. i Telemekh.:
Year:
Volume:
Issue:
Page:
Find






Personal entry:
Login:
Password:
Save password
Enter
Forgotten password?
Register


Avtomatika i Telemekhanika, 2012, Issue 4, Pages 114–130 (Mi at3793)  

This article is cited in 15 scientific papers (total in 15 papers)

Robust and Adaptive Systems

Parallel design of robust control in the stochastic environment (the two-armed bandit problem)

A. V. Kolnogorov

Yaroslav-the-Wise Novgorod State University, Velikii Novgorod, Russia
References:
Abstract: The problem of rational behavior in the stochastic environment, also known as the two armed bandit problem, is considered in the robust (minimax) setting. A parallel strategy is proposed leading to control, which is arbitrary close to the optimal one for environments with gains having gaussian cumulative distribution functions with unit variance. The invariant recursive equation is obtained for computing the minimax strategy and risk, which are to be found as Bayesian ones associated with the worst-case a priori distribution. As a result, the well-known Vogel's estimates of the minimax risk can be improved. Numerical experiments show that the strategy is efficient in the environments with non-gaussian distributions, e.g., the binary ones.
Presented by the member of Editorial Board: A. V. Nazin

Received: 24.11.2010
English version:
Automation and Remote Control, 2012, Volume 73, Issue 4, Pages 689–701
DOI: https://doi.org/10.1134/S000511791204008X
Bibliographic databases:
Document Type: Article
Language: Russian
Citation: A. V. Kolnogorov, “Parallel design of robust control in the stochastic environment (the two-armed bandit problem)”, Avtomat. i Telemekh., 2012, no. 4, 114–130; Autom. Remote Control, 73:4 (2012), 689–701
Citation in format AMSBIB
\Bibitem{Kol12}
\by A.~V.~Kolnogorov
\paper Parallel design of robust control in the stochastic environment (the two-armed bandit problem)
\jour Avtomat. i Telemekh.
\yr 2012
\issue 4
\pages 114--130
\mathnet{http://mi.mathnet.ru/at3793}
\transl
\jour Autom. Remote Control
\yr 2012
\vol 73
\issue 4
\pages 689--701
\crossref{https://doi.org/10.1134/S000511791204008X}
\isi{https://gateway.webofknowledge.com/gateway/Gateway.cgi?GWVersion=2&SrcApp=Publons&SrcAuth=Publons_CEL&DestLinkType=FullRecord&DestApp=WOS_CPL&KeyUT=000302809600008}
\scopus{https://www.scopus.com/record/display.url?origin=inward&eid=2-s2.0-84862121337}
Linking options:
  • https://www.mathnet.ru/eng/at3793
  • https://www.mathnet.ru/eng/at/y2012/i4/p114
  • This publication is cited in the following 15 articles:
    Citing articles in Google Scholar: Russian citations, English citations
    Related articles in Google Scholar: Russian articles, English articles
    Avtomatika i Telemekhanika
    Statistics & downloads:
    Abstract page:280
    Full-text PDF :78
    References:35
    First page:18
     
      Contact us:
     Terms of Use  Registration to the website  Logotypes © Steklov Mathematical Institute RAS, 2024