Seminars
RUS  ENG    JOURNALS   PEOPLE   ORGANISATIONS   CONFERENCES   SEMINARS   VIDEO LIBRARY   PACKAGE AMSBIB  
Calendar
Search
Add a seminar

RSS
Forthcoming seminars




Principle Seminar of the Department of Probability Theory, Moscow State University
October 19, 2011 16:45, Moscow, MSU, auditorium 16-24
 


Robust Parallel Control in a Random Environment (the Two-Armed Bandit Problem)

A. V. Kolnogorov

Novgorod State University
Supplementary materials:
Adobe PDF 2.4 Mb

Abstract: The problem of expedient behavior in a stationary environment which is also well-known as the two-armed bandit problem is considered in robust (minimax) setting. Minimax strategy and risk are found as Bayes' ones corresponding to the worst prior distribution. For environments which incomes have normal distributions with unit variances and expectations depending on applied alternatives only this prior distribution can be chosen a symmetric and asymptotically uniform one.
A parallel control strategy is proposed which provides arbitrary close to optimal control. An invariant recurrent equation is obtained for finding the minimax strategy and minimax risk by dynamic programming method. This allows to improve well-known W.Vogel's estimates of the minimax risk. A numerical analysis shows that the strategy performs well in stationary environments which distributions are different from normal ones, e.g. in binary Bernoulli environments.

Supplementary materials: normal.pdf (2.4 Mb)
 
  Contact us:
 Terms of Use  Registration to the website  Logotypes © Steklov Mathematical Institute RAS, 2024