Семинары
RUS  ENG    ЖУРНАЛЫ   ПЕРСОНАЛИИ   ОРГАНИЗАЦИИ   КОНФЕРЕНЦИИ   СЕМИНАРЫ   ВИДЕОТЕКА   ПАКЕТ AMSBIB  
Календарь
Поиск
Регистрация семинара

RSS
Ближайшие семинары




Коллоквиум Факультета компьютерных наук НИУ ВШЭ
26 сентября 2023 г. 16:20–17:40, г. Москва, Покровский бульвар 11
 


Linear Stochastic Approximation Error Bounds with Applications to Reinforcement Learning

Sergey Samsonov

Количество просмотров:
Эта страница:128
Youtube:



Аннотация: In this talk, we discuss a finite-time analysis of linear stochastic approximation (LSA) algorithms with a fixed step size, a core method in statistics and machine learning. We cover the setting of both independent and identically distributed noise variables and a uniformly geometrically ergodic Markov chain. We derive p-th moment and high-probability deviation bounds for the iterates defined by LSA and its Polyak-Ruppert averaged version. Our finite-time instance-dependent bounds for the averaged LSA iterates are sharp in the sense that the leading term we obtain coincides with the local asymptotic minimax limit. Moreover, the remainder terms of our bounds have a tight dependence on the mixing time of the underlying noise sequence. Our results yield new finite-time error bounds for temporal difference learning with linear functional approximation and instance-independent step size. The talk is based on the recent paper https://arxiv.org/abs/2207.04475.

Website: https://cs.hse.ru/announcements/859337201.html
 
  Обратная связь:
 Пользовательское соглашение  Регистрация посетителей портала  Логотипы © Математический институт им. В. А. Стеклова РАН, 2024