Семинары: Sergey Samsonov, Linear Stochastic Approximation Error Bounds with Applications to Reinforcement Learning

Loading [MathJax]/jax/output/SVG/config.js

Семинары

RUS ENG

ЖУРНАЛЫ ПЕРСОНАЛИИ ОРГАНИЗАЦИИ КОНФЕРЕНЦИИ СЕМИНАРЫ ВИДЕОТЕКА ПАКЕТ AMSBIB

JavaScript is disabled in your browser. Please switch it on to enable full functionality of the website

	Календарь
	Поиск
	Регистрация семинара

	RSS
	Ближайшие семинары

Коллоквиум Факультета компьютерных наук НИУ ВШЭ
26 сентября 2023 г. 16:20–17:40, г. Москва, Покровский бульвар 11

Linear Stochastic Approximation Error Bounds with Applications to Reinforcement Learning

Sergey Samsonov

Количество просмотров:
Эта страница:	206
Youtube:	525

https://www.youtube.com/watch?v=Wyg0VHiFBac

Аннотация: In this talk, we discuss a finite-time analysis of linear stochastic approximation (LSA) algorithms with a fixed step size, a core method in statistics and machine learning. We cover the setting of both independent and identically distributed noise variables and a uniformly geometrically ergodic Markov chain. We derive p-th moment and high-probability deviation bounds for the iterates defined by LSA and its Polyak-Ruppert averaged version. Our finite-time instance-dependent bounds for the averaged LSA iterates are sharp in the sense that the leading term we obtain coincides with the local asymptotic minimax limit. Moreover, the remainder terms of our bounds have a tight dependence on the mixing time of the underlying noise sequence. Our results yield new finite-time error bounds for temporal difference learning with linear functional approximation and instance-independent step size. The talk is based on the recent paper https://arxiv.org/abs/2207.04475.

Website: https://cs.hse.ru/announcements/859337201.html

Обратная связь:
math-net2025_05@mi-ras.ru

Пользовательское соглашение

Регистрация посетителей портала

Логотипы