I. K. Minashina, R. A. Gorbachev, E. M. Zakharova, “Scheduling in multiagent systems using reinforcement learning”, Dokl. RAN. Math. Inf. Proc. Upr., 508 (2022), 79–87; Dokl. Math., 106:suppl. 1 (2022), S70

Doklady Rossijskoj Akademii Nauk. Mathematika, Informatika, Processy Upravlenia

RUS ENG

JOURNALS PEOPLE ORGANISATIONS CONFERENCES SEMINARS VIDEO LIBRARY PACKAGE AMSBIB

JavaScript is disabled in your browser. Please switch it on to enable full functionality of the website

	General information
	Latest issue
	Archive
	Impact factor

	Search papers
	Search references

	RSS
	Latest issue
	Current issues
	Archive issues
	What is RSS

Dokl. RAN. Math. Inf. Proc. Upr.:
Year:
Volume:
Issue:
Page:
	Find

Personal entry:
Login:
Password:
	Save password
	Enter
	Forgotten password?
	Register

Doklady Rossijskoj Akademii Nauk. Mathematika, Informatika, Processy Upravlenia, 2022, Volume 508, Pages 79–87
DOI: https://doi.org/10.31857/S2686954322070177 (Mi danma340)

ADVANCED STUDIES IN ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING

Scheduling in multiagent systems using reinforcement learning

I. K. Minashina, R. A. Gorbachev, E. M. Zakharova

Moscow Institute of Physics and Technology (National Research University), Dolgoprudny, Moscow Region

References:

PDF

HTML

DOI: https://doi.org/10.31857/S2686954322070177

Abstract: The paper is devoted to scheduling in multiagent systems in the framework of the Flatland 3 competition. The main aim of this competition is to develop an algorithm for the effective control of dense traffic in complex railroad networks according to a given schedule. The proposed solution is based on reinforcement learning. To adapt this method to the particular scheduling problem, a novel approach based on structuring the reward function that stimulates an agent to adhere to its schedule was developed. The architecture of the proposed model is based on a multiagent version of centralized critic with proximal policy optimization (PPO) learning. In addition, a curriculum learning strategy was developed and implemented. This allowed the agent to cope with each level of complexity on time and train the model in more difficult conditions. The proposed solution won first place in the Flatland 3 competition in the reinforcement learning track.

Keywords: reinforcement learning, multiagent systems, railroads, Flatland, reward function structuring, curriculum learning, centralized critic.

Presented: A. L. Semenov
Received: 28.10.2022
Revised: 28.10.2022
Accepted: 01.11.2022

English version:
Doklady Mathematics, 2022, Volume 106, Issue suppl. 1, Pages S70–S78
DOI: https://doi.org/10.1134/S1064562422060175

Bibliographic databases:

Document Type: Article

UDC: 004.8

Language: Russian

Citation: I. K. Minashina, R. A. Gorbachev, E. M. Zakharova, “Scheduling in multiagent systems using reinforcement learning”, Dokl. RAN. Math. Inf. Proc. Upr., 508 (2022), 79–87; Dokl. Math., 106:suppl. 1 (2022), S70–S78

Citation in format AMSBIB

\Bibitem{MinGorZak22}

\by I.~K.~Minashina, R.~A.~Gorbachev, E.~M.~Zakharova

\paper Scheduling in multiagent systems using reinforcement learning

\jour Dokl. RAN. Math. Inf. Proc. Upr.

\yr 2022

\vol 508

\pages 79--87

\mathnet{http://mi.mathnet.ru/danma340}

\crossref{https://doi.org/10.31857/S2686954322070177}

\elib{https://elibrary.ru/item.asp?id=49991313}

\transl

\jour Dokl. Math.

\yr 2022

\vol 106

\issue suppl. 1

\pages S70--S78

\crossref{https://doi.org/10.1134/S1064562422060175}

Linking options:

https://www.mathnet.ru/eng/danma340

https://www.mathnet.ru/eng/danma/v508/p79

Citing articles in Google Scholar: Russian citations, English citations
Related articles in Google Scholar: Russian articles, English articles

Doklady Rossijskoj Akademii Nauk. Mathematika, Informatika, Processy Upravlenia

Statistics & downloads:
Abstract page:	53
References:	9

Что такое QR-код?

Registration to the website

Logotypes