Doklady Rossijskoj Akademii Nauk. Mathematika, Informatika, Processy Upravlenia
RUS  ENG    JOURNALS   PEOPLE   ORGANISATIONS   CONFERENCES   SEMINARS   VIDEO LIBRARY   PACKAGE AMSBIB  
General information
Latest issue
Archive
Impact factor

Search papers
Search references

RSS
Latest issue
Current issues
Archive issues
What is RSS



Dokl. RAN. Math. Inf. Proc. Upr.:
Year:
Volume:
Issue:
Page:
Find






Personal entry:
Login:
Password:
Save password
Enter
Forgotten password?
Register


Doklady Rossijskoj Akademii Nauk. Mathematika, Informatika, Processy Upravlenia, 2022, Volume 508, Pages 79–87
DOI: https://doi.org/10.31857/S2686954322070177
(Mi danma340)
 

ADVANCED STUDIES IN ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING

Scheduling in multiagent systems using reinforcement learning

I. K. Minashina, R. A. Gorbachev, E. M. Zakharova

Moscow Institute of Physics and Technology (National Research University), Dolgoprudny, Moscow Region
References:
Abstract: The paper is devoted to scheduling in multiagent systems in the framework of the Flatland 3 competition. The main aim of this competition is to develop an algorithm for the effective control of dense traffic in complex railroad networks according to a given schedule. The proposed solution is based on reinforcement learning. To adapt this method to the particular scheduling problem, a novel approach based on structuring the reward function that stimulates an agent to adhere to its schedule was developed. The architecture of the proposed model is based on a multiagent version of centralized critic with proximal policy optimization (PPO) learning. In addition, a curriculum learning strategy was developed and implemented. This allowed the agent to cope with each level of complexity on time and train the model in more difficult conditions. The proposed solution won first place in the Flatland 3 competition in the reinforcement learning track.
Keywords: reinforcement learning, multiagent systems, railroads, Flatland, reward function structuring, curriculum learning, centralized critic.
Presented: A. L. Semenov
Received: 28.10.2022
Revised: 28.10.2022
Accepted: 01.11.2022
English version:
Doklady Mathematics, 2022, Volume 106, Issue suppl. 1, Pages S70–S78
DOI: https://doi.org/10.1134/S1064562422060175
Bibliographic databases:
Document Type: Article
UDC: 004.8
Language: Russian
Citation: I. K. Minashina, R. A. Gorbachev, E. M. Zakharova, “Scheduling in multiagent systems using reinforcement learning”, Dokl. RAN. Math. Inf. Proc. Upr., 508 (2022), 79–87; Dokl. Math., 106:suppl. 1 (2022), S70–S78
Citation in format AMSBIB
\Bibitem{MinGorZak22}
\by I.~K.~Minashina, R.~A.~Gorbachev, E.~M.~Zakharova
\paper Scheduling in multiagent systems using reinforcement learning
\jour Dokl. RAN. Math. Inf. Proc. Upr.
\yr 2022
\vol 508
\pages 79--87
\mathnet{http://mi.mathnet.ru/danma340}
\crossref{https://doi.org/10.31857/S2686954322070177}
\elib{https://elibrary.ru/item.asp?id=49991313}
\transl
\jour Dokl. Math.
\yr 2022
\vol 106
\issue suppl. 1
\pages S70--S78
\crossref{https://doi.org/10.1134/S1064562422060175}
Linking options:
  • https://www.mathnet.ru/eng/danma340
  • https://www.mathnet.ru/eng/danma/v508/p79
  • Citing articles in Google Scholar: Russian citations, English citations
    Related articles in Google Scholar: Russian articles, English articles
    Doklady Rossijskoj Akademii Nauk. Mathematika, Informatika, Processy Upravlenia Doklady Rossijskoj Akademii Nauk. Mathematika, Informatika, Processy Upravlenia
    Statistics & downloads:
    Abstract page:53
    References:9
     
      Contact us:
     Terms of Use  Registration to the website  Logotypes © Steklov Mathematical Institute RAS, 2024