A. Iskandar, A. Hammoud, B. Kovács, “Implicit understanding: decoding swarm behaviors in robots through deep inverse reinforcement learning”, Informatics and Automation, 23:5 (2024), 1485

Loading [MathJax]/jax/output/SVG/config.js

Informatics and Automation

RUS ENG

JOURNALS PEOPLE ORGANISATIONS CONFERENCES SEMINARS VIDEO LIBRARY PACKAGE AMSBIB

JavaScript is disabled in your browser. Please switch it on to enable full functionality of the website

	General information
	Latest issue
	Archive

	Search papers
	Search references

	RSS
	Latest issue
	Current issues
	Archive issues
	What is RSS

Informatics and Automation:
Year:
Volume:
Issue:
Page:
	Find

Personal entry:
Login:
Password:
	Save password
	Enter
	Forgotten password?
	Register

Informatics and Automation, 2024, Issue 23, volume 5, Pages 1485–1504
DOI: https://doi.org/10.15622/ia.23.5.8 (Mi trspy1331)

Robotics, Automation and Control Systems

Implicit understanding: decoding swarm behaviors in robots through deep inverse reinforcement learning

A. Iskandar^a, A. Hammoud^b, B. Kovács^a

^a University of Miskolc
^b Federal State Budgetary Educational Institution of Higher Education “Kuban State Agrarian University named after I.T. Trubilin”

Full-text PDF (2507 kB)

DOI: https://doi.org/10.15622/ia.23.5.8

Abstract: Using reinforcement learning to generate the collective behavior of swarm robots is a common approach. Yet, formulating an appropriate reward function that aligns with specific objectives remains a significant challenge, particularly as the complexity of tasks increases. In this paper, we develop a deep inverse reinforcement learning model to uncover the reward structures that guide autonomous robots in achieving tasks by demonstrations. Deep inverse reinforcement learning models are particularly well-suited for complex and dynamic environments where predefined reward functions may be difficult to specify. Our model can generate different collective behaviors according to the required objectives and effectively copes with continuous state and action spaces, ensuring a nuanced recovery of reward structures. We tested the model using E-puck robots in the Webots simulator to solve two tasks: searching for dispersed boxes and navigation to a predefined position. Receiving rewards depends on demonstrations collected by an intelligent pre-trained swarm using reinforcement learning act as an expert. The results show successful recovery of rewards in both segmented and continuous demonstrations for two behaviors – searching and navigation. By observing the learned behaviors of the swarm by the expert and proposed model, it is noticeable that the model does not merely clone the expert behavior but generates its own strategies to achieve the system’s objectives.

Keywords: deep inverse reinforcement learning, reward function, demonstrations, searching behavior, navigation behavior.

Received: 29.05.2024

Document Type: Article

UDC: 006.72

Language: English

Citation: A. Iskandar, A. Hammoud, B. Kovács, “Implicit understanding: decoding swarm behaviors in robots through deep inverse reinforcement learning”, Informatics and Automation, 23:5 (2024), 1485–1504

Citation in format AMSBIB

\Bibitem{IskHamKov24}

\by A.~Iskandar, A.~Hammoud, B.~Kov\'acs

\paper Implicit understanding: decoding swarm behaviors in robots through deep inverse reinforcement learning

\jour Informatics and Automation

\yr 2024

\vol 23

\issue 5

\pages 1485--1504

\mathnet{http://mi.mathnet.ru/trspy1331}

\crossref{https://doi.org/10.15622/ia.23.5.8}

Linking options:

https://www.mathnet.ru/eng/trspy1331

https://www.mathnet.ru/eng/trspy/v23/i5/p1485

Citing articles in Google Scholar: Russian citations, English citations
Related articles in Google Scholar: Russian articles, English articles

Statistics & downloads:
Abstract page:	19
Full-text PDF :	6

Registration to the website

Logotypes