Self and Other Modelling in Cooperative Resource Gathering with Multi-Agent Reinforcement Learning

Авторы

Панов А. И.

Аннотация

In this work, we explore the application of the Self-other-Modelling algorithm (SOM) to several agent architectures for the collaborative grid-based environment. Asynchronous Advantage Actor-Critic (A3C) algorithm was compared with the OpenAI Hide-and-seek (HNS) agent. We expand their implementation by adding the SOM algorithm. As an extension of the original environment, we add a stochastic initialization version of the environment. To address the lack of performance in such an environment by all versions of agents, we made further improvements over the A3C and HNS agents, adding the module dedicated to the SOM algorithm. This agent was able to efficiently solve a stochastically initialized version of the environment, showing the potential benefits of such an approach.

Внешние ссылки

DOI: 10.1007/978-3-030-65596-9_9

На сайте BICA*AI 2020: https://bica2020.org/speakers/

Scopus: https://www.scopus.com/record/display.uri?eid=2-s2.0-85098220428&origin=resultslist

РИНЦ: https://elibrary.ru/item.asp?id=45050526

ResearchGate: https://www.researchgate.net/publication/347461653_Self_and_Other_Modelling_in_Cooperative_Resource_Gathering_with_Multi-agent_Reinforcement_Learning

Ссылка при цитировании

Давыдов В., Люсько Т., Панов А. И. Self and Other Modelling in Cooperative Resource Gathering with Multi-Agent Reinforcement Learning // Brain-Inspired Cognitive Architectures for Artificial Intelligence: BICA*AI 2020. — Advances in Intelligent Systems and Computing, Vol.1310. — С. 69–77.