Self and Other Modelling in Cooperative Resource Gathering with Multi-Agent Reinforcement Learning

Authors

Panov A.

Annotation

In this work, we explore the application of the Self-other-Modelling algorithm (SOM) to several agent architectures for the collaborative grid-based environment. Asynchronous Advantage Actor-Critic (A3C) algorithm was compared with the OpenAI Hide-and-seek (HNS) agent. We expand their implementation by adding the SOM algorithm. As an extension of the original environment, we add a stochastic initialization version of the environment. To address the lack of performance in such an environment by all versions of agents, we made further improvements over the A3C and HNS agents, adding the module dedicated to the SOM algorithm. This agent was able to efficiently solve a stochastically initialized version of the environment, showing the potential benefits of such an approach.

External links

DOI: 10.1007/978-3-030-65596-9_9

At the BICA*AI 2020 website: https://bica2020.org/speakers/

Scopus: https://www.scopus.com/record/display.uri?eid=2-s2.0-85098220428&origin=resultslist

eLibrary: https://elibrary.ru/item.asp?id=45050526

ResearchGate: https://www.researchgate.net/publication/347461653_Self_and_Other_Modelling_in_Cooperative_Resource_Gathering_with_Multi-agent_Reinforcement_Learning

Reference link

Davydov, V., Liusko, T., Panov, A. I. (2021). Self and Other Modelling in Cooperative Resource Gathering with Multi-agent Reinforcement Learning // In: Samsonovich, A.V., Gudwin, R.R., Simões, A.d.S. (eds) Brain-Inspired Cognitive Architectures for Artificial Intelligence: BICA*AI 2020. BICA 2020. Advances in Intelligent Systems and Computing, vol 1310. Springer, Cham. Pp. 69–77.