Instruction Following with Goal-Conditioned Reinforcement Learning in Virtual Environments

Авторы

Панов А. И. Скрынник А. А. Кудеров П. В.

Аннотация

In this study, we address the issue of enabling an artificial intelligence agent to execute complex language instructions within virtual environments. In our framework, we assume that these instructions involve intricate linguistic structures and multiple interdependent tasks that must be navigated successfully to achieve the desired outcomes. To effectively manage these complexities, we propose a hierarchical framework that combines the deep language comprehension of large language models with the adaptive action-execution capabilities of reinforcement learning agents. The language module (based on LLM) translates the language instruction into a high-level action plan, which is then executed by a pre-trained reinforcement learning agent. We have demonstrated the effectiveness of our approach in two different environments: in IGLU, where agents are instructed to build structures, and in Crafter, where agents perform tasks and interact with objects in the surrounding environment according to language commands.

Внешние ссылки

DOI: 10.3233/FAIA240545

Скачать статью (PDF) на сайте издателя IOS Press (англ.): https://ebooks.iospress.nl/volumearticle/69640

Скачать сборник трудов конференции (PDF) на сайте издателя IOS Press (англ.): https://ebooks.iospress.nl/doi/10.3233/FAIA392

Скачать PDF на arXiv.org (англ.): https://arxiv.org/abs/2407.09287

ResearchGate: https://www.researchgate.net/publication/382251632_Instruction_Following_with_Goal-Conditioned_Reinforcement_Learning_in_Virtual_Environments

Ссылка при цитировании

Zoya Volovikova, Alexey Skrynnik, Petr Kuderov, Aleksandr I. Panov. Instruction Following with Goal-Conditioned Reinforcement Learning in Virtual Environments // Proceedings of ECAI-2024, the 27th European Conference on Artificial Intelligence, 19–24 October 2024, Santiago de Compostela, Spain — Including 13th Conference on Prestigious Applications of Intelligent Systems (PAIS 2024). Volume 392. IOS Press, 2024. Pp. 650–657.