In this paper, we introduce ObjectZero, a novel reinforcement learning (RL) algorithm that leverages the power of object-level representations to model dynamic environments more effectively. Unlike traditional approaches that process the world as a single undifferentiated input, our method employs Graph Neural Networks (GNNs) to capture intricate interactions among multiple objects. These objects, which can be manipulated and interact with each other, serve as the foundation for our model’s understanding of the environment. We trained the algorithm in a complex setting teeming with diverse, interactive objects, demonstrating its ability to effectively learn and predict object dynamics. Our results highlight that a structured world model operating on object-centric representations can be successfully integrated into a model-based RL algorithm utilizing Monte Carlo Tree Search as a planning module.
DOI: 10.1007/978-3-032-13612-1_42
Download the proceedings from Springer Nature: https://link.springer.com/content/pdf/10.1007/978-3-032-13612-1.pdf
ResearchGate: https://www.researchgate.net/publication/399128067_Object-Centric_World_Models_Meet_Monte_Carlo_Tree_Search
Vakhitov, R., Ugadiarov, L., Panov, A. (2026). Object-Centric World Models Meet Monte Carlo Tree Search // Proceedings of the Ninth International Scientific Conference “Intelligent Information Technologies for Industry” (IITI’25), Volume 2, pp. 481–491.