Abstract
In reinforcement learning (RL), world models serve as internal simulators, enabling agents to predict environment dynamics and future outcomes in order to make informed decisions. While previous approaches leveraging discrete latent spaces, such as DreamerV3, have demonstrated strong performance in discrete action settings and visual control tasks, their comparative performance in state-based continuous control remains underexplored. In contrast, methods with continuous latent spaces, such as TD-MPC2, have shown notable success in state-based continuous control benchmarks. In this paper, we demonstrate that modeling discrete latent states has benefits over continuous latent states and that discrete codebook encodings are more effective representations for continuous control, compared to alternative encodings, such as one-hot and label-based encodings. Based on these insights, we introduce DCWM: Discrete Codebook World Model, a self-supervised world model with a discrete and stochastic latent space, where latent states are codes from a codebook. We combine DCWM with decision-time planning to get our model-based RL algorithm, named DC-MPC: Discrete Codebook Model Predictive Control, which performs competitively against recent state-of-the-art algorithms, including TD-MPC2 and DreamerV3, on continuous control benchmarks. See our project website www.aidanscannell.com/dcmpc.
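The three encodings contrasted in the abstract can be illustrated with a minimal sketch. This is not the paper's implementation: the grid-shaped codebook, the nearest-neighbour quantizer, and all function names (`make_codebook`, `quantize`, `one_hot`, `label`) are assumptions chosen for illustration. The sketch shows how the same discrete latent state can be represented as a codebook vector, a one-hot vector, or a single label.

```python
from itertools import product


def make_codebook(levels_per_dim):
    """Hypothetical codebook: every combination of evenly spaced levels in [-1, 1]."""
    axes = []
    for n in levels_per_dim:
        axes.append([-1.0 + 2.0 * k / (n - 1) for k in range(n)])
    return list(product(*axes))


def quantize(z, codebook):
    """Return (index, code) of the codebook entry nearest to z (squared Euclidean distance)."""
    def sq_dist(code):
        return sum((a - b) ** 2 for a, b in zip(z, code))

    index = min(range(len(codebook)), key=lambda i: sq_dist(codebook[i]))
    return index, codebook[index]


def one_hot(index, size):
    """One-hot encoding: a vector of length `size` with a single 1 at `index`."""
    return [1.0 if i == index else 0.0 for i in range(size)]


def label(index):
    """Label encoding: the raw index as a single scalar feature."""
    return [float(index)]


# Assumed toy setup: 2 latent dimensions with 3 levels each, giving 9 codes.
codebook = make_codebook([3, 3])
index, code = quantize([0.9, -0.2], codebook)

print("codebook encoding:", code)                        # low-dimensional code vector
print("one-hot encoding: ", one_hot(index, len(codebook)))
print("label encoding:   ", label(index))
```

In this toy setup the codebook vector stays two-dimensional and codes that are close in the latent space remain close as vectors, whereas the one-hot vector grows with the number of codes and treats all codes as equally distant, and the label imposes a single ordering on them.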
| Original language | English |
|---|---|
| Title of host publication | 13th International Conference on Learning Representations, ICLR 2025 |
| Publisher | Curran Associates Inc. |
| Pages | 54754-54791 |
| Number of pages | 38 |
| ISBN (Electronic) | 9798331320850 |
| Publication status | Published - 2025 |
| MoE publication type | A4 Conference publication |
| Event | International Conference on Learning Representations, Singapore, Singapore. Duration: 24 Apr 2025 → 28 Apr 2025. Conference number: 13. https://iclr.cc/ |
Conference
| Conference | International Conference on Learning Representations |
|---|---|
| Abbreviated title | ICLR |
| Country/Territory | Singapore |
| City | Singapore |
| Period | 24/04/2025 → 28/04/2025 |
| Internet address | https://iclr.cc/ |
Fingerprint
Dive into the research topics of 'Discrete Codebook World Models for Continuous Control'. Together they form a unique fingerprint.
Projects
- MARL: Efficient and Principled Multi-Agent Reinforcement Learning
  Pajarinen, J. (Principal investigator), Tuomisto, J. (Project Member), Yang, W. (Project Member), Zhao, Y. (Project Member), Zhao, W. (Project Member) & Xuan, C. (Project Member)
  01/09/2023 → 31/08/2027
  Project: RCF Academy Project
- Solin Arno /AoF Fellow Salary: Probabilistic principles for latent space exploration in deep learning
  Solin, A. (Principal investigator) & Mereu, R. (Project Member)
  01/09/2021 → 31/08/2026
  Project: RCF Academy Research Fellow (new)
- BIOND: BIOND
  Pajarinen, J. (Principal investigator) & Nakhaei, M. (Project Member)
  01/08/2023 → 31/07/2025
  Project: BF Co-Innovation