Discrete Codebook World Models for Continuous Control

Aidan Scannell, Mohammadreza Nakhaei, Kalle Kujanpää, Yi Zhao, Kevin Sebastian Luck, Arno Solin, Joni Pajarinen

Research output: Chapter in Book/Report/Conference proceedingConference article in proceedingsScientificpeer-review

8 Downloads (Pure)

Abstract

In reinforcement learning (RL), world models serve as internal simulators, enabling agents to predict environment dynamics and future outcomes in order to make informed decisions. While previous approaches leveraging discrete latent spaces, such as DreamerV3, have demonstrated strong performance in discrete action settings and visual control tasks, their comparative performance in state-based continuous control remains underexplored. In contrast, methods with continuous latent spaces, such as TD-MPC2, have shown notable success in state-based continuous control benchmarks. In this paper, we demonstrate that modeling discrete latent states has benefits over continuous latent states and that discrete codebook encodings are more effective representations for continuous control, compared to alternative encodings, such as one-hot and label-based encodings. Based on these insights, we introduce DCWM: Discrete Codebook World Model, a self-supervised world model with a discrete and stochastic latent space, where latent states are codes from a codebook. We combine DCWM with decision-time planning to get our model-based RL algorithm, named DC-MPC: Discrete Codebook Model Predictive Control, which performs competitively against recent state-of-the-art algorithms, including TD-MPC2 and DreamerV3, on continuous control benchmarks. See our project website www.aidanscannell.com/dcmpc.

Original languageEnglish
Title of host publication13th International Conference on Learning Representations, ICLR 2025
PublisherCurran Associates Inc.
Pages54754-54791
Number of pages38
ISBN (Electronic)9798331320850
Publication statusPublished - 2025
MoE publication typeA4 Conference publication
EventInternational Conference on Learning Representations - Singapore, Singapore, Singapore
Duration: 24 Apr 202528 Apr 2025
Conference number: 13
https://iclr.cc/

Conference

ConferenceInternational Conference on Learning Representations
Abbreviated titleICLR
Country/TerritorySingapore
CitySingapore
Period24/04/202528/04/2025
Internet address

Fingerprint

Dive into the research topics of 'Discrete Codebook World Models for Continuous Control'. Together they form a unique fingerprint.
  • Science-IT

    Hakala, M. (Manager)

    School of Science

    Facility/equipment: Facility

Cite this