Abstract
The ability to plan actions on multiple levels of abstraction enables intelligent agents to solve complex tasks effectively. However, learning the models for both low and high-level planning from demonstrations has proven challenging, especially with higher-dimensional inputs. To address this issue, we propose to use reinforcement learning to identify subgoals in expert trajectories by associating the magnitude of the rewards with the predictability of low-level actions given the state and the chosen subgoal. We build a vector-quantized generative model for the identified subgoals to perform subgoal-level planning. In experiments, the algorithm excels at solving complex, long-horizon decision-making problems outperforming state-of-the-art. Because of its ability to plan, our algorithm can find better trajectories than the ones in the training set.
| Original language | English |
|---|---|
| Title of host publication | Proceedings of the 40th International Conference on Machine Learning |
| Editors | Andread Krause, Emma Brunskill, Kyunghyun Cho, Barbara Engelhardt, Sivan Sabato, Jonathan Scarlett |
| Publisher | JMLR |
| Pages | 17896-17919 |
| Number of pages | 24 |
| Publication status | Published - Jul 2023 |
| MoE publication type | A4 Conference publication |
| Event | International Conference on Machine Learning - Honolulu, United States Duration: 23 Jul 2023 → 29 Jul 2023 Conference number: 40 |
Publication series
| Name | Proceedings of Machine Learning Research |
|---|---|
| Publisher | PMLR |
| Volume | 202 |
| ISSN (Electronic) | 2640-3498 |
Conference
| Conference | International Conference on Machine Learning |
|---|---|
| Abbreviated title | ICML |
| Country/Territory | United States |
| City | Honolulu |
| Period | 23/07/2023 → 29/07/2023 |
Fingerprint
Dive into the research topics of 'Hierarchical Imitation Learning with Vector Quantized Models'. Together they form a unique fingerprint.Projects
- 2 Finished
-
Safe: Turvallinen vahvistusoppiminen epästationaarisissa ympäristöissä nopealla sopeutumisella ja häiriöennustuksella
Pajarinen, J. (Principal investigator), Zhao, Y. (Project Member) & Kostin, N. (Project Member)
01/01/2022 → 31/12/2024
Project: RCF Other
-
-: Finnish Center for Artificial Intelligence
Kaski, S. (Principal investigator)
01/01/2019 → 31/12/2022
Project: Academy of Finland: Other research funding
Equipment
Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver