Projects per year
Abstract
Offline reinforcement learning (RL) allows learning sequential behavior from fixed datasets. Since offline datasets do not cover all possible situations, many methods collect additional data during online fine-tuning to improve performance. In general, these methods assume that the transition dynamics remain the same during both the offline and online phases of training. However, in many real-world applications, such as outdoor construction and navigation over rough terrain, it is common for the transition dynamics to vary between the offline and online phases. Moreover, the dynamics may vary during the online fine-tuning. To address this problem of changing dynamics from offline to online RL we propose a residual learning approach that infers dynamics changes to correct the outputs of the offline solution. At the online fine-tuning phase, we train a context encoder to learn a representation that is consistent inside the current online learning environment while being able to predict dynamic transitions.
Original language | English |
---|---|
Pages (from-to) | 1107-1121 |
Number of pages | 15 |
Journal | Proceedings of Machine Learning Research |
Volume | 242 |
Publication status | Published - 2024 |
MoE publication type | A4 Conference publication |
Event | Learning for Dynamics and Control Conference - Oxford, United Kingdom Duration: 15 Jul 2024 → 17 Jul 2024 |
Keywords
- Adaptive RL
- Context Encoding
- Offline-to-Online RL
Fingerprint
Dive into the research topics of 'Residual Learning and Context Encoding for Adaptive Offline-to-Online Reinforcement Learning'. Together they form a unique fingerprint.-
Safe: Turvallinen vahvistusoppiminen epästationaarisissa ympäristöissä nopealla sopeutumisella ja häiriöennustuksella
Pajarinen, J. (Principal investigator), Kostin, N. (Project Member) & Zhao, Y. (Project Member)
01/01/2022 → 31/12/2024
Project: Academy of Finland: Other research funding
-
-: Finnish Center for Artificial Intelligence
Kaski, S. (Principal investigator)
01/01/2019 → 31/12/2022
Project: Academy of Finland: Other research funding