Projects per year
Abstract
In recent years, domain randomization over dynamics parameters has gained a lot of traction as a method for sim-to-real transfer of reinforcement learning policies in robotic manipulation; however, finding optimal randomization distributions can be difficult. In this paper, we introduce DROPO, a novel method for estimating domain randomization distributions for safe sim-to-real transfer. Unlike prior work, DROPO only requires a limited, precollected offline dataset of trajectories, and explicitly models parameter uncertainty to match real data using a likelihood-based approach. We demonstrate that DROPO is capable of recovering dynamic parameter distributions in simulation and finding a distribution capable of compensating for an unmodeled phenomenon. We also evaluate the method in two zero-shot sim-to-real transfer scenarios, showing successful domain transfer and improved performance over prior methods.
Original language | English |
---|---|
Article number | 104432 |
Number of pages | 15 |
Journal | Robotics and Autonomous Systems |
Volume | 166 |
DOIs | |
Publication status | Published - Aug 2023 |
MoE publication type | A1 Journal article-refereed |
Keywords
- Domain randomization
- Reinforcement learning
- Robot learning
- Transfer learning
Fingerprint
Dive into the research topics of 'DROPO: Sim-to-real transfer with offline domain randomization'. Together they form a unique fingerprint.Projects
- 2 Finished
-
-: Bridging the Reality Gap in Autonomous Learning
Kyrki, V. (Principal investigator), Alcan, G. (Project Member), Arndt, K. (Project Member) & Blanco Mulero, D. (Project Member)
01/01/2020 → 31/12/2022
Project: Academy of Finland: Other research funding
-
-: AI spider silk threading
Kyrki, V. (Principal investigator), Arndt, K. (Project Member), Petrik, V. (Project Member) & Blanco Mulero, D. (Project Member)
01/01/2018 → 31/12/2022
Project: Academy of Finland: Other research funding