DROPO: Sim-to-real transfer with offline domain randomization

Gabriele Tiboni, Karol Arndt*, Ville Kyrki

*Corresponding author for this work

Research output: Contribution to journalArticleScientificpeer-review

23 Citations (Scopus)
157 Downloads (Pure)

Abstract

In recent years, domain randomization over dynamics parameters has gained a lot of traction as a method for sim-to-real transfer of reinforcement learning policies in robotic manipulation; however, finding optimal randomization distributions can be difficult. In this paper, we introduce DROPO, a novel method for estimating domain randomization distributions for safe sim-to-real transfer. Unlike prior work, DROPO only requires a limited, precollected offline dataset of trajectories, and explicitly models parameter uncertainty to match real data using a likelihood-based approach. We demonstrate that DROPO is capable of recovering dynamic parameter distributions in simulation and finding a distribution capable of compensating for an unmodeled phenomenon. We also evaluate the method in two zero-shot sim-to-real transfer scenarios, showing successful domain transfer and improved performance over prior methods.

Original languageEnglish
Article number104432
Number of pages15
JournalRobotics and Autonomous Systems
Volume166
DOIs
Publication statusPublished - Aug 2023
MoE publication typeA1 Journal article-refereed

Funding

This work was supported by Academy of Finland grants 317020 and 328399 . We acknowledge the computational resources generously provided by CSC – IT Center for Science, Finland, and by the Aalto Science-IT project.

Keywords

  • Domain randomization
  • Reinforcement learning
  • Robot learning
  • Transfer learning

Fingerprint

Dive into the research topics of 'DROPO: Sim-to-real transfer with offline domain randomization'. Together they form a unique fingerprint.
  • -: Bridging the Reality Gap in Autonomous Learning

    Kyrki, V. (Principal investigator), Copado Rodriguez, J. (Project Member), Hietala, H. (Project Member), Muff, J. (Project Member), Blanco Mulero, D. (Project Member), Alcan, G. (Project Member) & Arndt, K. (Project Member)

    01/01/202031/12/2022

    Project: Academy of Finland: Other research funding

  • -: AI spider silk threading

    Kyrki, V. (Principal investigator), Arndt, K. (Project Member), Blanco Mulero, D. (Project Member), Petrik, V. (Project Member), Le, D. (Project Member) & Sari, O. (Project Member)

    01/01/201831/12/2022

    Project: Academy of Finland: Other research funding

  • Science-IT

    Hakala, M. (Manager)

    School of Science

    Facility/equipment: Facility

Cite this