Jointly Learning Cost and Constraints from Demonstrations for Safe Trajectory Generation

Research output: Chapter in Book/Report/Conference proceedingConference article in proceedingsScientificpeer-review

Abstract

Learning from Demonstration (LfD) allows robots to mimic human actions. However, these methods do not model constraints crucial to ensure safety of the learned skill. Moreover, even when explicitly modelling constraints, they rely on the assumption of a known cost function, which limits their practical usability for task with unknown cost. In this work we propose a two-step optimization process that allow to estimate cost and constraints by decoupling the learning of cost functions from the identification of unknown constraints within the demonstrated trajectories. Initially, we identify the cost function by isolating the effect of constraints on parts of the demonstrations. Subsequently, a constraint leaning method is used to identify the unknown constraints. Our approach is validated both on simulated trajectories and a real robotic manipulation task. Our experiments show the impact that incorrect cost estimation has on the learned constraints and illustrate how the proposed method is able to infer unknown constraints, such as obstacles, from demonstrated trajectories without any initial knowledge of the cost.
Original languageEnglish
Title of host publication2024 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)
PublisherIEEE
Pages3635-3642
Number of pages8
ISBN (Electronic)979-8-3503-7770-5
DOIs
Publication statusPublished - 14 Oct 2024
MoE publication typeA4 Conference publication
EventIEEE/RSJ International Conference on Intelligent Robots and Systems - ADNEC, Abu Dhabi, United Arab Emirates
Duration: 14 Oct 202418 Oct 2024
https://iros2024-abudhabi.org/

Publication series

NameIEEE International Conference on Intelligent Robots and Systems
ISSN (Print)2153-0858
ISSN (Electronic)2153-0866

Conference

ConferenceIEEE/RSJ International Conference on Intelligent Robots and Systems
Abbreviated titleIROS
Country/TerritoryUnited Arab Emirates
CityAbu Dhabi
Period14/10/202418/10/2024
Internet address

Fingerprint

Dive into the research topics of 'Jointly Learning Cost and Constraints from Demonstrations for Safe Trajectory Generation'. Together they form a unique fingerprint.

Cite this