Co-imagination of Behaviour and Morphology of Agents

Maria Sliacka, Michael Mistry, Roberto Calandra, Ville Kyrki, Kevin Sebastian Luck*

*Corresponding author for this work

Research output: Article in book/conference proceedings; Conference article in proceedings; Scientific; peer-reviewed

Abstract

The field of robot learning has made great advances in developing behaviour learning methodologies capable of learning policies for tasks ranging from manipulation to locomotion. However, the problem of combined learning of behaviour and robot structure, here called co-adaptation, is less studied. Most current co-adapting robot learning approaches rely on model-free algorithms or assume access to an a priori known dynamics model, which requires considerable human engineering. In this work, we investigate the potential of combining model-free and model-based reinforcement learning algorithms for application to co-adaptation problems with unknown dynamics functions. Classical model-based reinforcement learning is concerned with learning the forward dynamics of a specific agent or robot in its environment. However, in the case of jointly learning the behaviour and morphology of agents, each individual agent design implies its own specific dynamics function. Here, the challenge is to learn a dynamics model capable of generalising between the different individual dynamics functions or designs. In other words, the learned dynamics model approximates a multi-dynamics function with the goal of generalising between different agent designs. We present a reinforcement learning algorithm that uses a learned multi-dynamics model for co-adapting a robot's behaviour and morphology using imagined rollouts. We show that using a multi-dynamics model for imagining transitions can lead to better performance for model-free co-adaptation, but open challenges remain.
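The abstract gives no implementation details, so the following is only an illustrative sketch of the general idea of a design-conditioned ("multi-dynamics") model used for imagined rollouts. Everything in it is an assumption made for illustration: the toy dynamics in `true_step`, the scalar design parameter `xi`, the hand-picked features, and the least-squares fit, which stands in for whatever learned model the paper actually uses.

```python
import numpy as np

def true_step(s, a, xi):
    # Hypothetical ground-truth dynamics (an assumption, not from the paper):
    # the design parameter xi scales the effect of the action.
    return 0.9 * s + xi * a

def features(s, a, xi):
    # Hand-picked features standing in for a learned representation.
    # The design xi is an input, so one model covers many designs.
    return np.stack([s, a, xi, xi * a], axis=-1)

def fit_multi_dynamics(designs, n_per_design=50, seed=0):
    # Collect random transitions from several designs and fit a single
    # shared model s' ~ features(s, a, xi) @ w by least squares.
    rng = np.random.default_rng(seed)
    X, y = [], []
    for xi in designs:
        s = rng.uniform(-1.0, 1.0, n_per_design)
        a = rng.uniform(-1.0, 1.0, n_per_design)
        xi_arr = np.full(n_per_design, xi)
        X.append(features(s, a, xi_arr))
        y.append(true_step(s, a, xi_arr))
    w, *_ = np.linalg.lstsq(np.concatenate(X), np.concatenate(y), rcond=None)
    return w

def imagine_rollout(w, s0, xi, policy, horizon=10):
    # Roll the learned model forward without touching the real system.
    states = [float(s0)]
    for _ in range(horizon):
        s = states[-1]
        a = policy(s)
        phi = features(np.array(s), np.array(a), np.array(xi))
        states.append(float(phi @ w))
    return np.array(states)

def real_rollout(s0, xi, policy, horizon=10):
    # Reference rollout on the toy ground-truth dynamics, for comparison.
    states = [float(s0)]
    for _ in range(horizon):
        s = states[-1]
        states.append(float(true_step(s, policy(s), xi)))
    return np.array(states)

if __name__ == "__main__":
    w = fit_multi_dynamics(designs=[0.5, 1.0, 2.0])
    policy = lambda s: -0.3 * s
    # xi = 1.5 was never seen during fitting.
    print(imagine_rollout(w, 1.0, 1.5, policy))
    print(real_rollout(1.0, 1.5, policy))
```

In this toy setting the features happen to match the true dynamics exactly, so the imagined rollout for the unseen design tracks the real one. With a learned model on a real robot this generalisation is not guaranteed, which is consistent with the abstract's remark that open challenges remain.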

Original language: English
Title of host publication: Machine Learning, Optimization, and Data Science - 9th International Conference, LOD 2023
Editors: Giuseppe Nicosia, Varun Ojha, Emanuele La Malfa, Gabriele La Malfa, Panos M. Pardalos, Renato Umeton
Publisher: Springer
Pages: 318-332
Number of pages: 15
ISBN (print): 978-3-031-53968-8
Status: Published - 2024
MoE publication type: A4 Article in conference proceedings
Event: International Conference on Machine Learning, Optimization, and Data Science - Grasmere, United Kingdom
Duration: 22 Sep 2023 to 26 Sep 2023

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 14505 LNCS
ISSN (print): 0302-9743
ISSN (electronic): 1611-3349

Conference

Conference: International Conference on Machine Learning, Optimization, and Data Science
Abbreviated title: LOD
Country/Territory: United Kingdom
City: Grasmere
Period: 22/09/2023 to 26/09/2023
