Seq2Seq Imitation Learning for Tactile Feedback-based Manipulation

Wenyan Yang, Alexandre Angleraud, Roel S. Pieters, Joni Pajarinen, Joni Kristian Kamarainen

Tutkimustuotos: Artikkeli kirjassa/konferenssijulkaisussaConference article in proceedingsScientificvertaisarvioitu

2 Sitaatiot (Scopus)

Abstrakti

Robot control for tactile feedback based manip-ulation can be difficult due to modeling of physical contacts, partial observability of the environment, and noise in perception and control. This work focuses on solving partial observability of contact-rich manipulation tasks as a Sequence-to-Sequence (Seq2Seq) Imitation Learning (IL) problem. The proposed Seq2Seq model first produces a robot-environment interaction sequence to estimate the partially observable environment state variables, and then, the observed interaction sequence is transformed to a control sequence for the task itself. The proposed Seq2Seq IL for tactile feedback based manipulation is experimentally validated on a door-open task in a simulated environment and a snap-on insertion task with a real robot. The model is able to learn both tasks from only 50 expert demonstrations while state-of-the-art reinforcement learning and imitation learning methods fail.

AlkuperäiskieliEnglanti
OtsikkoProceedings - ICRA 2023
AlaotsikkoIEEE International Conference on Robotics and Automation
KustantajaIEEE
Sivut5829-5836
Sivumäärä8
ISBN (elektroninen)979-8-3503-2365-8
DOI - pysyväislinkit
TilaJulkaistu - 2023
OKM-julkaisutyyppiA4 Artikkeli konferenssijulkaisussa
TapahtumaIEEE International Conference on Robotics and Automation - ExCeL Exhibition Center, London, Iso-Britannia
Kesto: 29 toukok. 20232 kesäk. 2023
https://www.icra2023.org

Conference

ConferenceIEEE International Conference on Robotics and Automation
LyhennettäICRA
Maa/AlueIso-Britannia
KaupunkiLondon
Ajanjakso29/05/202302/06/2023
www-osoite

Sormenjälki

Sukella tutkimusaiheisiin 'Seq2Seq Imitation Learning for Tactile Feedback-based Manipulation'. Ne muodostavat yhdessä ainutlaatuisen sormenjäljen.

Siteeraa tätä