Abstract
Imitation learning (IL) algorithms typically distill experience into parametric behavior policies to mimic expert demonstrations. However, with limited demonstrations, existing methods often struggle to generate accurate actions, particularly under partial observability. To address this problem, we introduce a few-shot IL approach, ReMoBot, which directly retrieves information from demonstrations to solve Mobile manipulation tasks with ego-centric visual observations. Given the current observation, ReMoBot utilizes vision foundation models to identify relevant demonstrations, considering visual similarity w.r.t. both individual observations and history trajectories. A motion selection policy then selects the proper command for the robot until the task is successfully completed. The performance of ReMoBot is evaluated on three mobile manipulation tasks with a Boston Dynamics Spot robot in both simulation and the real world. With only 20 demonstrations, ReMoBot outperforms the baselines, achieving high success rates in Table Uncover (70%) and Gap Cover (80%), while also showing promising performance on the more challenging Curtain Open task in the real-world setting. Furthermore, ReMoBot demonstrates generalization across varying robot positions, object sizes, and material types.
| Original language | English |
|---|---|
| Title of host publication | CoRL 2025 RemembeRL Workshop |
| Subtitle of host publication | What can past experience tell us about our current action? |
| Number of pages | 15 |
| Publication status | Published - 27 Sept 2025 |
| MoE publication type | D3 Professional conference proceedings |
| Event | RemembeRL: What can past experience tell us about our current action? - COEX Convention & Exhibition Center, Seoul, Korea, Democratic People's Republic of Duration: 27 Sept 2025 → 27 Sept 2025 https://rememberl-corl25.github.io/ |
Workshop
| Workshop | RemembeRL |
|---|---|
| Abbreviated title | RemembeRL |
| Country/Territory | Korea, Democratic People's Republic of |
| City | Seoul |
| Period | 27/09/2025 → 27/09/2025 |
| Other | CoRL 2025 RemembeRL Workshop : What can past experience tell us about our current action? |
| Internet address |
Fingerprint
Dive into the research topics of 'ReMoBot: Retrieval-Based Few-Shot Imitation Learning for Mobile Manipulation with Vision Foundation Models'. Together they form a unique fingerprint.Equipment
-
Micro-electronics, Digital and Autonomous Systems (MIDAS)
School of Electrical EngineeringFacility/equipment: Facility
Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver