← ML Research Wiki / 2401.02117

Mobile ALOHA: Learning Bimanual Mobile Manipulation with Low-Cost Whole-Body Teleoperation

Zipeng Fu Stanford University, Tony Z Zhao Stanford University, Chelsea Finn Stanford University (2024)

Paper Information
arXiv ID
Venue
arXiv.org
Domain
Robotics
SOTA Claim
Yes
Code
Reproducibility
8/10

Abstract

Abstract not available.

Summary

This paper discusses the Mobile ALOHA system, a low-cost whole-body teleoperation framework designed for bimanual mobile manipulation tasks. It emphasizes the importance of imitating complex mobile manipulation through imitation learning, which combines data collected from its novel teleoperation system with existing static datasets to enhance performance. Mobile ALOHA successfully demonstrates multiple complex tasks, improving success rates significantly when utilizing co-training strategies. The system enables high-performance operation with only a small number of human demonstrations in diverse real-world environments, thus showcasing a new approach to robotic manipulation.

Methods

This paper employs the following methods:

  • Imitation Learning
  • Supervised Behavior Cloning
  • Co-training

Models Used

  • Mobile ALOHA
  • ALOHA
  • ACT
  • Diffusion Policy
  • VINN

Datasets

The following datasets were used in this research:

  • Static ALOHA datasets

Evaluation Metrics

  • Success Rate

Results

  • 95% success on Wipe Wine
  • 95% success on Call Elevator
  • 85% success on Use Cabinet
  • 80% success on Rinse Pan
  • 80% success on Push Chairs

Limitations

The authors identified the following limitations:

  • Current footprint of Mobile ALOHA may be too narrow for certain paths
  • Fixed height of arms makes lower cabinets difficult to reach
  • Robot cannot autonomously improve or explore new knowledge

Technical Requirements

  • Number of GPUs: 1
  • GPU Type: NVIDIA 3070 Ti

Keywords

Mobile manipulation Whole-body control Imitation learning Teleoperation Bimanual manipulation

Papers Using Similar Methods

External Resources