Train Offline, Test Online: A Real Robot Learning Benchmark

Gaoyue Zhou¹ Victoria Dean¹ Mohan Kumar Srirama¹ Aravind Rajeswaran^2,5 Jyothish Pari³
Kyle Hatch⁴ Aryan Jain⁵ Tianhe Yu⁴ Pieter Abbeel⁵ Lerrel Pinto³ Chelsea Finn⁴ Abhinav Gupta¹

¹ Carnegie Mellon University, Robotics Institute ² University of Washington ³ New York University ⁴ Stanford University ⁵ University of California, Berkeley

In Submission to IEEE International Conference on Robotics and Automation (ICRA), 2023

Code Dataset

Three challenges limit the progress of robot learning research: robots are expensive (few labs can participate), everyone uses different robots (findings do not generalize across labs), and we lack internet-scale robotics data. We take on these challenges via a new benchmark: Train Offline, Test Online (TOTO). TOTO provides remote users with access to shared robots for evaluating methods on common tasks and an open-source dataset of these tasks for offline training. Its manipulation task suite requires challenging generalization to unseen objects, positions, and lighting. We present initial results on TOTO comparing five pretrained visual representations and four offline policy learning baselines, remotely contributed by five institutions. The real promise of TOTO, however, lies in the future: we release the benchmark for additional submissions from any user, enabling easy, direct comparison to several methods without the need to obtain hardware or collect data.

Benchmark Overview

Results

Pouring

Scooping