Train Offline, Test Online: A Real Robot Learning Benchmark

Gaoyue Zhou*1    Victoria Dean*1    Mohan Kumar Srirama1    Aravind Rajeswaran2,5    Jyothish Pari3   
Kyle Hatch4    Aryan Jain5    Tianhe Yu4    Pieter Abbeel5    Lerrel Pinto3    Chelsea Finn4    Abhinav Gupta1   

1 Carnegie Mellon University, Robotics Institute    2 University of Washington    3 New York University    4 Stanford University    5 University of California, Berkeley   

ri uw nyu stanford bair

In Submission to IEEE International Conference on Robotics and Automation (ICRA), 2023

CodeDataset

Three challenges limit the progress of robot learning research: robots are expensive (few labs can participate), everyone uses different robots (findings do not generalize across labs), and we lack internet-scale robotics data. We take on these challenges via a new benchmark: Train Offline, Test Online (TOTO). TOTO provides remote users with access to shared robots for evaluating methods on common tasks and an open-source dataset of these tasks for offline training. Its manipulation task suite requires challenging generalization to unseen objects, positions, and lighting. We present initial results on TOTO comparing five pretrained visual representations and four offline policy learning baselines, remotely contributed by five institutions. The real promise of TOTO, however, lies in the future: we release the benchmark for additional submissions from any user, enabling easy, direct comparison to several methods without the need to obtain hardware or collect data.


Benchmark Overview

toto_teaser

Results

Pouring

toto

Scooping

toto
pouring_results
toto
pouring_results
toto