From Play to Policy: Conditional Behavior Generation from Uncurated Robot Data

10/18/2022
by   Zichen Jeff Cui, et al.
0

While large-scale sequence modeling from offline data has led to impressive performance gains in natural language and image generation, directly translating such ideas to robotics has been challenging. One critical reason for this is that uncurated robot demonstration data, i.e. play data, collected from non-expert human demonstrators are often noisy, diverse, and distributionally multi-modal. This makes extracting useful, task-centric behaviors from such data a difficult generative modeling problem. In this work, we present Conditional Behavior Transformers (C-BeT), a method that combines the multi-modal generation ability of Behavior Transformer with future-conditioned goal specification. On a suite of simulated benchmark tasks, we find that C-BeT improves upon prior state-of-the-art work in learning from play data by an average of 45.7 that useful task-centric behaviors can be learned on a real-world robot purely from play data without any task labels or reward information. Robot videos are best viewed on our project website: https://play-to-policy.github.io

READ FULL TEXT

page 2

page 4

page 5

page 6

page 15

page 16

research
06/22/2022

Behavior Transformers: Cloning k modes with one stone

While behavior learning has made impressive progress in recent times, it...
research
03/10/2022

PLATO: Predicting Latent Affordances Through Object-Centric Play

Constructing a diverse repertoire of manipulation skills in a scalable f...
research
06/11/2020

Learning to Play by Imitating Humans

Acquiring multiple skills has commonly involved collecting a large numbe...
research
09/02/2021

Learning Language-Conditioned Robot Behavior from Offline Data and Crowd-Sourced Annotation

We study the problem of learning a range of vision-based manipulation ta...
research
03/25/2022

Reshaping Robot Trajectories Using Natural Language Commands: A Study of Multi-Modal Data Alignment Using Transformers

Natural language is the most intuitive medium for us to interact with ot...
research
05/12/2022

A Generalist Agent

Inspired by progress in large-scale language modeling, we apply a simila...
research
06/26/2019

From Multi-modal Property Dataset to Robot-centric Conceptual Knowledge About Household Objects

Tool-use applications in robotics require conceptual knowledge about obj...

Please sign up or login with your details

Forgot password? Click here to reset