Visual Backtracking Teleoperation: A Data Collection Protocol for Offline Image-Based Reinforcement Learning

10/05/2022
by David Brandfonbrener, et al.

We consider how to most efficiently leverage teleoperator time to collect data for learning robust image-based value functions and policies for sparse reward robotic tasks. To accomplish this goal, we modify the process of data collection to include more than just successful demonstrations of the desired task. Instead, we develop a novel protocol that we call Visual Backtracking Teleoperation (VBT), which deliberately collects a dataset of visually similar failures, recoveries, and successes. VBT data collection is particularly useful for efficiently learning accurate value functions from small datasets of image-based observations. We demonstrate VBT on a real robot to perform continuous control from image observations for the deformable manipulation task of T-shirt grasping. We find that by adjusting the data collection process we improve the quality of both the learned value functions and policies over a variety of baseline methods for data collection. Specifically, we find that offline reinforcement learning on VBT data outperforms standard behavior cloning on successful demonstration data by 13% when both methods are given equal-sized datasets of 60 minutes of data from the real robot.
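To illustrate why failures and recoveries might matter for value learning under sparse rewards, here is a minimal sketch that computes discounted return-to-go targets for a few made-up episodes. Everything in it is an assumption for illustration and is not taken from the paper: the reward convention (1 only on the step where the task succeeds), the discount factor, the `discounted_returns` helper, and the episode layouts.

```python
from typing import List


def discounted_returns(rewards: List[float], gamma: float = 0.9) -> List[float]:
    """Compute the return-to-go G_t = r_t + gamma * G_{t+1} for every step."""
    returns = [0.0] * len(rewards)
    running = 0.0
    # Walk backwards so each step can reuse the return of the step after it.
    for t in reversed(range(len(rewards))):
        running = rewards[t] + gamma * running
        returns[t] = running
    return returns


# Hypothetical sparse-reward episodes (assumed convention: reward 1 only on success).
success_only = [0.0, 0.0, 1.0]           # approach, grasp, success
failure_only = [0.0, 0.0, 0.0]           # approach, missed grasp, no success
vbt_style = [0.0, 0.0, 0.0, 0.0, 1.0]    # approach, missed grasp, backtrack, retry, success

for name, rewards in [
    ("success_only", success_only),
    ("failure_only", failure_only),
    ("vbt_style", vbt_style),
]:
    print(name, discounted_returns(rewards))
```

Under these assumptions, the failure-only episode produces all-zero targets, while the episode containing a failed attempt followed by a backtrack and a success assigns lower but nonzero targets to the steps around the failure. A dataset of successes alone never shows the learner what a low-value state looks like.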


research
05/10/2021

Efficient Self-Supervised Data Collection for Offline Robot Learning

A practical approach to robot reinforcement learning is to first collect...
research
01/27/2022

The Challenges of Exploration for Offline Reinforcement Learning

Offline Reinforcement Learning (ORL) enables us to separately study the t...
research
06/09/2022

Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations

Offline reinforcement learning has shown great promise in leveraging lar...
research
11/27/2020

Offline Learning from Demonstrations and Unlabeled Experience

Behavior cloning (BC) is often practical for robot learning because it a...
research
02/25/2020

Scalable Multi-Task Imitation Learning with Autonomous Improvement

While robot learning has demonstrated promising results for enabling rob...
research
11/06/2022

Leveraging Haptic Feedback to Improve Data Quality and Quantity for Deep Imitation Learning Models

Learning from demonstration (LfD) is a proven technique to teach robots ...
research
12/16/2022

Offline Reinforcement Learning for Visual Navigation

Reinforcement learning can enable robots to navigate to distant goals wh...
