Offline Reinforcement Learning for Visual Navigation

12/16/2022
by   Dhruv Shah, et al.
0

Reinforcement learning can enable robots to navigate to distant goals while optimizing user-specified reward functions, including preferences for following lanes, staying on paved paths, or avoiding freshly mowed grass. However, online learning from trial-and-error for real-world robots is logistically challenging, and methods that instead can utilize existing datasets of robotic navigation data could be significantly more scalable and enable broader generalization. In this paper, we present ReViND, the first offline RL system for robotic navigation that can leverage previously collected data to optimize user-specified reward functions in the real-world. We evaluate our system for off-road navigation without any additional data collection or fine-tuning, and show that it can navigate to distant goals using only offline training from this dataset, and exhibit behaviors that qualitatively differ based on the user-specified reward function.

READ FULL TEXT

page 2

page 6

page 7

page 16

research
05/17/2023

Reward-agnostic Fine-tuning: Provable Statistical Benefits of Hybrid Reinforcement Learning

This paper studies tabular reinforcement learning (RL) in the hybrid set...
research
10/19/2021

Offline Reinforcement Learning with Value-based Episodic Memory

Offline reinforcement learning (RL) shows promise of applying RL to real...
research
05/31/2023

Adaptive and Explainable Deployment of Navigation Skills via Hierarchical Deep Reinforcement Learning

For robotic vehicles to navigate robustly and safely in unseen environme...
research
01/28/2019

Designing a Multi-Objective Reward Function for Creating Teams of Robotic Bodyguards Using Deep Reinforcement Learning

We are considering a scenario where a team of bodyguard robots provides ...
research
06/02/2023

SACSoN: Scalable Autonomous Data Collection for Social Navigation

Machine learning provides a powerful tool for building socially complian...
research
10/05/2022

Visual Backtracking Teleoperation: A Data Collection Protocol for Offline Image-Based Reinforcement Learning

We consider how to most efficiently leverage teleoperator time to collec...
research
08/08/2023

Optimizing Algorithms From Pairwise User Preferences

Typical black-box optimization approaches in robotics focus on learning ...

Please sign up or login with your details

Forgot password? Click here to reset