Danger-aware Weighted Advantage Composition of Deep Reinforcement Learning for Robot Navigation

09/11/2018
by   Wei Zhang, et al.
0

Self-navigation, referring to automatically reaching the goal while avoiding collision with obstacles, is a fundamental skill of mobile robots. Currently, Deep Reinforcement Learning (DRL) can enable the robot to navigate in a more complex environment with less computation power compared to conventional methods. However, it is time-consuming and hard to train the robot to learn goal-reaching and obstacle-avoidance skills simultaneously using DRL-based algorithms. In this paper, two Dueling Deep Q Networks (DQN) named Goal Network and Avoidance Network are used to learn the goal-reaching and obstacle-avoidance skills individually. A novel method named danger-aware advantage composition is proposed to fuse the two networks together without any redesigning and retraining. The composed Navigation Network can enable the robot to reach the goal right behind the wall and to navigate in unknown complexed environment safely and quickly.

READ FULL TEXT

page 3

page 4

page 5

page 6

page 7

research
07/14/2018

Hierarchical Reinforcement Learning Framework towards Multi-agent Navigation

This paper proposes a navigation algorithm ori- ented to multi-agent dyn...
research
10/28/2020

Dynamically Feasible Deep Reinforcement Learning Policy for Robot Navigation in Dense Mobile Crowds

We present a novel Deep Reinforcement Learning (DRL) based policy for mo...
research
07/08/2022

HTRON:Efficient Outdoor Navigation with Sparse Rewards via Heavy Tailed Adaptive Reinforce Algorithm

We present a novel approach to improve the performance of deep reinforce...
research
06/15/2023

Evolutionary Curriculum Training for DRL-Based Navigation Systems

In recent years, Deep Reinforcement Learning (DRL) has emerged as a prom...
research
03/26/2021

SegVisRL: Visuomotor Development for a Lunar Rover for Hazard Avoidance using Camera Images

The visuomotor system of any animal is critical for its survival, and th...
research
10/20/2020

Learn to Navigate Maplessly with Varied LiDAR Configurations: A Support Point Based Approach

Deep reinforcement learning (DRL) demonstrates great potential in maples...
research
10/03/2022

Obstacle Avoidance for Robotic Manipulator in Joint Space via Improved Proximal Policy Optimization

Reaching tasks with random targets and obstacles can still be challengin...

Please sign up or login with your details

Forgot password? Click here to reset