Deep Reinforcement Learning for Continuous Docking Control of Autonomous Underwater Vehicles: A Benchmarking Study

08/05/2021
by   Mihir Patil, et al.

Docking control of an autonomous underwater vehicle (AUV) is a task that is integral to achieving persistent long-term autonomy. This work explores the application of state-of-the-art model-free deep reinforcement learning (DRL) approaches to the task of AUV docking in the continuous domain. We provide a detailed formulation of the reward function used to successfully dock the AUV onto a fixed docking platform. A major contribution that distinguishes our work from previous approaches is the use of a physics simulator to define and simulate the underwater environment as well as the DeepLeng AUV. We propose a new reward function formulation for the docking task, incorporating several components, that outperforms previous reward formulations. We evaluate proximal policy optimization (PPO), twin delayed deep deterministic policy gradients (TD3) and soft actor-critic (SAC) in combination with our reward function. Our evaluation conclusively shows the TD3 agent to be the most efficient and consistent at docking the AUV: over multiple evaluation runs it achieved a 100% [...]. We also show how our reward function formulation improves over the state of the art.


research
03/20/2022

Reinforcement learning reward function in unmanned aerial vehicle control tasks

This paper presents a new reward function that can be used for deep rein...
research
03/06/2022

Leveraging Reward Gradients For Reinforcement Learning in Differentiable Physics Simulations

In recent years, fully differentiable rigid body physics simulators have...
research
04/25/2023

Fulfilling Formal Specifications ASAP by Model-free Reinforcement Learning

We propose a model-free reinforcement learning solution, namely the ASAP...
research
09/13/2023

Self-Refined Large Language Model as Automated Reward Function Designer for Deep Reinforcement Learning in Robotics

Although Deep Reinforcement Learning (DRL) has achieved notable success ...
research
06/01/2018

Multi-vehicle Flocking Control with Deep Deterministic Policy Gradient Method

Flocking control has been studied extensively along with the wide applic...
research
09/09/2021

DROP: Deep relocating option policy for optimal ride-hailing vehicle repositioning

In a ride-hailing system, an optimal relocation of vacant vehicles can s...
research
09/13/2022

Mapless Navigation of a Hybrid Aerial Underwater Vehicle with Deep Reinforcement Learning Through Environmental Generalization

Previous works showed that Deep-RL can be applied to perform mapless nav...
