Homotopy Based Reinforcement Learning with Maximum Entropy for Autonomous Air Combat

12/01/2021
by   Yiwen Zhu, et al.
Intelligent decision-making for the unmanned combat aerial vehicle (UCAV) has long been a challenging problem. Conventional search methods can hardly satisfy the real-time demands of highly dynamic air combat scenarios. Reinforcement learning (RL) methods can significantly shorten decision time by using neural networks. However, the sparse reward problem limits their convergence speed, and a reward built from artificial prior experience can easily steer the policy away from the optimum of the original task, which makes applying RL to air combat difficult. In this paper, we propose a homotopy-based soft actor-critic method (HSAC) that addresses these problems by following the homotopy path between the original task with sparse reward and the auxiliary task with artificial prior experience reward. The convergence and feasibility of this method are also proved in this paper. To confirm the feasibility of our method, we first construct a detailed 3D air combat simulation environment for training RL-based methods, and we then implement our method in both the attack horizontal flight UCAV task and the self-play confrontation task. Experimental results show that our method performs better than methods that use only the sparse reward or only the artificial prior experience reward. The agent trained by our method can reach more than 98.3 67.4 methods.
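
The idea of following a homotopy path between the auxiliary task and the original task can be illustrated with a minimal sketch. The abstract does not give the paper's actual formulation, so everything here is an assumption made for illustration: the linear blend, the parameter name `lam`, and the helper names `homotopy_reward` and `linear_schedule` are hypothetical and are not taken from HSAC itself.

```python
def homotopy_reward(r_sparse: float, r_shaped: float, lam: float) -> float:
    """Blend two reward signals along an assumed linear homotopy.

    lam = 0 gives the auxiliary task (artificial prior experience reward);
    lam = 1 gives the original task (sparse reward).
    The linear form is an illustrative assumption, not the paper's definition.
    """
    if not 0.0 <= lam <= 1.0:
        raise ValueError("lam must lie in [0, 1]")
    return (1.0 - lam) * r_shaped + lam * r_sparse


def linear_schedule(step: int, total_steps: int) -> float:
    """Hypothetical schedule that moves lam from 0 to 1 over training."""
    if total_steps <= 0:
        raise ValueError("total_steps must be positive")
    return min(1.0, max(0.0, step / total_steps))


if __name__ == "__main__":
    # Early in training the shaped reward dominates; late in training
    # only the sparse reward of the original task remains.
    for step in (0, 50, 100):
        lam = linear_schedule(step, total_steps=100)
        print(step, lam, homotopy_reward(r_sparse=0.0, r_shaped=2.0, lam=lam))
```

In a sketch like this, an RL agent would be trained on the blended reward while `lam` is increased gradually, so that the policy learned on the easier auxiliary task is carried toward the original sparse-reward task.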

research
11/03/2021

A Self-adaptive LSAC-PID Approach based on Lyapunov Reward Shaping for Mobile Robots

To solve the coupling problem of control loops and the adaptive paramete...
research
05/03/2021

Hierarchical Reinforcement Learning for Air-to-Air Combat

Artificial Intelligence (AI) is becoming a critical component in the def...
research
07/10/2021

Learning-to-Dispatch: Reinforcement Learning Based Flight Planning under Emergency

The effectiveness of resource allocation under emergencies especially hu...
research
03/06/2023

Reinforcement Learning Based Self-play and State Stacking Techniques for Noisy Air Combat Environment

Reinforcement learning (RL) has recently proven itself as a powerful ins...
research
04/28/2022

Actor-Critic Scheduling for Path-Aware Air-to-Ground Multipath Multimedia Delivery

Reinforcement Learning (RL) has recently found wide applications in netw...
research
01/14/2022

Reinforcement Learning based Air Combat Maneuver Generation

The advent of artificial intelligence technology paved the way of many r...
research
09/14/2017

A2-RL: Aesthetics Aware Reinforcement Learning for Image Cropping

Image cropping aims at improving the aesthetic quality of images by adju...
