Research on Autonomous Maneuvering Decision of UCAV based on Approximate Dynamic Programming

by Zhencai Hu, et al.

Unmanned aircraft systems can perform more dangerous and difficult missions than manned aircraft systems. In highly complex and rapidly changing tasks such as air combat, the maneuvering decision mechanism must sense the combat situation accurately and select the optimal strategy in real time. This paper presents a formulation of a 3-D one-on-one air combat maneuvering problem and an approximate dynamic programming approach for computing an optimal policy for autonomous maneuvering decision making. The aircraft learns combat strategies through reinforcement learning: it senses the environment, takes available maneuvering actions, and receives reward signals as feedback. To address the dimensional explosion problem in air combat, the proposed method is implemented through feature selection, trajectory sampling, function approximation, and Bellman backup operations in an air combat simulation environment. This approximate dynamic programming approach responds quickly to a rapidly changing tactical situation and learns long-term planning without any explicitly coded air combat rule base.
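The four ingredients named in the abstract (feature selection, trajectory sampling, function approximation, Bellman backup) can be illustrated with a minimal fitted value iteration sketch. This is not the paper's 3-D air combat model: the one-dimensional state, the three actions, the reward, the quadratic features, and every function name below are assumptions chosen only to keep the example small and runnable.

```python
import numpy as np

GAMMA = 0.9                            # discount factor (assumed value)
ACTIONS = np.array([-0.1, 0.0, 0.1])   # hypothetical discrete "maneuvers"

def features(x):
    """Hand-picked features of the state (stand-in for feature selection)."""
    x = np.asarray(x, dtype=float)
    return np.stack([np.ones_like(x), x, x**2], axis=-1)

def step(x, a):
    """Toy deterministic dynamics: move along a line, clipped to [-1, 1]."""
    return np.clip(x + a, -1.0, 1.0)

def reward(x):
    """Toy reward: highest at the origin (assumed goal state)."""
    return -np.abs(x)

def fitted_value_iteration(n_samples=200, n_iters=50, seed=0):
    """Approximate the value function as features(x) @ w."""
    rng = np.random.default_rng(seed)
    states = rng.uniform(-1.0, 1.0, size=n_samples)   # sampled states
    phi = features(states)
    w = np.zeros(phi.shape[1])
    for _ in range(n_iters):
        # Bellman backup: one-step lookahead over every action.
        next_states = step(states[:, None], ACTIONS[None, :])
        q = reward(next_states) + GAMMA * (features(next_states) @ w)
        targets = q.max(axis=1)
        # Function approximation: least-squares fit to the backed-up values.
        w, *_ = np.linalg.lstsq(phi, targets, rcond=None)
    return w

def greedy_action(x, w):
    """Pick the action whose successor state has the best backed-up value."""
    next_states = step(x, ACTIONS)
    q = reward(next_states) + GAMMA * (features(next_states) @ w)
    return ACTIONS[int(np.argmax(q))]

w = fitted_value_iteration()
print(greedy_action(0.5, w))   # action chosen from a state right of the goal
```

At decision time only the one-step lookahead in `greedy_action` runs, which is why an approach of this kind can respond quickly once the value function has been fitted offline.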




The Value Iteration Algorithm is Not Strongly Polynomial for Discounted Dynamic Programming

This note provides a simple example demonstrating that, if exact computa...

A Deep Reinforcement Learning Driving Policy for Autonomous Road Vehicles

This work regards our preliminary investigation on the problem of path p...

A 3D Game Theoretical Framework for the Evaluation of Unmanned Aircraft Systems Airspace Integration Concepts

Predicting the outcomes of integrating Unmanned Aerial Systems (UAS) int...

Q-learning Decision Transformer: Leveraging Dynamic Programming for Conditional Sequence Modelling in Offline RL

Recent works have shown that tackling offline reinforcement learning (RL...

Optimal Multiple Stopping Rule for Warm-Starting Sequential Selection

In this paper we present the Warm-starting Dynamic Thresholding algorith...

Single item stochastic lot sizing problem considering capital flow and business overdraft

This paper introduces capital flow to the single item stochastic lot siz...
