Adjust Planning Strategies to Accommodate Reinforcement Learning Agents

03/19/2020
by Xuerun Chen, et al.

In agent control problems, the idea of combining reinforcement learning and planning has attracted much attention. The two methods focus on micro-level and macro-level actions, respectively, and their advantages can be realized together when they cooperate well. Essential to this cooperation is finding an appropriate boundary that assigns a different function to each method. Such a boundary can be represented by the parameters of a planning algorithm. In this paper, we create an optimization strategy for planning parameters by analyzing the connection between reaction and planning; we also create a non-gradient method to accelerate the optimization. The whole algorithm can find a satisfactory setting of the planning parameters, making full use of the reaction capability of a specific agent.
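The abstract does not spell out the non-gradient method, so the following is only a minimal sketch of the general idea of tuning a planning parameter without gradients. It uses a generic one-dimensional pattern search, which is not the authors' algorithm. The names `evaluate` and `spacing`, and the toy objective with its peak at 3.0, are hypothetical stand-ins for running a trained agent under a given planner setting and measuring its performance.

```python
def evaluate(spacing: float) -> float:
    """Hypothetical black-box score for one planning-parameter value.

    In practice this would run the agent with the planner configured by
    `spacing` and return a measured performance. Here it is a toy function
    (an assumption for illustration) that peaks at spacing = 3.0.
    """
    return -(spacing - 3.0) ** 2


def pattern_search(lo: float, hi: float, start: float,
                   step: float, tol: float) -> float:
    """Derivative-free 1-D pattern search that maximizes `evaluate`.

    Probes one step to each side of the current best value, moves to any
    neighbor that scores higher, and halves the step when neither does.
    """
    best_x = start
    best_score = evaluate(best_x)
    while step > tol:
        improved = False
        for candidate in (best_x - step, best_x + step):
            if lo <= candidate <= hi:
                score = evaluate(candidate)
                if score > best_score:
                    best_x, best_score = candidate, score
                    improved = True
        if not improved:
            step /= 2.0  # no better neighbor: refine the search scale
    return best_x


if __name__ == "__main__":
    best = pattern_search(lo=0.5, hi=8.0, start=1.0, step=1.0, tol=1e-3)
    print(f"selected planning parameter: {best:.3f}")
```

A search of this kind needs only score evaluations, which is why it suits settings where the agent's performance as a function of the planning parameters is available only through rollouts.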

