Learning to Teach Reinforcement Learning Agents

07/28/2017
by   Anestis Fachantidis, et al.
0

In this article we study the transfer learning model of action advice under a budget. We focus on reinforcement learning teachers providing action advice to heterogeneous students playing the game of Pac-Man under a limited advice budget. First, we examine several critical factors affecting advice quality in this setting, such as the average performance of the teacher, its variance and the importance of reward discounting in advising. The experiments show the non-trivial importance of the coefficient of variation (CV) as a statistic for choosing policies that generate advice. The CV statistic relates variance to the corresponding mean. Second, the article studies policy learning for distributing advice under a budget. Whereas most methods in the relevant literature rely on heuristics for advice distribution we formulate the problem as a learning one and propose a novel RL algorithm capable of learning when to advise, adapting to the student and the task at hand. Furthermore, we argue that learning to advise under a budget is an instance of a more generic learning problem: Constrained Exploitation Reinforcement Learning.

READ FULL TEXT
research
11/29/2020

A Q-values Sharing Framework for Multiagent Reinforcement Learning under Budget Constraint

In teacher-student framework, a more experienced agent (teacher) helps a...
research
05/30/2019

Don't Forget Your Teacher: A Corrective Reinforcement Learning Framework

Although reinforcement learning (RL) can provide reliable solutions in m...
research
02/07/2020

Student/Teacher Advising through Reward Augmentation

Transfer learning is an important new subfield of multiagent reinforceme...
research
04/17/2021

Learning on a Budget via Teacher Imitation

Deep Reinforcement Learning (RL) techniques can benefit greatly from lev...
research
09/06/2023

Marketing Budget Allocation with Offline Constrained Deep Reinforcement Learning

We study the budget allocation problem in online marketing campaigns tha...
research
04/17/2021

Action Advising with Advice Imitation in Deep Reinforcement Learning

Action advising is a peer-to-peer knowledge exchange technique built on ...

Please sign up or login with your details

Forgot password? Click here to reset