Transfer Heterogeneous Knowledge Among Peer-to-Peer Teammates: A Model Distillation Approach

02/06/2020
by   Zeyue Xue, et al.
0

Peer-to-peer knowledge transfer in distributed environments has emerged as a promising method since it could accelerate learning and improve team-wide performance without relying on pre-trained teachers in deep reinforcement learning. However, for traditional peer-to-peer methods such as action advising, they have encountered difficulties in how to efficiently expressed knowledge and advice. As a result, we propose a brand new solution to reuse experiences and transfer value functions among multiple students via model distillation. But it is still challenging to transfer Q-function directly since it is unstable and not bounded. To address this issue confronted with existing works, we adopt Categorical Deep Q-Network. We also describe how to design an efficient communication protocol to exploit heterogeneous knowledge among multiple distributed agents. Our proposed framework, namely Learning and Teaching Categorical Reinforcement (LTCR), shows promising performance on stabilizing and accelerating learning progress with improved team-wide reward in four typical experimental environments.

READ FULL TEXT
research
03/07/2019

Learning Hierarchical Teaching in Cooperative Multiagent Reinforcement Learning

Heterogeneous knowledge naturally arises among different agents in coope...
research
12/09/2020

Robust Domain Randomised Reinforcement Learning through Peer-to-Peer Distillation

In reinforcement learning, domain randomisation is an increasingly popul...
research
04/19/2019

Teaching on a Budget in Multi-Agent Deep Reinforcement Learning

Deep Reinforcement Learning algorithms can solve complex sequential deci...
research
10/05/2022

On Neural Consolidation for Transfer in Reinforcement Learning

Although transfer learning is considered to be a milestone in deep reinf...
research
04/08/2022

How does online teamwork change student communication patterns in programming courses?

Online teaching has become a new reality due to the COVID-19 pandemic ra...
research
06/18/2023

"You might think about slightly revising the title": identifying hedges in peer-tutoring interactions

Hedges play an important role in the management of conversational intera...
research
04/17/2021

Action Advising with Advice Imitation in Deep Reinforcement Learning

Action advising is a peer-to-peer knowledge exchange technique built on ...

Please sign up or login with your details

Forgot password? Click here to reset