Auto-Agent-Distiller: Towards Efficient Deep Reinforcement Learning Agents via Neural Architecture Search

12/24/2020
by   Yonggan Fu, et al.
0

AlphaGo's astonishing performance has ignited an explosive interest in developing deep reinforcement learning (DRL) for numerous real-world applications, such as intelligent robotics. However, the often prohibitive complexity of DRL stands at the odds with the required real-time control and constrained resources in many DRL applications, limiting the great potential of DRL powered intelligent devices. While substantial efforts have been devoted to compressing other deep learning models, existing works barely touch the surface of compressing DRL. In this work, we first identify that there exists an optimal model size of DRL that can maximize both the test scores and efficiency, motivating the need for task-specific DRL agents. We therefore propose an Auto-Agent-Distiller (A2D) framework, which to our best knowledge is the first neural architecture search (NAS) applied to DRL to automatically search for the optimal DRL agents for various tasks that optimize both the test scores and efficiency. Specifically, we demonstrate that vanilla NAS can easily fail in searching for the optimal agents, due to its resulting high variance in DRL training stability, and then develop a novel distillation mechanism to distill the knowledge from both the teacher agent's actor and critic to stabilize the searching process and improve the searched agents' optimality. Extensive experiments and ablation studies consistently validate our findings and the advantages and general applicability of our A2D, outperforming manually designed DRL in both the test scores and efficiency. All the codes will be released upon acceptance.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/11/2021

A3C-S: Automated Agent Accelerator Co-Search towards Efficient Deep Reinforcement Learning

Driven by the explosive interest in applying deep reinforcement learning...
research
10/15/2020

Knowledge Transfer in Multi-Task Deep Reinforcement Learning for Continuous Control

While Deep Reinforcement Learning (DRL) has emerged as a promising appro...
research
06/15/2022

Search-Based Testing Approach for Deep Reinforcement Learning Agents

Deep Reinforcement Learning (DRL) algorithms have been increasingly empl...
research
04/15/2021

Quantum Architecture Search via Deep Reinforcement Learning

Recent advances in quantum computing have drawn considerable attention t...
research
05/22/2023

Testing of Deep Reinforcement Learning Agents with Surrogate Models

Deep Reinforcement Learning (DRL) has received a lot of attention from t...
research
09/18/2020

RLzoo: A Comprehensive and Adaptive Reinforcement Learning Library

Recently, we have seen a rapidly growing adoption of Deep Reinforcement ...
research
06/12/2020

Systematic Generalisation through Task Temporal Logic and Deep Reinforcement Learning

This paper presents a neuro-symbolic agent that combines deep reinforcem...

Please sign up or login with your details

Forgot password? Click here to reset