Fast and Data-Efficient Training of Rainbow: an Experimental Study on Atari

11/19/2021
by   Dominik Schmidt, et al.
0

Across the Arcade Learning Environment, Rainbow achieves a level of performance competitive with humans and modern RL algorithms. However, attaining this level of performance requires large amounts of data and hardware resources, making research in this area computationally expensive and use in practical applications often infeasible. This paper's contribution is threefold: We (1) propose an improved version of Rainbow, seeking to drastically reduce Rainbow's data, training time, and compute requirements while maintaining its competitive performance; (2) we empirically demonstrate the effectiveness of our approach through experiments on the Arcade Learning Environment, and (3) we conduct a number of ablation studies to investigate the effect of the individual proposed modifications. Our improved version of Rainbow reaches a median human normalized score close to classic Rainbow's, while using 20 times less data and requiring only 7.5 hours of training time on a single GPU. We also provide our full implementation including pre-trained models.

READ FULL TEXT
research
03/29/2020

Sample Efficient Ensemble Learning with Catalyst.RL

We present Catalyst.RL, an open-source PyTorch framework for reproducibl...
research
06/20/2023

Deep Fusion: Efficient Network Training via Pre-trained Initializations

In recent years, deep learning has made remarkable progress in a wide ra...
research
05/21/2020

Deep Reinforcement Learning with Pre-training for Time-efficient Training of Automatic Speech Recognition

Deep reinforcement learning (deep RL) is a combination of deep learning ...
research
03/08/2022

End-to-end Multiple Instance Learning with Gradient Accumulation

Being able to learn on weakly labeled data, and provide interpretability...
research
12/23/2018

Parallelized Interactive Machine Learning on Autonomous Vehicles

Deep reinforcement learning (deep RL) has achieved superior performance ...
research
11/01/2021

Human-Level Control without Server-Grade Hardware

Deep Q-Network (DQN) marked a major milestone for reinforcement learning...
research
03/13/2022

Context-LSTM: a robust classifier for video detection on UCF101

Video detection and human action recognition may be computationally expe...

Please sign up or login with your details

Forgot password? Click here to reset