Sample Efficient Reinforcement Learning through Learning from Demonstrations in Minecraft

03/12/2020
by   Christian Scheller, et al.
3

Sample inefficiency of deep reinforcement learning methods is a major obstacle for their use in real-world applications. In this work, we show how human demonstrations can improve final performance of agents on the Minecraft minigame ObtainDiamond with only 8M frames of environment interaction. We propose a training procedure where policy networks are first trained on human data and later fine-tuned by reinforcement learning. Using a policy exploitation mechanism, experience replay and an additional loss against catastrophic forgetting, our best agent was able to achieve a mean score of 48. Our proposed solution placed 3rd in the NeurIPS MineRL Competition for Sample-Efficient Reinforcement Learning.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/10/2020

The MineRL Competition on Sample-Efficient Reinforcement Learning Using Human Priors: A Retrospective

To facilitate research in the direction of sample-efficient reinforcemen...
research
04/01/2020

Obstacle Tower Without Human Demonstrations: How Far a Deep Feed-Forward Network Goes with Reinforcement Learning

The Obstacle Tower Challenge is the task to master a procedurally genera...
research
08/30/2020

Human-in-the-Loop Methods for Data-Driven and Reinforcement Learning Systems

Recent successes combine reinforcement learning algorithms and deep neur...
research
06/18/2021

Sample Efficient Social Navigation Using Inverse Reinforcement Learning

In this paper, we present an algorithm to efficiently learn socially-com...
research
11/26/2020

Predictive PER: Balancing Priority and Diversity towards Stable Deep Reinforcement Learning

Prioritized experience replay (PER) samples important transitions, rathe...
research
06/26/2022

Improving Policy Optimization with Generalist-Specialist Learning

Generalization in deep reinforcement learning over unseen environment va...
research
04/22/2019

The MineRL Competition on Sample Efficient Reinforcement Learning using Human Priors

Though deep reinforcement learning has led to breakthroughs in many diff...

Please sign up or login with your details

Forgot password? Click here to reset