Steven Hansen

research

∙ 10/25/2022

In-context Reinforcement Learning with Algorithm Distillation

We propose Algorithm Distillation (AD), a method for distilling reinforc...

1 Michael (Misha) Laskin, et al.

research

∙ 06/02/2022

Uniqueness and Complexity of Inverse MDP Models

What is the action sequence aa'a" that was likely responsible for reachi...

1 Marcus Hutter, et al.

research

∙ 10/28/2021

Wasserstein Distance Maximizing Intrinsic Control

This paper deals with the problem of learning a skill-conditioned policy...

0 Ishan Durugkar, et al.

research

∙ 07/29/2021

Learning more skills through optimistic exploration

Unsupervised skill learning objectives (Gregor et al., 2016, Eysenbach e...

7 DJ Strouse, et al.

research

∙ 12/14/2020

Relative Variational Intrinsic Control

In the absence of external rewards, agents can still learn useful behavi...

0 Kate Baumli, et al.

research

∙ 10/29/2019

Generalization of Reinforcement Learners with Working and Episodic Memory

Memory is an important aspect of intelligence and plays a role in many d...

26 Meire Fortunato, et al.

research

∙ 06/12/2019

Fast Task Inference with Variational Intrinsic Successor Features

It has been established that diverse behaviors spanning the controllable...

0 Steven Hansen, et al.

research

∙ 11/28/2018

Unsupervised Control Through Non-Parametric Discriminative Rewards

Learning to control an environment without hand-crafted rewards or exper...

0 David Warde-Farley, et al.

research

∙ 10/18/2018

Fast deep reinforcement learning using online adjustments from the past

We propose Ephemeral Value Adjustments (EVA): a means of allowing deep re...

0 Steven Hansen, et al.

