We propose Algorithm Distillation (AD), a method for distilling reinforc...
What is the action sequence aa'a" that was likely responsible for reachi...
This paper deals with the problem of learning a skill-conditioned policy...
Unsupervised skill learning objectives (Gregor et al., 2016, Eysenbach e...
In the absence of external rewards, agents can still learn useful behavi...
Memory is an important aspect of intelligence and plays a role in many d...
It has been established that diverse behaviors spanning the controllable...
Learning to control an environment without hand-crafted rewards or exper...
We propose Ephemeral Value Adjustments (EVA): a means of allowing deep
re...