While deep reinforcement learning (RL) has fueled multiple high-profile
...
Standard imitation learning can fail when the expert demonstrators have
...
Training a reinforcement learning (RL) agent on a real-world robotics ta...
Meta-gradients provide a general approach for optimizing the meta-parame...
Consistency is the theoretical property of a meta learning algorithm tha...
How much credit (or blame) should an action taken in a state get for a f...
In this work, we study auxiliary prediction tasks defined by
temporal-di...
Order dispatching and driver repositioning (also known as fleet manageme...
Model-agnostic meta-learners aim to acquire meta-learned parameters from...
Gradient-based meta-learners such as MAML are able to learn a meta-prior...
Using neural networks in practical settings would benefit from the abili...