Standard approaches to sequential decision-making exploit an agent's abi...
Actor-critic algorithms that make use of distributional policy evaluatio...
Many advances that have improved the robustness and efficiency of deep
r...
Offline reinforcement learning (RL), also known as batch RL, offers the
...
Deep reinforcement learning has led to many recent-and
groundbreaking-ad...
This paper introduces R2D3, an agent that makes efficient use of
demonst...
We propose a novel framework for the analysis of learning algorithms tha...
Bayesian optimization has recently emerged as a popular and efficient to...
Optimising black-box functions is important in many disciplines, such as...
We address the problem of finding the maximizer of a nonlinear smooth
fu...