In this work, we derive sharp non-asymptotic deviation bounds for weight...
We consider the problem of minimizing a non-convex function over a smoot...
We consider the reinforcement learning (RL) setting, in which the agent ...
We consider reinforcement learning in an environment modeled by an episo...
We propose the Bayes-UCBVI algorithm for reinforcement learning in tabul...
We consider the problem of learning the optimal policy for infinite-hori...
We study the computation of non-regularized Wasserstein barycenters of p...