Best Arm Identification (BAI) problems are progressively used for
data-s...
Collecting and leveraging data with good coverage properties plays a cru...
Optimistic algorithms have been extensively studied for regret minimizat...
In probably approximately correct (PAC) reinforcement learning (RL), an ...
We consider the problem introduced by <cit.> of identifying all the
ε-op...
We investigate the classical active pure exploration problem in Markov
D...
We investigate the problem of best-policy identification in discounted M...