We consider infinite horizon Markov decision processes (MDPs) with fast-...
Bayesian optimization (BO) is a sample-efficient approach to optimizing
...
Bayesian optimization is a sequential decision making framework for
opti...
We introduce the lookahead-bounded Q-learning (LBQL) algorithm, a new,
p...
The problem of exploration in unknown environments continues to pose a
c...
Bayesian optimization provides sample-efficient global optimization for ...
Inspired by recent successes of Monte-Carlo tree search (MCTS) in a numb...
Monte Carlo Tree Search (MCTS), most famously used in game-play artifici...
In this paper, we consider a finite-horizon Markov decision process (MDP...