Preference-Based Monte Carlo Tree Search

07/17/2018
by   Tobias Joppen, et al.
0

Monte Carlo tree search (MCTS) is a popular choice for solving sequential anytime problems. However, it depends on a numeric feedback signal, which can be difficult to define. Real-time MCTS is a variant which may only rarely encounter states with an explicit, extrinsic reward. To deal with such cases, the experimenter has to supply an additional numeric feedback signal in the form of a heuristic, which intrinsically guides the agent. Recent work has shown evidence that in different areas the underlying structure is ordinal and not numerical. Hence erroneous and biased heuristics are inevitable, especially in such domains. In this paper, we propose a MCTS variant which only depends on qualitative feedback, and therefore opens up new applications for MCTS. We also find indications that translating absolute into ordinal feedback may be beneficial. Using a puzzle domain, we show that our preference-based MCTS variant, wich only receives qualitative feedback, is able to reach a performance level comparable to a regular MCTS baseline, which obtains quantitative feedback.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/14/2019

Ordinal Monte Carlo Tree Search

In many problem settings, most notably in game playing, an agent receive...
research
05/31/2019

Ordinal Bucketing for Game Trees using Dynamic Quantile Approximation

In this paper, we present a simple and cheap ordinal bucketing algorithm...
research
02/26/2023

Towards Tackling MaxSAT by Combining Nested Monte Carlo with Local Search

Recent work proposed the UCTMAXSAT algorithm to address Maximum Satisfia...
research
05/06/2020

Learning, transferring, and recommending performance knowledge with Monte Carlo tree search and neural networks

Making changes to a program to optimize its performance is an unscalable...
research
04/24/2023

Combining Monte Carlo Tree Search and Heuristic Search for Weighted Vertex Coloring

This work investigates the Monte Carlo Tree Search (MCTS) method combine...
research
02/03/2022

On Monte Carlo Tree Search for Weighted Vertex Coloring

This work presents the first study of using the popular Monte Carlo Tree...
research
11/20/2022

Demon in the machine: learning to extract work and absorb entropy from fluctuating nanosystems

We use Monte Carlo and genetic algorithms to train neural-network feedba...

Please sign up or login with your details

Forgot password? Click here to reset