Learning Self-Game-Play Agents for Combinatorial Optimization Problems

03/08/2019
by   Ruiyang Xu, et al.
0

Recent progress in reinforcement learning (RL) using self-game-play has shown remarkable performance on several board games (e.g., Chess and Go) as well as video games (e.g., Atari games and Dota2). It is plausible to consider that RL, starting from zero knowledge, might be able to gradually approximate a winning strategy after a certain amount of training. In this paper, we explore neural Monte-Carlo-Tree-Search (neural MCTS), an RL algorithm which has been applied successfully by DeepMind to play Go and Chess at a super-human level. We try to leverage the computational power of neural MCTS to solve a class of combinatorial optimization problems. Following the idea of Hintikka's Game-Theoretical Semantics, we propose the Zermelo Gamification (ZG) to transform specific combinatorial optimization problems into Zermelo games whose winning strategies correspond to the solutions of the original optimization problem. The ZG also provides a specially designed neural MCTS. We use a combinatorial planning problem for which the ground-truth policy is efficiently computable to demonstrate that ZG is promising.

READ FULL TEXT
research
01/17/2021

Solving QSAT problems with neural MCTS

Recent achievements from AlphaZero using self-play has shown remarkable ...
research
07/04/2018

Ranked Reward: Enabling Self-Play Reinforcement Learning for Combinatorial Optimization

Adversarial self-play in two-player games has delivered impressive resul...
research
10/22/2020

Exploring search space trees using an adapted version of Monte Carlo tree search for combinatorial optimization problems

In this article, a novel approach to solve combinatorial optimization pr...
research
02/11/2022

Understanding Curriculum Learning in Policy Optimization for Solving Combinatorial Optimization Problems

Over the recent years, reinforcement learning (RL) has shown impressive ...
research
02/17/2023

Automated Graph Genetic Algorithm based Puzzle Validation for Faster Game Design

Many games are reliant on creating new and engaging content constantly t...
research
01/11/2021

First-Order Problem Solving through Neural MCTS based Reinforcement Learning

The formal semantics of an interpreted first-order logic (FOL) statement...
research
10/18/2020

Visibility Optimization for Surveillance-Evasion Games

We consider surveillance-evasion differential games, where a pursuer mus...

Please sign up or login with your details

Forgot password? Click here to reset