Combining Deep Reinforcement Learning and Search for Imperfect-Information Games

07/27/2020
by   Noam Brown, et al.
0

The combination of deep reinforcement learning and search at both training and test time is a powerful paradigm that has led to a number of a successes in single-agent settings and perfect-information games, best exemplified by the success of AlphaZero. However, algorithms of this form have been unable to cope with imperfect-information games. This paper presents ReBeL, a general framework for self-play reinforcement learning and search for imperfect-information games. In the simpler setting of perfect-information games, ReBeL reduces to an algorithm similar to AlphaZero. Results show ReBeL leads to low exploitability in benchmark imperfect-information games and achieves superhuman performance in heads-up no-limit Texas hold'em poker, while using far less domain knowledge than any prior poker AI. We also prove that ReBeL converges to a Nash equilibrium in two-player zero-sum games in tabular settings.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/03/2016

Deep Reinforcement Learning from Self-Play in Imperfect-Information Games

Many real-world applications can be described as large-scale games of im...
research
12/06/2021

Player of Games

Games have a long history of serving as a benchmark for progress in arti...
research
08/26/2019

OpenSpiel: A Framework for Reinforcement Learning in Games

OpenSpiel is a collection of environments and algorithms for research in...
research
08/30/2018

ExIt-OOS: Towards Learning from Planning in Imperfect Information Games

The current state of the art in playing many important perfect informati...
research
06/18/2020

DREAM: Deep Regret minimization with Advantage baselines and Model-free learning

We introduce DREAM, a deep reinforcement learning algorithm that finds o...
research
04/06/2022

DouZero+: Improving DouDizhu AI by Opponent Modeling and Coach-guided Learning

Recent years have witnessed the great breakthrough of deep reinforcement...
research
06/10/2021

Subgame solving without common knowledge

In imperfect-information games, subgame solving is significantly more ch...

Please sign up or login with your details

Forgot password? Click here to reset