Generating and Adapting to Diverse Ad-Hoc Cooperation Agents in Hanab

04/28/2020
by   Rodrigo Canaan, et al.
12

Hanabi is a cooperative game that brings the problem of modeling other players to the forefront. In this game, coordinated groups of players can leverage pre-established conventions to great effect, but playing in an ad-hoc setting requires agents to adapt to its partner's strategies with no previous coordination. Evaluating an agent in this setting requires a diverse population of potential partners, but so far, the behavioral diversity of agents has not been considered in a systematic way. This paper proposes Quality Diversity algorithms as a promising class of algorithms to generate diverse populations for this purpose, and generates a population of diverse Hanabi agents using MAP-Elites. We also postulate that agents can benefit from a diverse population during training and implement a simple "meta-strategy" for adapting to an agent's perceived behavioral niche. We show this meta-strategy can work better than generalist strategies even outside the population it was trained with if its partner's behavioral niche can be correctly inferred, but in practice a partner's behavior depends and interferes with the meta-agent's own behavior, suggesting an avenue for future research in characterizing another agent's behavior during gameplay.

READ FULL TEXT

page 6

page 9

research
04/28/2020

Generating and Adapting to Diverse Ad-Hoc Cooperation Agents in Hanabi

Hanabi is a cooperative game that brings the problem of modeling other p...
research
07/08/2019

Diverse Agents for Ad-Hoc Cooperation in Hanabi

In complex scenarios where a model of other actors is necessary to predi...
research
03/12/2023

Behavioral Differences is the Key of Ad-hoc Team Cooperation in Multiplayer Games Hanabi

Ad-hoc team cooperation is the problem of cooperating with other players...
research
08/18/2023

Minimum Coverage Sets for Training Robust Ad Hoc Teamwork Agents

Robustly cooperating with unseen agents and human partners presents sign...
research
04/28/2020

Evaluating the Rainbow DQN Agent in Hanabi with Unseen Partners

Hanabi is a cooperative game that challenges exist-ing AI techniques due...
research
02/10/2019

Learning Best Response Strategies for Agents in Ad Exchanges

Ad exchanges are widely used in platforms for online display advertising...
research
05/15/2018

Complexity Reduction in the Negotiation of New Lexical Conventions

In the process of collectively inventing new words for new con- cepts in...

Please sign up or login with your details

Forgot password? Click here to reset