E-HBA: Using Action Policies for Expert Advice and Agent Typification

07/23/2019
by   Stefano V. Albrecht, et al.
1

Past research has studied two approaches to utilise predefined policy sets in repeated interactions: as experts, to dictate our own actions, and as types, to characterise the behaviour of other agents. In this work, we bring these complementary views together in the form of a novel meta-algorithm, called Expert-HBA (E-HBA), which can be applied to any expert algorithm that considers the average (or total) payoff an expert has yielded in the past. E-HBA gradually mixes the past payoff with a predicted future payoff, which is computed using the type-based characterisation. We present results from a comprehensive set of repeated matrix games, comparing the performance of several well-known expert algorithms with and without the aid of E-HBA. Our results show that E-HBA has the potential to significantly improve the performance of expert algorithms.

READ FULL TEXT
research
04/26/2020

Predicting Plans and Actions in Two-Player Repeated Games

Artificial intelligence (AI) agents will need to interact with both othe...
research
10/02/2017

The Strategy of Experts for Repeated Predictions

We investigate the behavior of experts who seek to make predictions with...
research
05/11/2021

Designing an Automatic Agent for Repeated Language based Persuasion Games

Persuasion games are fundamental in economics and AI research and serve ...
research
07/07/2021

Episodic Bandits with Stochastic Experts

We study a version of the contextual bandit problem where an agent is gi...
research
06/01/2023

Ten Steps to Becoming a Musculoskeletal Simulation Expert: A Half-Century of Progress and Outlook for the Future

Over the past half-century, musculoskeletal simulations have deepened ou...
research
07/15/2019

On Convergence and Optimality of Best-Response Learning with Policy Types in Multiagent Systems

While many multiagent algorithms are designed for homogeneous systems (i...
research
07/17/2023

Meta-Value Learning: a General Framework for Learning with Learning Awareness

Gradient-based learning in multi-agent systems is difficult because the ...

Please sign up or login with your details

Forgot password? Click here to reset