Meta-trained agents implement Bayes-optimal agents

10/21/2020
by   Vladimir Mikulik, et al.
8

Memory-based meta-learning is a powerful technique to build agents that adapt fast to any task within a target distribution. A previous theoretical study has argued that this remarkable performance is because the meta-training protocol incentivises agents to behave Bayes-optimally. We empirically investigate this claim on a number of prediction and bandit tasks. Inspired by ideas from theoretical computer science, we show that meta-learned and Bayes-optimal agents not only behave alike, but they even share a similar computational structure, in the sense that one agent system can approximately simulate the other. Furthermore, we show that Bayes-optimal agents are fixed points of the meta-learning dynamics. Our results suggest that memory-based meta-learning might serve as a general technique for numerically approximating Bayes-optimal agents - that is, even for task distributions for which we currently don't possess tractable models.

READ FULL TEXT

page 7

page 17

page 20

page 21

page 22

page 23

research
05/08/2019

Meta-learning of Sequential Strategies

In this report we review memory-based meta-learning as a tool for buildi...
research
02/06/2023

Memory-Based Meta-Learning on Non-Stationary Distributions

Memory-based meta-learning is a technique for approximating Bayes-optima...
research
09/30/2022

Beyond Bayes-optimality: meta-learning what you know you don't know

Meta-training agents with memory has been shown to culminate in Bayes-op...
research
03/03/2021

Meta-Learning with Variational Bayes

The field of meta-learning seeks to improve the ability of today's machi...
research
10/07/2019

Meta-Learning Deep Energy-Based Memory Models

We study the problem of learning associative memory – a system which is ...
research
01/11/2021

Deep Interactive Bayesian Reinforcement Learning via Meta-Learning

Agents that interact with other agents often do not know a priori what t...
research
10/06/2020

Dif-MAML: Decentralized Multi-Agent Meta-Learning

The objective of meta-learning is to exploit the knowledge obtained from...

Please sign up or login with your details

Forgot password? Click here to reset