Control as Hybrid Inference

07/11/2020
by Alexander Tschantz, et al.

The field of reinforcement learning can be split into model-based and model-free methods. Here, we unify these approaches by casting model-free policy optimisation as amortised variational inference, and model-based planning as iterative variational inference, within a 'control as hybrid inference' (CHI) framework. We present an implementation of CHI which naturally mediates the balance between iterative and amortised inference. Using a didactic experiment, we demonstrate that the proposed algorithm operates in a model-based manner at the onset of learning, before converging to a model-free algorithm once sufficient data have been collected. We verify the scalability of our algorithm on a continuous control benchmark, demonstrating that it outperforms strong model-free and model-based baselines. CHI thus provides a principled framework for harnessing the sample efficiency of model-based planning while retaining the asymptotic performance of model-free policy optimisation.
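The abstract does not spell out the implementation, so the following is only a toy sketch of how an amortised proposal and an iterative refinement step might be combined. Everything in it is an assumption made for illustration: the linear policy `amortised_proposal`, the one-step model in `model_return`, and the use of a cross-entropy-method style loop as a stand-in for iterative variational inference. It is not the authors' algorithm.

```python
import numpy as np

def amortised_proposal(state, weight, std=1.0):
    """Hypothetical amortised policy: one cheap pass gives a Gaussian over actions."""
    return weight * state, std

def model_return(state, actions):
    """Toy model (assumed): next_state = state + action, reward = -next_state**2."""
    return -(state + actions) ** 2

def iterative_refinement(state, mean, std, n_iters=5, n_samples=64, n_elite=8, rng=None):
    """Refine the Gaussian by repeatedly refitting it to the best sampled actions."""
    rng = np.random.default_rng(0) if rng is None else rng
    for _ in range(n_iters):
        actions = rng.normal(mean, std, size=n_samples)
        returns = model_return(state, actions)
        elite = actions[np.argsort(returns)[-n_elite:]]  # highest-return samples
        mean, std = elite.mean(), elite.std() + 1e-6
    return mean, std

def hybrid_action(state, weight, n_iters=5):
    """Start from the amortised proposal, then refine it under the model."""
    mean, std = amortised_proposal(state, weight)
    return iterative_refinement(state, mean, std, n_iters=n_iters)

state = 2.0
# Untrained policy (weight 0): the proposal is poor, so refinement does the work.
untrained_mean, _ = hybrid_action(state, weight=0.0)
# Well-trained policy (weight -1): the proposal is already optimal, no refinement needed.
trained_mean, _ = hybrid_action(state, weight=-1.0, n_iters=0)
```

In this toy setting the best action is the negative of the state. With a poor proposal the refinement loop recovers it from the model, which loosely mirrors the model-based behaviour described for early learning; with a good proposal the refinement can be skipped, which loosely mirrors the model-free regime.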

research
12/08/2019

Value-of-Information based Arbitration between Model-based and Model-free Control

There have been numerous attempts in explaining the general learning beh...
research
06/13/2020

Reinforcement Learning as Iterative and Amortised Inference

There are several ways to categorise reinforcement learning (RL) algorit...
research
10/22/2020

Optimising Stochastic Routing for Taxi Fleets with Model Enhanced Reinforcement Learning

The future of Mobility-as-a-Service (MaaS) should embrace an integrated s...
research
11/30/2020

An accelerated hybrid data-driven/model-based approach for poroelasticity problems with multi-fidelity multi-physics data

We present a hybrid model/model-free data-driven approach to solve poroe...
research
02/25/2019

Learning Extreme Hummingbird Maneuvers on Flapping Wing Robots

Biological studies show that hummingbirds can perform extreme aerobatic ...
research
09/05/2023

Model-agnostic network inference enhancement from noisy measurements via curriculum learning

Noise is a pervasive element within real-world measurement data, signifi...
research
12/03/2019

Adaptive Online Planning for Continual Lifelong Learning

We study learning control in an online lifelong learning scenario, where...
