Multi-Fidelity Reinforcement Learning with Gaussian Processes

12/18/2017
by   Varun Suryan, et al.
0

This paper studies the problem of Reinforcement Learning (RL) using as few real-world samples as possible. A naive application of RL algorithms can be inefficient in large and continuous state spaces. We present two versions of Multi-Fidelity Reinforcement Learning (MFRL) algorithm that leverage Gaussian Processes (GPs) to learn the optimal policy in a real-world environment. In MFRL framework, an agent uses multiple simulators of the real environment to perform actions. With increasing fidelity in a simulator chain, the number of samples used in successively higher simulators can be reduced. By incorporating GPs in MFRL framework, further reduction in the number of learning samples can be achieved as we move up the simulator chain. We examine the performance of our algorithms with the help of real-world experiments for navigation with a ground robot.

READ FULL TEXT

page 5

page 6

research
07/07/2022

gym-DSSAT: a crop model turned into a Reinforcement Learning environment

Addressing a real world sequential decision problem with Reinforcement L...
research
06/10/2022

Multifidelity Reinforcement Learning with Control Variates

In many computational science and engineering applications, the output o...
research
01/13/2021

Continuous Deep Q-Learning with Simulator for Stabilization of Uncertain Discrete-Time Systems

Applications of reinforcement learning (RL) to stabilization problems of...
research
07/26/2022

Semi-analytical Industrial Cooling System Model for Reinforcement Learning

We present a hybrid industrial cooling system model that embeds analytic...
research
12/21/2021

Off Environment Evaluation Using Convex Risk Minimization

Applying reinforcement learning (RL) methods on robots typically involve...
research
04/22/2019

The MineRL Competition on Sample Efficient Reinforcement Learning using Human Priors

Though deep reinforcement learning has led to breakthroughs in many diff...
research
07/29/2021

Non-Markovian Reinforcement Learning using Fractional Dynamics

Reinforcement learning (RL) is a technique to learn the control policy f...

Please sign up or login with your details

Forgot password? Click here to reset