Fleet Control using Coregionalized Gaussian Process Policy Iteration

11/22/2019
by   Timothy Verstraeten, et al.
0

In many settings, as for example wind farms, multiple machines are instantiated to perform the same task, which is called a fleet. The recent advances with respect to the Internet of Things allow control devices and/or machines to connect through cloud-based architectures in order to share information about their status and environment. Such an infrastructure allows seamless data sharing between fleet members, which could greatly improve the sample-efficiency of reinforcement learning techniques. However in practice, these machines, while almost identical in design, have small discrepancies due to production errors or degradation, preventing control algorithms to simply aggregate and employ all fleet data. We propose a novel reinforcement learning method that learns to transfer knowledge between similar fleet members and creates member-specific dynamics models for control. Our algorithm uses Gaussian processes to establish cross-member covariances. This is significantly different from standard transfer learning methods, as the focus is not on sharing information over tasks, but rather over system specifications. We demonstrate our approach on two benchmarks and a realistic wind farm setting. Our method significantly outperforms two baseline approaches, namely individual learning and joint learning where all samples are aggregated, in terms of the median and variance of the results.

READ FULL TEXT
research
11/02/2020

Sample-efficient reinforcement learning using deep Gaussian processes

Reinforcement learning provides a framework for learning to control whic...
research
03/27/2022

Optimizing Airborne Wind Energy with Reinforcement Learning

Airborne Wind Energy is a lightweight technology that allows power extra...
research
09/09/2016

Dialogue manager domain adaptation using Gaussian process reinforcement learning

Spoken dialogue systems allow humans to interact with machines using nat...
research
05/24/2019

InfoRL: Interpretable Reinforcement Learning using Information Maximization

Recent advances in reinforcement learning have proved that given an envi...
research
09/03/2019

Generalization in Transfer Learning

Agents trained with deep reinforcement learning algorithms are capable o...
research
10/05/2021

Improved Reinforcement Learning Coordinated Control of a Mobile Manipulator using Joint Clamping

Many robotic path planning problems are continuous, stochastic, and high...

Please sign up or login with your details

Forgot password? Click here to reset