A Tour of Reinforcement Learning: The View from Continuous Control

06/25/2018
by   Benjamin Recht, et al.
14

This manuscript surveys reinforcement learning from the perspective of optimization and control with a focus on continuous control applications. It surveys the general formulation, terminology, and typical experimental implementations of reinforcement learning and reviews competing solution paradigms. In order to compare the relative merits of various techniques, this survey presents a case study of the Linear Quadratic Regulator (LQR) with unknown dynamics, perhaps the simplest and best studied problem in optimal control. The manuscript describes how merging techniques from learning theory and control can provide non-asymptotic characterizations of LQR performance and shows that these characterizations tend to match experimental behavior. In turn, when revisiting more complex applications, many of the observed phenomena in LQR persist. In particular, theory and experiment demonstrate the role and importance of models and the cost of generality in reinforcement learning algorithms. This survey concludes with a discussion of some of the challenges in designing learning systems that safely and reliably interact with complex and uncertain environments and how tools from reinforcement learning and controls might be combined to approach these challenges.

READ FULL TEXT
research
12/14/2022

Quantum Control based on Deep Reinforcement Learning

In this thesis, we consider two simple but typical control problems and ...
research
10/10/2022

Towards a Theoretical Foundation of Policy Optimization for Learning Control Policies

Gradient-based methods have been widely used for system design and optim...
research
03/19/2023

Reinforcement Learning-supported AB Testing of Business Process Improvements: An Industry Perspective

In order to better facilitate the need for continuous business process i...
research
11/03/2022

Reinforcement Learning in Non-Markovian Environments

Following the novel paradigm developed by Van Roy and coauthors for rein...
research
07/06/2021

Survey of Self-Play in Reinforcement Learning

In reinforcement learning (RL), the term self-play describes a kind of m...
research
11/02/2020

Exact Asymptotics for Linear Quadratic Adaptive Control

Recent progress in reinforcement learning has led to remarkable performa...
research
12/11/2019

Efficacy of Modern Neuro-Evolutionary Strategies for Continuous Control Optimization

We analyze the efficacy of modern neuro-evolutionary strategies for cont...

Please sign up or login with your details

Forgot password? Click here to reset