Neural Optimal Control using Learned System Dynamics

02/20/2023
by   Selim Engin, et al.
0

We study the problem of generating control laws for systems with unknown dynamics. Our approach is to represent the controller and the value function with neural networks, and to train them using loss functions adapted from the Hamilton-Jacobi-Bellman (HJB) equations. In the absence of a known dynamics model, our method first learns the state transitions from data collected by interacting with the system in an offline process. The learned transition function is then integrated to the HJB equations and used to forward simulate the control signals produced by our controller in a feedback loop. In contrast to trajectory optimization methods that optimize the controller for a single initial state, our controller can generate near-optimal control signals for initial states from a large portion of the state space. Compared to recent model-based reinforcement learning algorithms, we show that our method is more sample efficient and trains faster by an order of magnitude. We demonstrate our method in a number of tasks, including the control of a quadrotor with 12 state variables.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/13/2019

HJB Optimal Feedback Control with Deep Differential Value Functions and Action Constraints

Learning optimal feedback control laws capable of executing optimal traj...
research
12/19/2019

Numerical Optimal Control of HIV Transmission in Octave/MATLAB

We provide easy and readable GNU Octave/MATLAB code for the simulation o...
research
10/02/2020

Memory Clustering using Persistent Homology for Multimodality- and Discontinuity-Sensitive Learning of Optimal Control Warm-starts

Shooting methods are an efficient approach to solving nonlinear optimal ...
research
03/07/2019

RLOC: Neurobiologically Inspired Hierarchical Reinforcement Learning Algorithm for Continuous Control of Nonlinear Dynamical Systems

Nonlinear optimal control problems are often solved with numerical metho...
research
04/12/2022

Near-Optimal Distributed Linear-Quadratic Regulator for Networked Systems

This paper studies the trade-off between the degree of decentralization ...
research
02/02/2023

Faster Consensus via Sparser Controller

In this paper, we investigate the architecture of an optimal controller ...
research
02/13/2019

Using Approximate Models in Robot Learning

Trajectory following is one of the complicated control problems when its...

Please sign up or login with your details

Forgot password? Click here to reset