Temporal Parallelisation of the HJB Equation and Continuous-Time Linear Quadratic Control

12/22/2022
by   Simo Särkkä, et al.
0

This paper presents a mathematical formulation to perform temporal parallelisation of continuous-time optimal control problems, which are solved via the Hamilton–Jacobi–Bellman (HJB) equation. We divide the time interval of the control problem into sub-intervals, and define a control problem in each sub-interval, conditioned on the start and end states, leading to conditional value functions for the sub-intervals. By defining an associative operator as the minimisation of the sum of conditional value functions, we obtain the elements and associative operators for a parallel associative scan operation. This allows for solving the optimal control problem on the whole time interval in parallel in logarithmic time complexity in the number of sub-intervals. We derive the HJB-type of backward and forward equations for the conditional value functions and solve them in closed form for linear quadratic problems. We also discuss other numerical methods for computing the conditional value functions and present closed form solutions for selected special cases. The computational advantages of the proposed parallel methods are demonstrated via simulations run on a multi-core central processing unit and a graphics processing unit.

READ FULL TEXT

page 1

page 16

research
04/07/2021

Temporal Parallelisation of Dynamic Programming and Linear Quadratic Control

This paper proposes a method for temporal parallelisation of dynamic pro...
research
01/27/2022

Towards Data-driven LQR with KoopmanizingFlows

We propose a novel framework for learning linear time-invariant (LTI) mo...
research
12/15/2019

Symplectic Runge-Kutta discretization of a regularized forward-backward sweep iteration for optimal control problems

Li, Chen, Tai E. (J. Machine Learning Research, 2018) have proposed ...
research
02/18/2021

Distributed Algorithms for Linearly-Solvable Optimal Control in Networked Multi-Agent Systems

Distributed algorithms for both discrete-time and continuous-time linear...
research
12/16/2016

Computing solutions of linear Mahler equations

Mahler equations relate evaluations of the same function f at iterated b...
research
12/02/2020

Computing bounds for imprecise continuous-time Markov chains using normal cones

The theory of imprecise Markov chains has achieved significant progress ...
research
11/01/2017

Determination of Checkpointing Intervals for Malleable Applications

Selecting optimal intervals of checkpointing an application is important...

Please sign up or login with your details

Forgot password? Click here to reset