Actor-Critic Method for High Dimensional Static Hamilton–Jacobi–Bellman Partial Differential Equations based on Neural Networks

02/22/2021
by Mo Zhou, et al.

We propose a novel numerical method for high-dimensional Hamilton–Jacobi–Bellman (HJB) type elliptic partial differential equations (PDEs). The HJB PDEs, reformulated as optimal control problems, are solved within an actor-critic framework inspired by reinforcement learning, in which the value and control functions are parametrized by neural networks. Within this framework, we use a policy gradient approach to improve the control; for the value function, we derive a variance-reduced least-squares temporal difference (VR-LSTD) method using stochastic calculus. To discretize the stochastic control problem numerically, we employ an adaptive step-size scheme that improves accuracy near the domain boundary. Numerical examples in up to 20 spatial dimensions, including linear quadratic regulators, stochastic Van der Pol oscillators, and diffusive Eikonal equations, validate the effectiveness of the proposed method.
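The abstract does not spell out the adaptive step-size scheme, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes a controlled diffusion dX = u(X) dt + sigma dW on a ball of a given radius and shrinks the Euler-Maruyama step as the state approaches the boundary, so that the exit point is resolved more accurately. The function names (`adaptive_dt`, `simulate_exit`), the step-size rule, and all default parameters are hypothetical choices made for illustration.

```python
import math
import random


def adaptive_dt(dist_to_boundary, sigma, dt_max=1e-2, dt_min=1e-5):
    """Pick a smaller time step when the state is close to the boundary.

    Heuristic (an assumption, not the paper's rule): a Brownian increment
    has size about sigma * sqrt(dt), so choose dt such that this is about
    half of the remaining distance, then clip to [dt_min, dt_max].
    """
    dt = (0.5 * dist_to_boundary / sigma) ** 2
    return min(dt_max, max(dt_min, dt))


def simulate_exit(x0, control, sigma=1.0, radius=1.0, rng=None,
                  max_steps=1_000_000):
    """Euler-Maruyama for dX = u(X) dt + sigma dW until |X| >= radius.

    Returns (exit_time, exit_state, number_of_steps). If max_steps is
    reached first, the current time and state are returned instead.
    """
    rng = rng or random.Random(0)
    x = list(x0)
    t = 0.0
    for step in range(max_steps):
        r = math.sqrt(sum(xi * xi for xi in x))
        if r >= radius:
            return t, x, step
        dt = adaptive_dt(radius - r, sigma)  # finer steps near the boundary
        u = control(x)                       # hypothetical feedback control
        noise_scale = sigma * math.sqrt(dt)
        x = [xi + ui * dt + noise_scale * rng.gauss(0.0, 1.0)
             for xi, ui in zip(x, u)]
        t += dt
    return t, x, max_steps


if __name__ == "__main__":
    # Uncontrolled Brownian motion started at the centre of the unit disc.
    exit_time, exit_state, n_steps = simulate_exit(
        [0.0, 0.0], lambda x: [0.0, 0.0], rng=random.Random(1))
    print(exit_time, exit_state, n_steps)
```

In an actor-critic setting, `control` would be replaced by the neural-network policy, and trajectories like these would supply the samples for the policy gradient and value-function updates. Clipping at `dt_min` guarantees that the path leaves the domain in finitely many steps, at the cost of a small overshoot past the boundary.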

research
10/07/2020

Actor-Critic Algorithm for High-dimensional Partial Differential Equations

We develop a deep learning model to effectively solve high-dimensional n...
research
03/13/2023

Actor-Critic learning for mean-field control in continuous time

We study policy gradient for mean-field control in continuous time in a ...
research
04/18/2022

On Parametric Optimal Execution and Machine Learning Surrogates

We investigate optimal execution problems with instantaneous price impac...
research
05/18/2023

Actor-Critic Methods using Physics-Informed Neural Networks: Control of a 1D PDE Model for Fluid-Cooled Battery Packs

This paper proposes an actor-critic algorithm for controlling the temper...
research
05/13/2018

General solutions for nonlinear differential equations: a deep reinforcement learning approach

Physicists use differential equations to describe the physical dynamical...
research
01/25/2023

Distributed Control of Partial Differential Equations Using Convolutional Reinforcement Learning

We present a convolutional framework which significantly reduces the com...
research
05/02/2010

Adaptive Bases for Reinforcement Learning

We consider the problem of reinforcement learning using function approxi...
