Actor-Critic Methods using Physics-Informed Neural Networks: Control of a 1D PDE Model for Fluid-Cooled Battery Packs

05/18/2023
by   Amartya Mukherjee, et al.
0

This paper proposes an actor-critic algorithm for controlling the temperature of a battery pack using a cooling fluid. This is modeled by a coupled 1D partial differential equation (PDE) with a controlled advection term that determines the speed of the cooling fluid. The Hamilton-Jacobi-Bellman (HJB) equation is a PDE that evaluates the optimality of the value function and determines an optimal controller. We propose an algorithm that treats the value network as a Physics-Informed Neural Network (PINN) to solve for the continuous-time HJB equation rather than a discrete-time Bellman optimality equation, and we derive an optimal controller for the environment that we exploit to achieve optimal control. Our experiments show that a hybrid-policy method that updates the value network using the HJB equation and updates the policy network identically to PPO achieves the best results in the control of this PDE system.

READ FULL TEXT

page 15

page 16

page 17

research
10/07/2020

Actor-Critic Algorithm for High-dimensional Partial Differential Equations

We develop a deep learning model to effectively solve high-dimensional n...
research
09/11/2019

Generalized Policy Iteration for Optimal Control in Continuous Time

This paper proposes the Deep Generalized Policy Iteration (DGPI) algorit...
research
02/22/2021

Actor-Critic Method for High Dimensional Static Hamilton–Jacobi–Bellman Partial Differential Equations based on Neural Networks

We propose a novel numerical method for high dimensional Hamilton–Jacobi...
research
04/02/2021

Distributional Offline Continuous-Time Reinforcement Learning with Neural Physics-Informed PDEs (SciPhy RL for DOCTR-L)

This paper addresses distributional offline continuous-time reinforcemen...
research
03/15/2022

Physics-Informed Neural Networks with Adaptive Localized Artificial Viscosity

Physics-informed Neural Network (PINN) is a promising tool that has been...
research
05/02/2022

Chance-Constrained Stochastic Optimal Control via Path Integral and Finite Difference Methods

This paper addresses a continuous-time continuous-space chance-constrain...
research
11/24/2013

Off-policy reinforcement learning for H_∞ control design

The H_∞ control design problem is considered for nonlinear systems with ...

Please sign up or login with your details

Forgot password? Click here to reset