Constrained Model-Free Reinforcement Learning for Process Optimization

11/16/2020
by Elton Pan, et al.

Reinforcement learning (RL) is a control approach that can handle nonlinear stochastic optimal control problems. However, despite this promise, RL has yet to see significant adoption in industrial practice, primarily because of its inability to satisfy state constraints. In this work we aim to address this challenge. We propose an 'oracle'-assisted constrained Q-learning algorithm that guarantees the satisfaction of joint chance constraints with high probability, which is crucial for safety-critical tasks. To achieve this, constraint tightenings (backoffs) are introduced and adjusted using Broyden's method, making them self-tuning. The result is a general methodology that can be incorporated into approximate dynamic programming-based algorithms to ensure constraint satisfaction with high probability. Finally, we present case studies that analyze the performance of the proposed approach and compare it with model predictive control (MPC). The favorable performance of this algorithm is a step toward the incorporation of RL into real-world optimization and control of engineering systems, where constraints are essential for ensuring safety.
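
The backoff idea in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the toy closed loop, the residual definition (the (1 - delta) empirical quantile of each constraint value, treated per constraint rather than jointly), and the names `constraint_quantiles` and `broyden_tune` are all assumptions made for illustration. The sketch assumes a policy that tracks the tightened limit, so each constraint value is the negative backoff plus a disturbance, and it uses Broyden's method to find backoffs at which the chosen quantile sits at zero.

```python
import numpy as np

def constraint_quantiles(backoffs, noise, sigma, delta):
    """Hypothetical residual F(b): (1 - delta) quantile of each constraint value.

    Toy assumption: the policy tracks the tightened limit, so the constraint
    value is g_i = -b_i + disturbance_i, and g_i <= 0 means "satisfied".
    """
    scale = sigma * (1.0 + 0.1 * backoffs)  # assumed: disturbance grows mildly with backoff
    g = -backoffs + scale * noise           # shape (n_samples, n_constraints)
    return np.quantile(g, 1.0 - delta, axis=0)

def broyden_tune(residual, b0, tol=1e-8, max_iter=50):
    """Solve residual(b) = 0 with Broyden's (rank-one update) method."""
    b = np.asarray(b0, dtype=float)
    f = residual(b)
    B = -np.eye(b.size)  # initial Jacobian guess: one unit of backoff lowers g by one unit
    for _ in range(max_iter):
        if np.linalg.norm(f) < tol:
            break
        step = np.linalg.solve(B, -f)       # quasi-Newton step
        b_new = b + step
        f_new = residual(b_new)
        # Rank-one secant update of the Jacobian approximation.
        B += np.outer(f_new - f - B @ step, step) / (step @ step)
        b, f = b_new, f_new
    return b

# Fixed disturbance samples (common random numbers) keep the residual deterministic.
rng = np.random.default_rng(0)
noise = rng.standard_normal((5000, 2))
sigma = np.array([0.5, 1.0])  # assumed disturbance scales for two constraints
delta = 0.05                  # allowed violation probability per constraint

def residual(b):
    return constraint_quantiles(b, noise, sigma, delta)

backoffs = broyden_tune(residual, np.zeros(2))
g = -backoffs + sigma * (1.0 + 0.1 * backoffs) * noise
satisfied = (g <= 0).mean(axis=0)  # empirical satisfaction rate per constraint
print("backoffs:", backoffs, "satisfaction:", satisfied)
```

In this toy setting the tuned backoffs make each constraint hold in roughly 95% of the sampled disturbances. In an RL setting the residual would instead be estimated from closed-loop rollouts of the learned policy, which is considerably noisier than this fixed-sample stand-in.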


research
07/30/2020

Chance Constrained Policy Optimization for Process Control and Optimization

Chemical process optimization and control are affected by 1) plant-model...
research
06/26/2019

Approximate Dynamic Programming For Linear Systems with State and Input Constraints

Enforcing state and input constraints during reinforcement learning (RL)...
research
05/31/2019

Extending Deep Model Predictive Control with Safety Augmented Value Estimation from Demonstrations

Reinforcement learning (RL) for robotics is challenging due to the diffi...
research
06/04/2020

Constrained Reinforcement Learning for Dynamic Optimization under Uncertainty

Dynamic real-time optimization (DRTO) is a challenging task due to the f...
research
02/17/2021

Separated Proportional-Integral Lagrangian for Chance Constrained Reinforcement Learning

Safety is essential for reinforcement learning (RL) applied in real-worl...
research
04/20/2021

Model-predictive control and reinforcement learning in multi-energy system case studies

Model-predictive-control (MPC) offers an optimal control technique to es...
research
09/10/2023

Signal Temporal Logic Neural Predictive Control

Ensuring safety and meeting temporal specifications are critical challen...
