A New Optimal Stepsize For Approximate Dynamic Programming

07/10/2014
by   Ilya O. Ryzhov, et al.
0

Approximate dynamic programming (ADP) has proven itself in a wide range of applications spanning large-scale transportation problems, health care, revenue management, and energy systems. The design of effective ADP algorithms has many dimensions, but one crucial factor is the stepsize rule used to update a value function approximation. Many operations research applications are computationally intensive, and it is important to obtain good results quickly. Furthermore, the most popular stepsize formulas use tunable parameters and can produce very poor results if tuned improperly. We derive a new stepsize rule that optimizes the prediction error in order to improve the short-term performance of an ADP algorithm. With only one, relatively insensitive tunable parameter, the new rule adapts to the level of noise in the problem and produces faster convergence in numerical experiments.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/17/2013

Evolutionary Algorithms and Dynamic Programming

Recently, it has been proven that evolutionary algorithms produce good r...
research
02/26/2019

Approximate Dynamic Programming with Neural Networks in Linear Discrete Action Spaces

Real-world problems of operations research are typically high-dimensiona...
research
09/28/2021

The Role of Lookahead and Approximate Policy Evaluation in Policy Iteration with Linear Value Function Approximation

When the sizes of the state and action spaces are large, solving MDPs ca...
research
05/08/2012

Approximate Dynamic Programming By Minimizing Distributionally Robust Bounds

Approximate dynamic programming is a popular method for solving large Ma...
research
08/06/2021

HJB-RBF based approach for the control of PDEs

Semi-lagrangian schemes for discretization of the dynamic programming pr...
research
01/23/2023

A New Dynamic Programming Approach for Spanning Trees with Chain Constraints and Beyond

Short spanning trees subject to additional constraints are important bui...

Please sign up or login with your details

Forgot password? Click here to reset