Widest Paths and Global Propagation in Bounded Value Iteration for Stochastic Games

07/15/2020
by Kittiphon Phalakarn, et al.

Solving stochastic games with the reachability objective is a fundamental problem, especially in quantitative verification and synthesis. For this purpose, bounded value iteration (BVI) attracts attention as an efficient iterative method. However, BVI's performance is often impeded by costly end component (EC) computation that is needed to ensure convergence. Our contribution is a novel BVI algorithm that conducts, in addition to local propagation by the Bellman update that is typical of BVI, global propagation of upper bounds that is not hindered by ECs. To conduct global propagation in a computationally tractable manner, we construct a weighted graph and solve the widest path problem in it. Our experiments show the algorithm's performance advantage over the previous BVI algorithms that rely on EC computation.
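For readers unfamiliar with the widest path problem mentioned above: it asks for a path between two nodes that maximizes the smallest edge weight along it. The sketch below is a generic Dijkstra-style solver in Python, not the authors' graph construction or their BVI algorithm; the function name `widest_path_widths`, the adjacency-list format, and the example weights are all assumptions made for illustration.

```python
import heapq


def widest_path_widths(graph, source):
    """Return the best bottleneck width from `source` to each reachable node.

    `graph` maps a node to a list of (neighbor, weight) pairs (hypothetical
    format). The width of a path is the minimum edge weight along it.
    """
    width = {source: float("inf")}      # the empty path has no bottleneck
    heap = [(-width[source], source)]   # max-heap emulated with negated widths
    done = set()
    while heap:
        neg_w, u = heapq.heappop(heap)
        if u in done:
            continue                    # stale heap entry
        done.add(u)
        for v, w in graph.get(u, []):
            cand = min(-neg_w, w)       # bottleneck when extending the path via u
            if cand > width.get(v, float("-inf")):
                width[v] = cand
                heapq.heappush(heap, (-cand, v))
    return width


# Illustrative weights only; they are not taken from the paper.
example_graph = {
    "s": [("a", 0.9), ("b", 0.5)],
    "a": [("t", 0.4)],
    "b": [("t", 0.5)],
    "t": [],
}
example_widths = widest_path_widths(example_graph, "s")
```

In this example the path s, b, t has width 0.5, which beats s, a, t (width 0.4) even though the latter starts with the heavier edge. With a binary heap the search runs in O(E log V) time.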

research
09/18/2019

Stopping Criteria for Value and Strategy Iteration on Concurrent Stochastic Reachability Games

We consider concurrent stochastic games played on graphs with reachabili...
research
09/17/2021

Stochastic Games with Disjunctions of Multiple Objectives

Stochastic games combine controllable and adversarial non-determinism wi...
research
08/10/2021

Stochastic Games with Disjunctions of Multiple Objectives (Technical Report)

Stochastic games combine controllable and adversarial non-determinism wi...
research
08/21/2020

Comparison of Algorithms for Simple Stochastic Games (Full Version)

Simple stochastic games are turn-based 2.5-player zero-sum graph games w...
research
05/03/2023

Solving irreducible stochastic mean-payoff games and entropy games by relative Krasnoselskii-Mann iteration

We analyse an algorithm solving stochastic mean-payoff games, combining ...
research
07/29/2022

Optimistic and Topological Value Iteration for Simple Stochastic Games

While value iteration (VI) is a standard solution approach to simple sto...
research
06/05/2019

A neural network based policy iteration algorithm with global H^2-superlinear convergence for stochastic games on domains

In this work, we propose a class of numerical schemes for solving semili...
