Comparison of Algorithms for Simple Stochastic Games

09/23/2020
by   Jan Křetínský, et al.
0

Simple stochastic games are turn-based 2.5-player zero-sum graph games with a reachability objective. The problem is to compute the winning probability as well as the optimal strategies of both players. In this paper, we compare the three known classes of algorithms – value iteration, strategy iteration and quadratic programming – both theoretically and practically. Further, we suggest several improvements for all algorithms, including the first approach based on quadratic programming that avoids transforming the stochastic game to a stopping one. Our extensive experiments show that these improvements can lead to significant speed-ups. We implemented all algorithms in PRISM-games 3.0, thereby providing the first implementation of quadratic programming for solving simple stochastic games.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/21/2020

Comparison of Algorithms for Simple Stochastic Games (Full Version)

Simple stochastic games are turn-based 2.5-player zero-sum graph games w...
research
02/24/2020

Optimal strategies in the Fighting Fantasy gaming system: influencing stochastic dynamics by gambling with limited resource

Fighting Fantasy is a popular recreational fantasy gaming system worldwi...
research
07/29/2022

Optimistic and Topological Value Iteration for Simple Stochastic Games

While value iteration (VI) is a standard solution approach to simple sto...
research
12/18/2021

Playing Against Fair Adversaries in Stochastic Games with Total Rewards

We investigate zero-sum turn-based two-player stochastic games in which ...
research
05/03/2023

Solving irreducible stochastic mean-payoff games and entropy games by relative Krasnoselskii-Mann iteration

We analyse an algorithm solving stochastic mean-payoff games, combining ...
research
07/20/2022

A Lattice-Theoretical View of Strategy Iteration

Strategy iteration is a technique frequently used for two-player games i...
research
07/04/2012

Point-Based POMDP Algorithms: Improved Analysis and Implementation

Existing complexity bounds for point-based POMDP value iteration algorit...

Please sign up or login with your details

Forgot password? Click here to reset