Abstract
We consider two-person zero-sum stochastic mean payoff games with perfect information, or BWR-games, given by a digraph G= (V, E) , with local rewards r:E→Z, and three types of positions: black VB, white VW, and random VR forming a partition of V. It is a long-standing open question whether a polynomial time algorithm for BWR-games exists, even when | VR| = 0. In fact, a pseudo-polynomial algorithm for BWR-games would already imply their polynomial solvability. In this short note, we show that BWR-games can be solved via convex programming in pseudo-polynomial time if the number of random positions is a constant.
Original language | British English |
---|---|
Pages (from-to) | 1499-1512 |
Number of pages | 14 |
Journal | Optimization Letters |
Volume | 11 |
Issue number | 8 |
DOIs | |
State | Published - 1 Dec 2017 |
Keywords
- Convex programming
- Mean payoff
- Perfect information
- Pseudo-polynomial algorithm
- Stochastic games