TY - JOUR
T1 - A Potential Reduction Algorithm for Two-Person Zero-Sum Mean Payoff Stochastic Games
AU - Boros, Endre
AU - Elbassioni, Khaled
AU - Gurvich, Vladimir
AU - Makino, Kazuhisa
N1 - Funding Information:
This research was partially supported by DIMACS, Center for Discrete Mathematics and Theoretical Computer Science, Rutgers University, and by the Scientific Grant-in-Aid from Ministry of Education, Science, Sports and Culture of Japan. Part of this research was done at the Mathematisches Forschungsinstitut Oberwolfach during stays within the Research in Pairs Program. The first author also acknowledges the partial support of NSF Grant IIS-1161476. The third author was partially funded by the Russian Academic Excellence Project ‘5-100’. An extended abstract of this paper was published in the proceedings of the Combinatorial Optimization and Applications, 2014 [8].
Publisher Copyright:
© 2016, Springer Science+Business Media New York.
PY - 2018/3/1
Y1 - 2018/3/1
N2 - We suggest a new algorithm for two-person zero-sum undiscounted stochastic games focusing on stationary strategies. Given a positive real ε, let us call a stochastic game ε-ergodic, if its values from any two initial positions differ by at most ε. The proposed new algorithm outputs for every ε> 0 in finite time either a pair of stationary strategies for the two players guaranteeing that the values from any initial positions are within an ε-range, or identifies two initial positions u and v and corresponding stationary strategies for the players proving that the game values starting from u and v are at least ε/ 24 apart. In particular, the above result shows that if a stochastic game is ε-ergodic, then there are stationary strategies for the players proving 24 ε-ergodicity. This result strengthens and provides a constructive version of an existential result by Vrieze (Stochastic games with finite state and action spaces. PhD thesis, Centrum voor Wiskunde en Informatica, Amsterdam, 1980) claiming that if a stochastic game is 0-ergodic, then there are ε-optimal stationary strategies for every ε> 0. The suggested algorithm is based on a potential transformation technique that changes the range of local values at all positions without changing the normal form of the game.
AB - We suggest a new algorithm for two-person zero-sum undiscounted stochastic games focusing on stationary strategies. Given a positive real ε, let us call a stochastic game ε-ergodic, if its values from any two initial positions differ by at most ε. The proposed new algorithm outputs for every ε> 0 in finite time either a pair of stationary strategies for the two players guaranteeing that the values from any initial positions are within an ε-range, or identifies two initial positions u and v and corresponding stationary strategies for the players proving that the game values starting from u and v are at least ε/ 24 apart. In particular, the above result shows that if a stochastic game is ε-ergodic, then there are stationary strategies for the players proving 24 ε-ergodicity. This result strengthens and provides a constructive version of an existential result by Vrieze (Stochastic games with finite state and action spaces. PhD thesis, Centrum voor Wiskunde en Informatica, Amsterdam, 1980) claiming that if a stochastic game is 0-ergodic, then there are ε-optimal stationary strategies for every ε> 0. The suggested algorithm is based on a potential transformation technique that changes the range of local values at all positions without changing the normal form of the game.
KW - Computational game theory
KW - Limiting average payoff
KW - Local reward
KW - Mean payoff
KW - Potential transformation
KW - Undiscounted stochastic games
UR - http://www.scopus.com/inward/record.url?scp=85042236427&partnerID=8YFLogxK
U2 - 10.1007/s13235-016-0199-x
DO - 10.1007/s13235-016-0199-x
M3 - Article
AN - SCOPUS:85042236427
SN - 2153-0785
VL - 8
SP - 22
EP - 41
JO - Dynamic Games and Applications
JF - Dynamic Games and Applications
IS - 1
ER -