Abstract
It is shown that the discount factor needed to solve an undiscounted mean payoff stochastic game to optimality is exponentially close to 1, even in one-player games with a single random node and polynomially bounded rewards and transition probabilities. For the class of the so-called irreducible games with perfect information and a constant number of random nodes, we obtain a pseudo-polynomial algorithm using discounts.
Original language | British English |
---|---|
Pages (from-to) | 357-362 |
Number of pages | 6 |
Journal | Operations Research Letters |
Volume | 41 |
Issue number | 4 |
DOIs | |
State | Published - 2013 |
Keywords
- Discounted stochastic games
- Markov decision processes
- Pseudo-polynomial algorithms
- Saddle point
- Zero-sum stochastic games