Abstract
It is shown that the discount factor needed to solve an undiscounted mean payoff stochastic game to optimality is exponentially close to 1, even in one-player games with a single random node and polynomially bounded rewards and transition probabilities. For the class of the so-called irreducible games with perfect information and a constant number of random nodes, we obtain a pseudo-polynomial algorithm using discounts.
| Original language | British English |
|---|---|
| Pages (from-to) | 357-362 |
| Number of pages | 6 |
| Journal | Operations Research Letters |
| Volume | 41 |
| Issue number | 4 |
| DOIs | |
| State | Published - 2013 |
Keywords
- Discounted stochastic games
- Markov decision processes
- Pseudo-polynomial algorithms
- Saddle point
- Zero-sum stochastic games
Fingerprint
Dive into the research topics of 'On discounted approximations of undiscounted stochastic games and Markov decision processes with limited randomness'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver