A Fully Polynomial Time Approximation Scheme for Constrained MDPs Under Local Transitions

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    Abstract

    The fixed-horizon constrained Markov Decision Process (C-MDP) is a well-known model for planning in stochastic environments under operating constraints. Chance-constrained MDP (CC-MDP) is a variant that allows bounding the probability of constraint violation, which is desired in many safety-critical applications. CC-MDP can also model a class of MDPs, called Stochastic Shortest Path (SSP), under dead-ends, where there is a trade-off between the probability-to-goal and cost-to-goal. This work studies the structure of (C)C-MDP, particularly an important variant that involves local transition. In this variant, the state reachability exhibits a certain degree of locality and independence from the remaining states. More precisely, the number of states, at a given time, that share some reachable future states is always constant. (C)C-MDP under local transition is NP-Hard even for a planning horizon of two. In this work, we propose a fully polynomial-time approximation scheme for (C)C-MDP that computes (near) optimal deterministic policies. Such an algorithm is among the best approximation algorithms attainable in theory and gives insights into the approximability of constrained MDP and its variants.

    Original languageBritish English
    Title of host publication2023 62nd IEEE Conference on Decision and Control, CDC 2023
    PublisherInstitute of Electrical and Electronics Engineers Inc.
    Pages1782-1789
    Number of pages8
    ISBN (Electronic)9798350301243
    DOIs
    StatePublished - 2023
    Event62nd IEEE Conference on Decision and Control, CDC 2023 - Singapore, Singapore
    Duration: 13 Dec 202315 Dec 2023

    Publication series

    NameProceedings of the IEEE Conference on Decision and Control
    ISSN (Print)0743-1546
    ISSN (Electronic)2576-2370

    Conference

    Conference62nd IEEE Conference on Decision and Control, CDC 2023
    Country/TerritorySingapore
    CitySingapore
    Period13/12/2315/12/23

    Fingerprint

    Dive into the research topics of 'A Fully Polynomial Time Approximation Scheme for Constrained MDPs Under Local Transitions'. Together they form a unique fingerprint.

    Cite this