Related papers: One-Counter Markov Decision Processes

Taming Infinity one Chunk at a Time: Concisely Represented Strategies in One-Counter MDPs

Markov decision processes (MDPs) are a canonical model to reason about decision making within a stochastic environment. We study a fundamental class of infinite MDPs: one-counter MDPs (OC-MDPs). They extend finite MDPs via an associated…

Computer Science and Game Theory · Computer Science 2025-03-04 Michal Ajdarów , James C. A. Main , Petr Novotný , Mickael Randour

Approximating the Termination Value of One-Counter MDPs and Stochastic Games

One-counter MDPs (OC-MDPs) and one-counter simple stochastic games (OC-SSGs) are 1-player, and 2-player turn-based zero-sum, stochastic games played on the transition graph of classic one-counter automata (equivalently, pushdown automata…

Computer Science and Game Theory · Computer Science 2011-07-21 Tomáš Brázdil , Václav Brožek , Kousha Etessami , Antonín Kučera

What Are the Odds? Improving the foundations of Statistical Model Checking

Markov decision processes (MDPs) are a fundamental model for decision making under uncertainty. They exhibit non-deterministic choice as well as probabilistic uncertainty. Traditionally, verification algorithms assume exact knowledge of the…

Artificial Intelligence · Computer Science 2025-04-18 Tobias Meggendorfer , Maximilian Weininger , Patrick Wienhöft

One-Counter Stochastic Games

We study the computational complexity of basic decision problems for one-counter simple stochastic games (OC-SSGs), under various objectives. OC-SSGs are 2-player turn-based stochastic games played on the transition graph of classic…

Computer Science and Game Theory · Computer Science 2010-09-29 Tomáš Brázdil , Václav Brožek , Kousha Etessami

Equivalence of Optimality Criteria for Markov Decision Process and Model Predictive Control

This paper shows that the optimal policy and value functions of a Markov Decision Process (MDP), either discounted or not, can be captured by a finite-horizon undiscounted Optimal Control Problem (OCP), even if based on an inexact model.…

Systems and Control · Electrical Eng. & Systems 2023-02-08 Arash Bahari Kordabad , Mario Zanon , Sebastien Gros

Quantum Markov Decision Processes: General Theory, Approximations, and Classes of Policies

In this paper, the aim is to develop a quantum counterpart to classical Markov decision processes (MDPs). Firstly, we provide a very general formulation of quantum MDPs with state and action spaces in the quantum domain, quantum…

Quantum Physics · Physics 2024-09-19 Naci Saldi , Sina Sanjari , Serdar Yuksel

Positivity-hardness results on Markov decision processes

This paper investigates a series of optimization problems for one-counter Markov decision processes (MDPs) and integer-weighted MDPs with finite state space. Specifically, it considers problems addressing termination probabilities and…

Logic in Computer Science · Computer Science 2024-08-07 Jakob Piribauer , Christel Baier

Qualitative Multi-Objective Reachability for Ordered Branching MDPs

We study qualitative multi-objective reachability problems for Ordered Branching Markov Decision Processes (OBMDPs), or equivalently context-free MDPs, building on prior results for single-target reachability on Branching Markov Decision…

Computer Science and Game Theory · Computer Science 2020-08-25 Kousha Etessami , Emanuel Martinov

Solving Robust Markov Decision Processes: Generic, Reliable, Efficient

Markov decision processes (MDP) are a well-established model for sequential decision-making in the presence of probabilities. In robust MDP (RMDP), every action is associated with an uncertainty set of probability distributions, modelling…

Artificial Intelligence · Computer Science 2024-12-16 Tobias Meggendorfer , Maximilian Weininger , Patrick Wienhöft

Decisiveness for countable MDPs and insights for NPLCSs and POMDPs

Markov chains and Markov decision processes (MDPs) are well-established probabilistic models. While finite Markov models are well-understood, analysing their infinite counterparts remains a significant challenge. Decisiveness has proven to…

Logic in Computer Science · Computer Science 2025-04-23 Nathalie Bertrand , Patricia Bouyer , Thomas Brihaye , Paulin Fournier , Pierre Vandenhove

Limit Synchronization in Markov Decision Processes

Markov decision processes (MDP) are finite-state systems with both strategic and probabilistic choices. After fixing a strategy, an MDP produces a sequence of probability distributions over states. The sequence is eventually synchronizing…

Computer Science and Game Theory · Computer Science 2013-11-01 Laurent Doyen , Thierry Massart , Mahsa Shirmohammadi

On the Complexity of Robust Markov Decision Processes and Bisimulation Metrics

Robust Markov decision processes (RMDPs) extend standard Markov decision processes (MDPs) to account for uncertainty in the transition probabilities. RMDPs have an uncertainty set that defines a set of possible transition functions, each of…

Logic in Computer Science · Computer Science 2026-04-30 Marnix Suilen , Guillermo A. Pérez

Convex Approximations of Random Constrained Markov Decision Processes

Constrained Markov decision processes (CMDPs) are used as a decision-making framework to study the long-run performance of a stochastic system. It is well-known that a stationary optimal policy of a CMDP problem under discounted cost…

Optimization and Control · Mathematics 2025-06-02 V Varagapriya , Vikas Vikram Singh , Abdel Lisser

Scenario-Based Verification of Uncertain MDPs

We consider Markov decision processes (MDPs) in which the transition probabilities and rewards belong to an uncertainty set parametrized by a collection of random variables. The probability distributions for these random parameters are…

Logic in Computer Science · Computer Science 2020-02-26 Murat Cubuktepe , Nils Jansen , Sebastian Junges , Joost-Pieter Katoen , Ufuk Topcu

POMDPs under Probabilistic Semantics

We consider partially observable Markov decision processes (POMDPs) with limit-average payoff, where a reward value in the interval [0,1] is associated to every transition, and the payoff of an infinite path is the long-run average of the…

Artificial Intelligence · Computer Science 2013-08-23 Krishnendu Chatterjee , Martin Chmelík

POMDPs under Probabilistic Semantics

We consider partially observable Markov decision processes (POMDPs) with limit-average payoff, where a reward value in the interval [0,1] is associated to every transition, and the payoff of an infinite path is the long-run average of the…

Artificial Intelligence · Computer Science 2014-08-12 Krishnendu Chatterjee , Martin Chmelik

Chance and Mass Interpretations of Probabilities in Markov Decision Processes (Extended Version)

Markov decision processes (MDPs) are a popular model for decision-making in the presence of uncertainty. The conventional view of MDPs in verification treats them as state transformers with probabilities defined over sequences of states and…

Formal Languages and Automata Theory · Computer Science 2025-07-25 Yun Chen Tsai , Kittiphon Phalakarn , S. Akshay , Ichiro Hasuo

The Complexity of Synchronizing Markov Decision Processes

We consider Markov decision processes (MDP) as generators of sequences of probability distributions over states. A probability distribution is p-synchronizing if the probability mass is at least p in a single state, or in a given set of…

Formal Languages and Automata Theory · Computer Science 2018-03-28 Laurent Doyen , Thierry Massart , Mahsa Shirmohammadi

Fast Online Exact Solutions for Deterministic MDPs with Sparse Rewards

Markov Decision Processes (MDPs) are a mathematical framework for modeling sequential decision making under uncertainty. The classical approaches for solving MDPs are well known and have been widely studied, some of which rely on…

Machine Learning · Computer Science 2018-05-18 Joshua R. Bertram , Xuxi Yang , Peng Wei

Lower Bound On the Computational Complexity of Discounted Markov Decision Problems

We study the computational complexity of the infinite-horizon discounted-reward Markov Decision Problem (MDP) with a finite state space $|\mathcal{S}|$ and a finite action space $|\mathcal{A}|$. We show that any randomized algorithm needs a…

Computational Complexity · Computer Science 2017-05-24 Yichen Chen , Mengdi Wang