Related papers: Robust Synchronization in Markov Decision Processe…

The Complexity of Synchronizing Markov Decision Processes

We consider Markov decision processes (MDP) as generators of sequences of probability distributions over states. A probability distribution is p-synchronizing if the probability mass is at least p in a single state, or in a given set of…

Formal Languages and Automata Theory · Computer Science 2018-03-28 Laurent Doyen , Thierry Massart , Mahsa Shirmohammadi

Limit Synchronization in Markov Decision Processes

Markov decision processes (MDP) are finite-state systems with both strategic and probabilistic choices. After fixing a strategy, an MDP produces a sequence of probability distributions over states. The sequence is eventually synchronizing…

Computer Science and Game Theory · Computer Science 2013-11-01 Laurent Doyen , Thierry Massart , Mahsa Shirmohammadi

Synchronizing Objectives for Markov Decision Processes

We introduce synchronizing objectives for Markov decision processes (MDP). Intuitively, a synchronizing objective requires that eventually, at every step there is a state which concentrates almost all the probability mass. In particular, it…

Logic in Computer Science · Computer Science 2011-02-22 Laurent Doyen , Thierry Massart , Mahsa Shirmohammadi

Bounds for Synchronizing Markov Decision Processes

We consider Markov decision processes with synchronizing objectives, which require that a probability mass of $1-\epsilon$ accumulates in a designated set of target states, either once, always, infinitely often, or always from some point…

Logic in Computer Science · Computer Science 2022-04-28 Laurent Doyen , Marie van den Bogaard

Solving Robust Markov Decision Processes: Generic, Reliable, Efficient

Markov decision processes (MDP) are a well-established model for sequential decision-making in the presence of probabilities. In robust MDP (RMDP), every action is associated with an uncertainty set of probability distributions, modelling…

Artificial Intelligence · Computer Science 2024-12-16 Tobias Meggendorfer , Maximilian Weininger , Patrick Wienhöft

Stochastic Games with Synchronizing Objectives

We consider two-player stochastic games played on a finite graph for infinitely many rounds. Stochastic games generalize both Markov decision processes (MDP) by adding an adversary player, and two-player deterministic games by adding…

Computer Science and Game Theory · Computer Science 2022-02-28 Laurent Doyen

Distributionally Robust Optimization for Sequential Decision Making

The distributionally robust Markov Decision Process (MDP) approach asks for a distributionally robust policy that achieves the maximal expected total reward under the most adversarial distribution of uncertain parameters. In this paper, we…

Systems and Control · Computer Science 2018-10-10 Zhi Chen , Pengqian Yu , William B. Haskell

Strengthening Deterministic Policies for POMDPs

The synthesis problem for partially observable Markov decision processes (POMDPs) is to compute a policy that satisfies a given specification. Such policies have to take the full execution history of a POMDP into account, rendering the…

Artificial Intelligence · Computer Science 2020-07-20 Leonore Winterer , Ralf Wimmer , Nils Jansen , Bernd Becker

Combinations of Qualitative Winning for Stochastic Parity Games

We study Markov decision processes and turn-based stochastic games with parity conditions. There are three qualitative winning criteria, namely, sure winning, which requires all paths must satisfy the condition, almost-sure winning, which…

Logic in Computer Science · Computer Science 2018-04-11 Krishnendu Chatterjee , Nir Piterman

Twice Regularized Markov Decision Processes: The Equivalence between Robustness and Regularization

Robust Markov decision processes (MDPs) aim to handle changing or partially known system dynamics. To solve them, one typically resorts to robust optimization methods. However, this significantly increases computational complexity and…

Machine Learning · Computer Science 2023-03-14 Esther Derman , Yevgeniy Men , Matthieu Geist , Shie Mannor

Robust Entropy-regularized Markov Decision Processes

Stochastic and soft optimal policies resulting from entropy-regularized Markov decision processes (ER-MDP) are desirable for exploration and imitation learning applications. Motivated by the fact that such policies are sensitive with…

Machine Learning · Computer Science 2022-01-03 Tien Mai , Patrick Jaillet

Risk-Sensitive Markov Decision Processes with Combined Metrics of Mean and Variance

This paper investigates the optimization problem of an infinite stage discrete time Markov decision process (MDP) with a long-run average metric considering both mean and variance of rewards together. Such performance metric is important…

Optimization and Control · Mathematics 2020-08-11 Li Xia

Distributionally robust chance-constrained Markov decision processes

Markov decision process (MDP) is a decision making framework where a decision maker is interested in maximizing the expected discounted value of a stream of rewards received at future stages at various states which are visited according to…

Optimization and Control · Mathematics 2022-12-19 Hoang Nam Nguyen , Abdel Lisser , Vikas Vikram Singh

Sliding Window Codes: Near-Optimality and Q-Learning for Zero-Delay Coding

We study the problem of zero-delay coding for the transmission of a Markov source over a noisy channel with feedback and present a reinforcement learning solution which is guaranteed to achieve near-optimality. To this end, we formulate the…

Optimization and Control · Mathematics 2025-10-07 Liam Cregg , Fady Alajaji , Serdar Yuksel

MDPs as Distribution Transformers: Affine Invariant Synthesis for Safety Objectives

Markov decision processes can be viewed as transformers of probability distributions. While this view is useful from a practical standpoint to reason about trajectories of distributions, basic reachability and safety problems are known to…

Logic in Computer Science · Computer Science 2023-05-29 S. Akshay , Krishnendu Chatterjee , Tobias Meggendorfer , Đorđe Žikelić

Accelerating Policy Synthesis in Large-Scale MDPs via Hierarchical Adaptive Refinement

Software-intensive systems, such as software product lines and robotics, utilise Markov decision processes (MDPs) to capture uncertainty and analyse sequential decision-making problems. Despite the usefulness of conventional policy…

Artificial Intelligence · Computer Science 2026-05-01 Alexandros Evangelidis , Gricel Vázquez , Simos Gerasimou

Twice regularized MDPs and the equivalence between robustness and regularization

Robust Markov decision processes (MDPs) aim to handle changing or partially known system dynamics. To solve them, one typically resorts to robust optimization methods. However, this significantly increases computational complexity and…

Machine Learning · Computer Science 2021-10-14 Esther Derman , Matthieu Geist , Shie Mannor

Policies Grow on Trees: Model Checking Families of MDPs

Markov decision processes (MDPs) provide a fundamental model for sequential decision making under process uncertainty. A classical synthesis task is to compute for a given MDP a winning policy that achieves a desired specification. However,…

Logic in Computer Science · Computer Science 2024-07-18 Roman Andriushchenko , Milan Češka , Sebastian Junges , Filip Macák

Finite-memory Strategies for Almost-sure Energy-MeanPayoff Objectives in MDPs

We consider finite-state Markov decision processes with the combined Energy-MeanPayoff objective. The controller tries to avoid running out of energy while simultaneously attaining a strictly positive mean payoff in a second dimension. We…

Computer Science and Game Theory · Computer Science 2025-10-13 Mohan Dantam , Richard Mayr

Robust Markov Decision Processes on Continuous State Spaces

We study infinite-horizon robust Markov decision processes (MDPs) on continuous state spaces with structured rectangular ambiguity set. The proposed ambiguity set falls within the convex hull of unknown generating kernels. We utilize the…

Optimization and Control · Mathematics 2026-05-28 Mengmeng Li , Yifan Hu , Daniel Kuhn , Yan Li