Related papers: Synchronizing Objectives for Markov Decision Proce…

The Complexity of Synchronizing Markov Decision Processes

We consider Markov decision processes (MDP) as generators of sequences of probability distributions over states. A probability distribution is p-synchronizing if the probability mass is at least p in a single state, or in a given set of…

Formal Languages and Automata Theory · Computer Science 2018-03-28 Laurent Doyen , Thierry Massart , Mahsa Shirmohammadi

Limit Synchronization in Markov Decision Processes

Markov decision processes (MDP) are finite-state systems with both strategic and probabilistic choices. After fixing a strategy, an MDP produces a sequence of probability distributions over states. The sequence is eventually synchronizing…

Computer Science and Game Theory · Computer Science 2013-11-01 Laurent Doyen , Thierry Massart , Mahsa Shirmohammadi

Robust Synchronization in Markov Decision Processes

We consider synchronizing properties of Markov decision processes (MDP), viewed as generators of sequences of probability distributions over states. A probability distribution is p-synchronizing if the probability mass is at least p in some…

Logic in Computer Science · Computer Science 2014-07-01 Laurent Doyen , Thierry Massart , Mahsa Shirmohammadi

Bounds for Synchronizing Markov Decision Processes

We consider Markov decision processes with synchronizing objectives, which require that a probability mass of $1-\epsilon$ accumulates in a designated set of target states, either once, always, infinitely often, or always from some point…

Logic in Computer Science · Computer Science 2022-04-28 Laurent Doyen , Marie van den Bogaard

Stochastic Games with Synchronizing Objectives

We consider two-player stochastic games played on a finite graph for infinitely many rounds. Stochastic games generalize both Markov decision processes (MDP) by adding an adversary player, and two-player deterministic games by adding…

Computer Science and Game Theory · Computer Science 2022-02-28 Laurent Doyen

Finite-Horizon Markov Decision Processes with Sequentially-Observed Transitions

Markov Decision Processes (MDPs) have been used to formulate many decision-making problems in science and engineering. The objective is to synthesize the best decision (action selection) policies to maximize expected rewards (or minimize…

Optimization and Control · Mathematics 2015-07-07 Mahmoud El Chamie , Behcet Acikmese

Unifying Two Views on Multiple Mean-Payoff Objectives in Markov Decision Processes

We consider Markov decision processes (MDPs) with multiple limit-average (or mean-payoff) objectives. There exist two different views: (i) the expectation semantics, where the goal is to optimize the expected mean-payoff objective, and (ii)…

Logic in Computer Science · Computer Science 2019-03-14 Krishnendu Chatterjee , Zuzana Křetínská , Jan Křetínský

Mixing Probabilistic and non-Probabilistic Objectives in Markov Decision Processes

In this paper, we consider algorithms to decide the existence of strategies in MDPs for Boolean combinations of objectives. These objectives are omega-regular properties that need to be enforced either surely, almost surely, existentially,…

Logic in Computer Science · Computer Science 2020-04-30 Raphaël Berthon , Shibashis Guha , Jean-François Raskin

Bi-Objective Lexicographic Optimization in Markov Decision Processes with Related Objectives

We consider lexicographic bi-objective problems on Markov Decision Processes (MDPs), where we optimize one objective while guaranteeing optimality of another. We propose a two-stage technique for solving such problems when the objectives…

Computer Science and Game Theory · Computer Science 2023-08-17 Damien Busatto-Gaston , Debraj Chakraborty , Anirban Majumdar , Sayan Mukherjee , Guillermo A. Pérez , Jean-François Raskin

Finite-Memory Strategies in POMDPs with Long-Run Average Objectives

Partially observable Markov decision processes (POMDPs) are standard models for dynamic systems with probabilistic and nondeterministic behaviour in uncertain environments. We prove that in POMDPs with long-run average objective, the…

Computer Science and Game Theory · Computer Science 2022-09-29 Krishnendu Chatterjee , Raimundo Saona , Bruno Ziliotto

Taming Infinity one Chunk at a Time: Concisely Represented Strategies in One-Counter MDPs

Markov decision processes (MDPs) are a canonical model to reason about decision making within a stochastic environment. We study a fundamental class of infinite MDPs: one-counter MDPs (OC-MDPs). They extend finite MDPs via an associated…

Computer Science and Game Theory · Computer Science 2025-03-04 Michal Ajdarów , James C. A. Main , Petr Novotný , Mickael Randour

Dynamic Programming for POMDP with Jointly Discrete and Continuous State-Spaces

In this work, we study dynamic programming (DP) algorithms for partially observable Markov decision processes with jointly continuous and discrete state-spaces. We consider a class of stochastic systems which have coupled discrete and…

Optimization and Control · Mathematics 2019-03-07 Donghwan Lee , Niao He , Jianghai Hu

Strengthening Deterministic Policies for POMDPs

The synthesis problem for partially observable Markov decision processes (POMDPs) is to compute a policy that satisfies a given specification. Such policies have to take the full execution history of a POMDP into account, rendering the…

Artificial Intelligence · Computer Science 2020-07-20 Leonore Winterer , Ralf Wimmer , Nils Jansen , Bernd Becker

Learning Algorithms for Verification of Markov Decision Processes

We present a general framework for applying learning algorithms and heuristical guidance to the verification of Markov decision processes (MDPs). The primary goal of our techniques is to improve performance by avoiding an exhaustive…

Systems and Control · Electrical Eng. & Systems 2025-04-02 Tomáš Brázdil , Krishnendu Chatterjee , Martin Chmelik , Vojtěch Forejt , Jan Křetínský , Marta Kwiatkowska , Tobias Meggendorfer , David Parker , Mateusz Ujma

On the Detection of Markov Decision Processes

We study the detection problem for a finite set of Markov decision processes (MDPs) where the MDPs have the same state and action spaces but possibly different probabilistic transition functions. Any one of these MDPs could be the model for…

Optimization and Control · Mathematics 2021-12-24 Xiaoming Duan , Yagiz Savas , Rui Yan , Zhe Xu , Ufuk Topcu

Markov Decision Processes with Multiple Long-run Average Objectives

We study Markov decision processes (MDPs) with multiple limit-average (or mean-payoff) functions. We consider two different objectives, namely, expectation and satisfaction objectives. Given an MDP with k limit-average functions, in the…

Computer Science and Game Theory · Computer Science 2015-07-01 Tomáš Brázdil , Václav Brožek , Krishnendu Chatterjee , Vojtěch Forejt , Antonín Kučera

Simple Strategies in Multi-Objective MDPs (Technical Report)

We consider the verification of multiple expected reward objectives at once on Markov decision processes (MDPs). This enables a trade-off analysis among multiple objectives by obtaining the Pareto front. We focus on strategies that are easy…

Logic in Computer Science · Computer Science 2020-02-18 Florent Delgrange , Joost-Pieter Katoen , Tim Quatmann , Mickael Randour

Finite-Horizon Markov Decision Processes with State Constraints

Markov Decision Processes (MDPs) have been used to formulate many decision-making problems in science and engineering. The objective is to synthesize the best decision (action selection) policies to maximize expected rewards (minimize…

Optimization and Control · Mathematics 2015-07-08 Mahmoud El Chamie , Behcet Acikmese

Solving Robust Markov Decision Processes: Generic, Reliable, Efficient

Markov decision processes (MDP) are a well-established model for sequential decision-making in the presence of probabilities. In robust MDP (RMDP), every action is associated with an uncertainty set of probability distributions, modelling…

Artificial Intelligence · Computer Science 2024-12-16 Tobias Meggendorfer , Maximilian Weininger , Patrick Wienhöft

Multi-objective Robust Strategy Synthesis for Interval Markov Decision Processes

Interval Markov decision processes (IMDPs) generalise classical MDPs by having interval-valued transition probabilities. They provide a powerful modelling tool for probabilistic systems with an additional variation or uncertainty that…

Systems and Control · Computer Science 2017-07-07 Ernst Moritz Hahn , Vahid Hashemi , Holger Hermanns , Morteza Lahijanian , Andrea Turrini