English
Related papers

Related papers: Synchronizing Objectives for Markov Decision Proce…

200 papers

We consider Markov decision processes (MDP) as generators of sequences of probability distributions over states. A probability distribution is p-synchronizing if the probability mass is at least p in a single state, or in a given set of…

Formal Languages and Automata Theory · Computer Science 2018-03-28 Laurent Doyen , Thierry Massart , Mahsa Shirmohammadi

Markov decision processes (MDP) are finite-state systems with both strategic and probabilistic choices. After fixing a strategy, an MDP produces a sequence of probability distributions over states. The sequence is eventually synchronizing…

Computer Science and Game Theory · Computer Science 2013-11-01 Laurent Doyen , Thierry Massart , Mahsa Shirmohammadi

We consider synchronizing properties of Markov decision processes (MDP), viewed as generators of sequences of probability distributions over states. A probability distribution is p-synchronizing if the probability mass is at least p in some…

Logic in Computer Science · Computer Science 2014-07-01 Laurent Doyen , Thierry Massart , Mahsa Shirmohammadi

We consider Markov decision processes with synchronizing objectives, which require that a probability mass of $1-\epsilon$ accumulates in a designated set of target states, either once, always, infinitely often, or always from some point…

Logic in Computer Science · Computer Science 2022-04-28 Laurent Doyen , Marie van den Bogaard

We consider two-player stochastic games played on a finite graph for infinitely many rounds. Stochastic games generalize both Markov decision processes (MDP) by adding an adversary player, and two-player deterministic games by adding…

Computer Science and Game Theory · Computer Science 2022-02-28 Laurent Doyen

Markov Decision Processes (MDPs) have been used to formulate many decision-making problems in science and engineering. The objective is to synthesize the best decision (action selection) policies to maximize expected rewards (or minimize…

Optimization and Control · Mathematics 2015-07-07 Mahmoud El Chamie , Behcet Acikmese

We consider Markov decision processes (MDPs) with multiple limit-average (or mean-payoff) objectives. There exist two different views: (i) the expectation semantics, where the goal is to optimize the expected mean-payoff objective, and (ii)…

Logic in Computer Science · Computer Science 2019-03-14 Krishnendu Chatterjee , Zuzana Křetínská , Jan Křetínský

In this paper, we consider algorithms to decide the existence of strategies in MDPs for Boolean combinations of objectives. These objectives are omega-regular properties that need to be enforced either surely, almost surely, existentially,…

Logic in Computer Science · Computer Science 2020-04-30 Raphaël Berthon , Shibashis Guha , Jean-François Raskin

We consider lexicographic bi-objective problems on Markov Decision Processes (MDPs), where we optimize one objective while guaranteeing optimality of another. We propose a two-stage technique for solving such problems when the objectives…

Computer Science and Game Theory · Computer Science 2023-08-17 Damien Busatto-Gaston , Debraj Chakraborty , Anirban Majumdar , Sayan Mukherjee , Guillermo A. Pérez , Jean-François Raskin

Partially observable Markov decision processes (POMDPs) are standard models for dynamic systems with probabilistic and nondeterministic behaviour in uncertain environments. We prove that in POMDPs with long-run average objective, the…

Computer Science and Game Theory · Computer Science 2022-09-29 Krishnendu Chatterjee , Raimundo Saona , Bruno Ziliotto

Markov decision processes (MDPs) are a canonical model to reason about decision making within a stochastic environment. We study a fundamental class of infinite MDPs: one-counter MDPs (OC-MDPs). They extend finite MDPs via an associated…

Computer Science and Game Theory · Computer Science 2025-03-04 Michal Ajdarów , James C. A. Main , Petr Novotný , Mickael Randour

In this work, we study dynamic programming (DP) algorithms for partially observable Markov decision processes with jointly continuous and discrete state-spaces. We consider a class of stochastic systems which have coupled discrete and…

Optimization and Control · Mathematics 2019-03-07 Donghwan Lee , Niao He , Jianghai Hu

The synthesis problem for partially observable Markov decision processes (POMDPs) is to compute a policy that satisfies a given specification. Such policies have to take the full execution history of a POMDP into account, rendering the…

Artificial Intelligence · Computer Science 2020-07-20 Leonore Winterer , Ralf Wimmer , Nils Jansen , Bernd Becker

We present a general framework for applying learning algorithms and heuristical guidance to the verification of Markov decision processes (MDPs). The primary goal of our techniques is to improve performance by avoiding an exhaustive…

We study the detection problem for a finite set of Markov decision processes (MDPs) where the MDPs have the same state and action spaces but possibly different probabilistic transition functions. Any one of these MDPs could be the model for…

Optimization and Control · Mathematics 2021-12-24 Xiaoming Duan , Yagiz Savas , Rui Yan , Zhe Xu , Ufuk Topcu

We study Markov decision processes (MDPs) with multiple limit-average (or mean-payoff) functions. We consider two different objectives, namely, expectation and satisfaction objectives. Given an MDP with k limit-average functions, in the…

Computer Science and Game Theory · Computer Science 2015-07-01 Tomáš Brázdil , Václav Brožek , Krishnendu Chatterjee , Vojtěch Forejt , Antonín Kučera

We consider the verification of multiple expected reward objectives at once on Markov decision processes (MDPs). This enables a trade-off analysis among multiple objectives by obtaining the Pareto front. We focus on strategies that are easy…

Logic in Computer Science · Computer Science 2020-02-18 Florent Delgrange , Joost-Pieter Katoen , Tim Quatmann , Mickael Randour

Markov Decision Processes (MDPs) have been used to formulate many decision-making problems in science and engineering. The objective is to synthesize the best decision (action selection) policies to maximize expected rewards (minimize…

Optimization and Control · Mathematics 2015-07-08 Mahmoud El Chamie , Behcet Acikmese

Markov decision processes (MDP) are a well-established model for sequential decision-making in the presence of probabilities. In robust MDP (RMDP), every action is associated with an uncertainty set of probability distributions, modelling…

Artificial Intelligence · Computer Science 2024-12-16 Tobias Meggendorfer , Maximilian Weininger , Patrick Wienhöft

Interval Markov decision processes (IMDPs) generalise classical MDPs by having interval-valued transition probabilities. They provide a powerful modelling tool for probabilistic systems with an additional variation or uncertainty that…

Systems and Control · Computer Science 2017-07-07 Ernst Moritz Hahn , Vahid Hashemi , Holger Hermanns , Morteza Lahijanian , Andrea Turrini
‹ Prev 1 2 3 10 Next ›