English
Related papers

Related papers: POMDPs under Probabilistic Semantics

200 papers

We consider partially observable Markov decision processes (POMDPs) with limit-average payoff, where a reward value in the interval [0,1] is associated to every transition, and the payoff of an infinite path is the long-run average of the…

Artificial Intelligence · Computer Science 2014-08-12 Krishnendu Chatterjee , Martin Chmelik

We study countably infinite Markov decision processes (MDPs) with real-valued transition rewards. Every infinite run induces the following sequences of payoffs: 1. Point payoff (the sequence of directly seen transition rewards), 2. Mean…

Computational Complexity · Computer Science 2023-06-22 Richard Mayr , Eric Munday

We study countably infinite Markov decision processes (MDPs) with real-valued transition rewards. Every infinite run induces the following sequences of payoffs: 1. Point payoff (the sequence of directly seen transition rewards), 2. Total…

Artificial Intelligence · Computer Science 2021-07-13 Richard Mayr , Eric Munday

Partially observable Markov decision processes (POMDPs) are standard models for dynamic systems with probabilistic and nondeterministic behaviour in uncertain environments. We prove that in POMDPs with long-run average objective, the…

Computer Science and Game Theory · Computer Science 2022-09-29 Krishnendu Chatterjee , Raimundo Saona , Bruno Ziliotto

We consider concurrent games played by two-players on a finite-state graph, where in every round the players simultaneously choose a move, and the current state along with the joint moves determine the successor state. We study a…

Computer Science and Game Theory · Computer Science 2014-09-19 Krishnendu Chatterjee , Rasmus Ibsen-Jensen

We consider partially observable Markov decision processes (POMDPs) with {\omega}-regular conditions specified as parity objectives. The class of {\omega}-regular languages extends regular languages to infinite strings and provides a robust…

Logic in Computer Science · Computer Science 2013-09-12 Krishnendu Chatterjee , Martin Chmelik , Mathieu Tracol

We give polynomial-time algorithms for computing the values of Markov decision processes (MDPs) with limsup and liminf objectives. A real-valued reward is assigned to each state, and the value of an infinite path in the MDP is the limsup…

Computer Science and Game Theory · Computer Science 2008-09-10 Krishnendu Chatterjee , Thomas A. Henzinger

We study observation-based strategies for partially-observable Markov decision processes (POMDPs) with omega-regular objectives. An observation-based strategy relies on partial information about the history of a play, namely, on the past…

Logic in Computer Science · Computer Science 2015-05-14 Krishnendu Chatterjee , Laurent Doyen , Thomas A. Henzinger

We consider the problem: is the optimal expected total reward to reach a goal state in a partially observable Markov decision process (POMDP) below a given threshold? We tackle this -- generally undecidable -- problem by computing…

Artificial Intelligence · Computer Science 2022-01-24 Alexander Bork , Joost-Pieter Katoen , Tim Quatmann

The paper deals with finite-state Markov decision processes (MDPs) with integer weights assigned to each state-action pair. New algorithms are presented to classify end components according to their limiting behavior with respect to the…

Logic in Computer Science · Computer Science 2018-05-01 Christel Baier , Nathalie Bertrand , Clemens Dubslaff , Daniel Gburek , Ocan Sankur

We consider partially observable Markov decision processes (POMDPs) with a set of target states and positive integer costs associated with every transition. The traditional optimization objective (stochastic shortest path) asks to minimize…

Artificial Intelligence · Computer Science 2016-05-12 Tomáš Brázdil , Krishnendu Chatterjee , Martin Chmelík , Anchit Gupta , Petr Novotný

Partially observable Markov decision processes (POMDPs) are a central model for uncertainty in sequential decision making. The most basic objective is the reachability objective, where a target set must be eventually visited, and the more…

Computational Complexity · Computer Science 2025-12-09 Ali Asadi , Krishnendu Chatterjee , David Lurie , Raimundo Saona

Markov decision processes (MDPs) with rewards are a widespread and well-studied model for systems that make both probabilistic and nondeterministic choices. A fundamental result about MDPs is that their minimal and maximal expected rewards…

Logic in Computer Science · Computer Science 2024-11-26 Kevin Batz , Benjamin Lucien Kaminski , Christoph Matheja , Tobias Winkler

We consider partially observable Markov decision processes (POMDPs) with a set of target states and every transition is associated with an integer cost. The optimization objective we study asks to minimize the expected total cost till the…

Artificial Intelligence · Computer Science 2014-11-17 Krishnendu Chatterjee , Martin Chmelík , Raghav Gupta , Ayush Kanodia

Partially observable Markov decision processes (POMDPs) provide a modeling framework for a variety of sequential decision making under uncertainty scenarios in artificial intelligence (AI). Since the states are not directly observable in a…

Systems and Control · Computer Science 2019-05-21 Mohamadreza Ahmadi , Nils Jansen , Bo Wu , Ufuk Topcu

Partially observable Markov decision processes (POMDPs) provide an elegant mathematical framework for modeling complex decision and planning problems in stochastic domains in which states of the system are observable only indirectly, via a…

Artificial Intelligence · Computer Science 2011-06-02 M. Hauskrecht

We study Markov decision processes and turn-based stochastic games with parity conditions. There are three qualitative winning criteria, namely, sure winning, which requires all paths must satisfy the condition, almost-sure winning, which…

Logic in Computer Science · Computer Science 2018-04-11 Krishnendu Chatterjee , Nir Piterman

Partially-Observable Markov Decision Processes (POMDPs) are a well-known stochastic model for sequential decision making under limited information. We consider the EXPTIME-hard problem of synthesising policies that almost-surely reach some…

Artificial Intelligence · Computer Science 2021-03-22 Sebastian Junges , Nils Jansen , Sanjit A. Seshia

We study discrete-time discounted constrained Markov decision processes (CMDPs) on Borel spaces with unbounded reward functions. In our approach the transition probability functions are weakly or set-wise continuous. The reward functions…

Optimization and Control · Mathematics 2019-03-29 Eugene A. Feinberg , Anna Jaśkiewicz , Andrzej S. Nowak

In this article we propose a qualitative (ordinal) counterpart for the Partially Observable Markov Decision Processes model (POMDP) in which the uncertainty, as well as the preferences of the agent, are modeled by possibility distributions.…

Artificial Intelligence · Computer Science 2013-01-30 Regis Sabbadin
‹ Prev 1 2 3 10 Next ›