Related papers: Quantum POMDPs

Quantum Markov Decision Processes: General Theory, Approximations, and Classes of Policies

In this paper, the aim is to develop a quantum counterpart to classical Markov decision processes (MDPs). Firstly, we provide a very general formulation of quantum MDPs with state and action spaces in the quantum domain, quantum…

Quantum Physics · Physics 2024-09-19 Naci Saldi , Sina Sanjari , Serdar Yuksel

A Possibilistic Model for Qualitative Sequential Decision Problems under Uncertainty in Partially Observable Environments

In this article we propose a qualitative (ordinal) counterpart for the Partially Observable Markov Decision Processes model (POMDP) in which the uncertainty, as well as the preferences of the agent, are modeled by possibility distributions.…

Artificial Intelligence · Computer Science 2013-01-30 Regis Sabbadin

Value-Function Approximations for Partially Observable Markov Decision Processes

Partially observable Markov decision processes (POMDPs) provide an elegant mathematical framework for modeling complex decision and planning problems in stochastic domains in which states of the system are observable only indirectly, via a…

Artificial Intelligence · Computer Science 2011-06-02 M. Hauskrecht

Partially Observable Markov Decision Processes (POMDPs) and Robotics

Planning under uncertainty is critical to robotics. The Partially Observable Markov Decision Process (POMDP) is a mathematical framework for such planning problems. It is powerful due to its careful quantification of the non-deterministic…

Robotics · Computer Science 2021-07-19 Hanna Kurniawati

Planning in Observable POMDPs in Quasipolynomial Time

Partially Observable Markov Decision Processes (POMDPs) are a natural and general model in reinforcement learning that take into account the agent's uncertainty about its current state. In the literature on POMDPs, it is customary to assume…

Machine Learning · Computer Science 2022-03-24 Noah Golowich , Ankur Moitra , Dhruv Rohatgi

Counterfactual equivalence for POMDPs, and underlying deterministic environments

Partially Observable Markov Decision Processes (POMDPs) are rich environments often used in machine learning. But the issue of information and causal structures in POMDPs has been relatively little studied. This paper presents the concepts…

Artificial Intelligence · Computer Science 2018-01-16 Stuart Armstrong

Reachability Analysis of Quantum Markov Decision Processes

We introduce the notion of quantum Markov decision process (qMDP) as a semantic model of nondeterministic and concurrent quantum programs. It is shown by examples that qMDPs can be used in analysis of quantum algorithms and protocols. We…

Quantum Physics · Physics 2014-07-10 Shenggang Ying , Mingsheng Ying

Interpreting systems as solving POMDPs: a step towards a formal understanding of agency

Under what circumstances can a system be said to have beliefs and goals, and how do such agency-related features relate to its physical state? Recent work has proposed a notion of interpretation map, a function that maps the state of a…

Artificial Intelligence · Computer Science 2025-07-14 Martin Biehl , Nathaniel Virgo

Multi-Environment POMDPs with Finite-Horizon Objectives

Partially Observable Markov Decision Processes (POMDPs) are systems in which one agent interacts with a stochastic environment, and receives only partial information about the current state. In a multi-environment POMDP (MEPOMDP), the…

Artificial Intelligence · Computer Science 2026-05-11 Léonard Brice , Filip Cano , Krishnendu Chatterjee , Thomas A. Henzinger , Stefanie Muroya

Revelations: A Decidable Class of POMDPs with Omega-Regular Objectives

Partially observable Markov decision processes (POMDPs) form a prominent model for uncertainty in sequential decision making. We are interested in constructing algorithms with theoretical guarantees to determine whether the agent has a…

Artificial Intelligence · Computer Science 2024-12-17 Marius Belly , Nathanaël Fijalkow , Hugo Gimbert , Florian Horn , Guillermo A. Pérez , Pierre Vandenhove

Qualitative Possibilistic Mixed-Observable MDPs

Possibilistic and qualitative POMDPs (pi-POMDPs) are counterparts of POMDPs used to model situations where the agent's initial belief or observation probabilities are imprecise due to lack of past experiences or insufficient data…

Artificial Intelligence · Computer Science 2013-09-27 Nicolas Drougard , Florent Teichteil-Konigsbuch , Jean-Loup Farges , Didier Dubois

Human-in-the-Loop Synthesis for Partially Observable Markov Decision Processes

We study planning problems where autonomous agents operate inside environments that are subject to uncertainties and not fully observable. Partially observable Markov decision processes (POMDPs) are a natural formal model to capture such…

Artificial Intelligence · Computer Science 2018-02-28 Steven Carr , Nils Jansen , Ralf Wimmer , Jie Fu , Ufuk Topcu

Point-Based Methods for Model Checking in Partially Observable Markov Decision Processes

Autonomous systems are often required to operate in partially observable environments. They must reliably execute a specified objective even with incomplete information about the state of the environment. We propose a methodology to…

Artificial Intelligence · Computer Science 2020-01-14 Maxime Bouton , Jana Tumova , Mykel J. Kochenderfer

A Model Approximation Scheme for Planning in Partially Observable Stochastic Domains

Partially observable Markov decision processes (POMDPs) are a natural model for planning problems where effects of actions are nondeterministic and the state of the world is not completely observable. It is difficult to solve POMDPs…

Artificial Intelligence · Computer Science 2009-09-25 N. L. Zhang , W. Liu

Solving POMDPs by Searching the Space of Finite Policies

Solving partially observable Markov decision processes (POMDPs) is highly intractable in general, at least in part because the optimal policy may be infinitely large. In this paper, we explore the problem of finding the optimal policy from…

Artificial Intelligence · Computer Science 2013-01-30 Nicolas Meuleau , Kee-Eung Kim , Leslie Pack Kaelbling , Anthony R. Cassandra

Finite-Memory Strategies in POMDPs with Long-Run Average Objectives

Partially observable Markov decision processes (POMDPs) are standard models for dynamic systems with probabilistic and nondeterministic behaviour in uncertain environments. We prove that in POMDPs with long-run average objective, the…

Computer Science and Game Theory · Computer Science 2022-09-29 Krishnendu Chatterjee , Raimundo Saona , Bruno Ziliotto

Finite Model Approximations for Partially Observed Markov Decision Processes with Discounted Cost

We consider finite model approximations of discrete-time partially observed Markov decision processes (POMDPs) under the discounted cost criterion. After converting the original partially observed stochastic control problem to a fully…

Systems and Control · Computer Science 2017-10-20 Naci Saldi , Serdar Yüksel , Tamás Linder

A Framework for Sequential Planning in Multi-Agent Settings

This paper extends the framework of partially observable Markov decision processes (POMDPs) to multi-agent settings by incorporating the notion of agent models into the state space. Agents maintain beliefs over physical states of the…

Artificial Intelligence · Computer Science 2011-09-13 P. Doshi , P. J. Gmytrasiewicz

Approximation Methods for Partially Observed Markov Decision Processes (POMDPs)

POMDPs are useful models for systems where the true underlying state is not known completely to an outside observer; the outside observer incompletely knows the true state of the system, and observes a noisy version of the true system…

Machine Learning · Computer Science 2021-09-01 Caleb M. Bowyer

Computing the Reachability Value of Posterior-Deterministic POMDPs

Partially observable Markov decision processes (POMDPs) are a fundamental model for sequential decision-making under uncertainty. However, many verification and synthesis problems for POMDPs are undecidable or intractable. Most prominently,…

Artificial Intelligence · Computer Science 2026-04-23 Nathanaël Fijalkow , Arka Ghosh , Roman Kniazev , Guillermo A. Pérez , Pierre Vandenhove