Related papers: Sequential Stochastic Optimization in Separable Le…

A Possibilistic Model for Qualitative Sequential Decision Problems under Uncertainty in Partially Observable Environments

In this article we propose a qualitative (ordinal) counterpart for the Partially Observable Markov Decision Processes model (POMDP) in which the uncertainty, as well as the preferences of the agent, are modeled by possibility distributions.…

Artificial Intelligence · Computer Science 2013-01-30 Regis Sabbadin

Partially Observable Markov Decision Processes (POMDPs) and Robotics

Planning under uncertainty is critical to robotics. The Partially Observable Markov Decision Process (POMDP) is a mathematical framework for such planning problems. It is powerful due to its careful quantification of the non-deterministic…

Robotics · Computer Science 2021-07-19 Hanna Kurniawati

A Tractable POMDP for a Class of Sequencing Problems

We consider a partially observable Markov decision problem (POMDP) that models a class of sequencing problems. Although POMDPs are typically intractable, our formulation admits tractable solution. Instead of maintaining a value function…

Artificial Intelligence · Computer Science 2013-01-14 Paat Rusmevichientong , Benjamin van Roy

Linear Programming for Decision Processes with Partial Information

Markov Decision Processes (MDPs) are stochastic optimization problems that model situations where a decision maker controls a system based on its state. Partially observed Markov decision processes (POMDPs) are generalizations of MDPs where…

Optimization and Control · Mathematics 2019-03-26 Victor Cohen , Axel Parmentier

POMDPs in Continuous Time and Discrete Spaces

Many processes, such as discrete event systems in engineering or population dynamics in biology, evolve in discrete space and continuous time. We consider the problem of optimal decision making in such discrete state and action space…

Machine Learning · Computer Science 2020-10-27 Bastian Alt , Matthias Schultheis , Heinz Koeppl

Toward Learning POMDPs Beyond Full-Rank Actions and State Observability

We are interested in enabling autonomous agents to learn and reason about systems with hidden states, such as locking mechanisms. We cast this problem as learning the parameters of a discrete Partially Observable Markov Decision Process…

Machine Learning · Computer Science 2026-02-04 Seiji Shaw , Travis Manderson , Chad Kessens , Nicholas Roy

Partially Observable Markov Decision Processes in Robotics: A Survey

Noisy sensing, imperfect control, and environment changes are defining characteristics of many real-world robot tasks. The partially observable Markov decision process (POMDP) provides a principled mathematical framework for modeling and…

Robotics · Computer Science 2022-09-22 Mikko Lauri , David Hsu , Joni Pajarinen

Partially Observable Markov Decision Process Framework for Operating Condition Optimization Using Real-Time Degradation Signals

In many engineering systems, proper predictive maintenance and operational control are essential to increase efficiency and reliability while reducing maintenance costs. However, one of the major challenges is that many sensors are used for…

Applications · Statistics 2025-12-09 Boyang Xu , Yunyi Kang , Xinyu Zhao , Hao Yan , Feng Ju

Value-Function Approximations for Partially Observable Markov Decision Processes

Partially observable Markov decision processes (POMDPs) provide an elegant mathematical framework for modeling complex decision and planning problems in stochastic domains in which states of the system are observable only indirectly, via a…

Artificial Intelligence · Computer Science 2011-06-02 M. Hauskrecht

Structural Estimation of Partially Observable Markov Decision Processes

In many practical settings control decisions must be made under partial/imperfect information about the evolution of a relevant state variable. Partially Observable Markov Decision Processes (POMDPs) is a relatively well-developed framework…

Machine Learning · Computer Science 2021-12-30 Yanling Chang , Alfredo Garcia , Zhide Wang , Lu Sun

PODDP: Partially Observable Differential Dynamic Programming for Latent Belief Space Planning

Autonomous agents are limited in their ability to observe the world state. Partially observable Markov decision processes (POMDPs) formally model the problem of planning under world state uncertainty, but POMDPs with continuous actions and…

Robotics · Computer Science 2020-07-08 Dicong Qiu , Yibiao Zhao , Chris L. Baker

Model-Based Learning of Near-Optimal Finite-Window Policies in POMDPs

We study model-based learning of finite-window policies in tabular partially observable Markov decision processes (POMDPs). A common approach to learning under partial observability is to approximate unbounded history dependencies using…

Machine Learning · Computer Science 2026-04-02 Philip Jordan , Maryam Kamgarpour

Learning in POMDPs is Sample-Efficient with Hindsight Observability

POMDPs capture a broad class of decision making problems, but hardness results suggest that learning is intractable even in simple settings due to the inherent partial observability. However, in many realistic problems, more information is…

Machine Learning · Computer Science 2023-02-07 Jonathan N. Lee , Alekh Agarwal , Christoph Dann , Tong Zhang

Entropy-Regularized Partially Observed Markov Decision Processes

We investigate partially observed Markov decision processes (POMDPs) with cost functions regularized by entropy terms describing state, observation, and control uncertainty. Standard POMDP techniques are shown to offer bounded-error…

Systems and Control · Electrical Eng. & Systems 2023-05-10 Timothy L. Molloy , Girish N. Nair

Finite Model Approximations for Partially Observed Markov Decision Processes with Discounted Cost

We consider finite model approximations of discrete-time partially observed Markov decision processes (POMDPs) under the discounted cost criterion. After converting the original partially observed stochastic control problem to a fully…

Systems and Control · Computer Science 2017-10-20 Naci Saldi , Serdar Yüksel , Tamás Linder

Near-Optimal Partially Observable Reinforcement Learning with Partial Online State Information

Partially observable Markov decision processes (POMDPs) are a general framework for sequential decision-making under latent state uncertainty, yet learning in POMDPs is intractable in the worst case. Motivated by sensing and probing…

Machine Learning · Computer Science 2026-01-27 Ming Shi , Yingbin Liang , Ness B. Shroff

Weathering Ongoing Uncertainty: Learning and Planning in a Time-Varying Partially Observable Environment

Optimal decision-making presents a significant challenge for autonomous systems operating in uncertain, stochastic and time-varying environments. Environmental variability over time can significantly impact the system's optimal decision…

Robotics · Computer Science 2024-03-11 Gokul Puthumanaillam , Xiangyu Liu , Negar Mehr , Melkior Ornik

Partially Observed Optimal Stochastic Control: Regularity, Optimality, Approximations, and Learning

In this review/tutorial article, we present recent progress on optimal control of partially observed Markov Decision Processes (POMDPs). We first present regularity and continuity conditions for POMDPs and their belief-MDP reductions, where…

Optimization and Control · Mathematics 2025-01-03 Ali Devran Kara , Serdar Yuksel

Provable Reinforcement Learning with a Short-Term Memory

Real-world sequential decision making problems commonly involve partial observability, which requires the agent to maintain a memory of history in order to infer the latent states, plan and make good decisions. Coping with partial…

Machine Learning · Computer Science 2022-02-09 Yonathan Efroni , Chi Jin , Akshay Krishnamurthy , Sobhan Miryoosefi

Sequential Monte Carlo for Policy Optimization in Continuous POMDPs

Optimal decision-making under partial observability requires agents to balance reducing uncertainty (exploration) against pursuing immediate objectives (exploitation). In this paper, we introduce a novel policy optimization framework for…

Machine Learning · Computer Science 2025-12-05 Hany Abdulsamad , Sahel Iqbal , Simo Särkkä