Related papers: Value-Directed Sampling Methods for POMDPs

Value-Directed Belief State Approximation for POMDPs

We consider the problem of belief-state monitoring for the purposes of implementing a policy for a partially-observable Markov decision process (POMDP), specifically how one might approximate the belief state. Other schemes for belief-state…

Artificial Intelligence · Computer Science 2013-01-18 Pascal Poupart, Craig Boutilier

Value-Function Approximations for Partially Observable Markov Decision Processes

Partially observable Markov decision processes (POMDPs) provide an elegant mathematical framework for modeling complex decision and planning problems in stochastic domains in which states of the system are observable only indirectly, via a…

Artificial Intelligence · Computer Science 2011-06-02 M. Hauskrecht

Observation Adaptation via Annealed Importance Resampling for Partially Observable Markov Decision Processes

Partially observable Markov decision processes (POMDPs) are a general mathematical model for sequential decision-making in stochastic environments under state uncertainty. POMDPs are often solved online, which enables the algorithm…

Artificial Intelligence · Computer Science 2025-03-26 Yunuo Zhang, Baiting Luo, Ayan Mukhopadhyay, Abhishek Dubey

Point-Based Methods for Model Checking in Partially Observable Markov Decision Processes

Autonomous systems are often required to operate in partially observable environments. They must reliably execute a specified objective even with incomplete information about the state of the environment. We propose a methodology to…

Artificial Intelligence · Computer Science 2020-01-14 Maxime Bouton, Jana Tumova, Mykel J. Kochenderfer

Optimality Guarantees for Particle Belief Approximation of POMDPs

Partially observable Markov decision processes (POMDPs) provide a flexible representation for real-world decision and control problems. However, POMDPs are notoriously difficult to solve, especially when the state and observation spaces are…

Artificial Intelligence · Computer Science 2023-10-20 Michael H. Lim, Tyler J. Becker, Mykel J. Kochenderfer, Claire J. Tomlin, Zachary N. Sunberg

Improving Training Result of Partially Observable Markov Decision Process by Filtering Beliefs

In this study I proposed a belief-filtering method for improving the performance of Partially Observable Markov Decision Processes (POMDPs), a method widely used in autonomous robots and many other domains concerning control policy. My…

Artificial Intelligence · Computer Science 2021-01-07 Oscar LiJen Hsu

Near Optimality of Finite Memory Feedback Policies in Partially Observed Markov Decision Processes

In the theory of Partially Observed Markov Decision Processes (POMDPs), existence of optimal policies has in general been established via converting the original partially observed stochastic control problem to a fully observed one on the…

Optimization and Control · Mathematics 2022-01-11 Ali Devran Kara, Serdar Yuksel

Vector-space Analysis of Belief-state Approximation for POMDPs

We propose a new approach to value-directed belief state approximation for POMDPs. The value-directed model allows one to choose approximation methods for belief state monitoring that have a small impact on decision quality. Using a vector…

Artificial Intelligence · Computer Science 2013-01-14 Pascal Poupart, Craig Boutilier

Partially Observed Optimal Stochastic Control: Regularity, Optimality, Approximations, and Learning

In this review/tutorial article, we present recent progress on optimal control of partially observed Markov Decision Processes (POMDPs). We first present regularity and continuity conditions for POMDPs and their belief-MDP reductions, where…

Optimization and Control · Mathematics 2025-01-03 Ali Devran Kara, Serdar Yuksel

Finite Model Approximations for Partially Observed Markov Decision Processes with Discounted Cost

We consider finite model approximations of discrete-time partially observed Markov decision processes (POMDPs) under the discounted cost criterion. After converting the original partially observed stochastic control problem to a fully…

Systems and Control · Computer Science 2017-10-20 Naci Saldi, Serdar Yüksel, Tamás Linder

Speeding Up the Convergence of Value Iteration in Partially Observable Markov Decision Processes

Partially observable Markov decision processes (POMDPs) have recently become popular among many AI researchers because they serve as a natural model for planning under uncertainty. Value iteration is a well-known algorithm for finding…

Artificial Intelligence · Computer Science 2011-06-02 N. L. Zhang, W. Zhang

Approximate Control for Continuous-Time POMDPs

This work proposes a decision-making framework for partially observable systems in continuous time with discrete state and action spaces. As optimal decision-making becomes intractable for large state spaces, we employ approximation methods…

Machine Learning · Computer Science 2024-03-01 Yannick Eich, Bastian Alt, Heinz Koeppl

Anytime Point-Based Approximations for Large POMDPs

The Partially Observable Markov Decision Process has long been recognized as a rich framework for real-world planning and control problems, especially in robotics. However, exact solutions in this framework are typically computationally…

Artificial Intelligence · Computer Science 2011-10-05 J. Pineau, G. Gordon, S. Thrun

Near Optimal Approximations and Finite Memory Policies for POMDPs with Continuous Spaces

We study an approximation method for partially observed Markov decision processes (POMDPs) with continuous spaces. Belief MDP reduction, which has been the standard approach to study POMDPs, requires rigorous approximation methods for…

Optimization and Control · Mathematics 2025-01-20 Ali Devran Kara, Erhan Bayraktar, Serdar Yuksel

Cost-Bounded Active Classification Using Partially Observable Markov Decision Processes

Active classification, i.e., the sequential decision-making process aimed at data acquisition for classification purposes, arises naturally in many applications, including medical diagnosis, intrusion detection, and object tracking. In this…

Systems and Control · Computer Science 2018-10-02 Bo Wu, Mohamadreza Ahmadi, Suda Bharadwaj, Ufuk Topcu

Perseus: Randomized Point-based Value Iteration for POMDPs

Partially observable Markov decision processes (POMDPs) form an attractive and principled framework for agent planning under uncertainty. Point-based approximate techniques for POMDPs compute a policy based on a finite set of points…

Artificial Intelligence · Computer Science 2011-09-13 M. T. J. Spaan, N. Vlassis

Semantic-aware Sampling and Transmission in Energy Harvesting Systems: A POMDP Approach

We address the problem of real-time remote tracking of a partially observable Markov source in an energy harvesting system with an unreliable communication channel. We consider both sampling and transmission costs. Different from most prior…

Signal Processing · Electrical Eng. & Systems 2024-10-07 Abolfazl Zakeri, Mohammad Moltafet, Marian Codreanu

Myopic Policy Bounds for Information Acquisition POMDPs

This paper addresses the problem of optimal control of robotic sensing systems aimed at autonomous information gathering in scenarios such as environmental monitoring, search and rescue, and surveillance and reconnaissance. The information…

Systems and Control · Computer Science 2016-01-28 Mikko Lauri, Nikolay Atanasov, George J. Pappas, Risto Ritala

Constrained Active Classification Using Partially Observable Markov Decision Processes

In this work, we study the problem of actively classifying the attributes of dynamical systems characterized as a finite set of Markov decision process (MDP) models. We are interested in finding strategies that actively interact with the…

Systems and Control · Electrical Eng. & Systems 2023-01-06 Bo Wu, Niklas Lauffer, Mohamadreza Ahmadi, Suda Bharadwaj, Zhe Xu, Ufuk Topcu

Model-Based Learning of Near-Optimal Finite-Window Policies in POMDPs

We study model-based learning of finite-window policies in tabular partially observable Markov decision processes (POMDPs). A common approach to learning under partial observability is to approximate unbounded history dependencies using…

Machine Learning · Computer Science 2026-04-02 Philip Jordan, Maryam Kamgarpour