Related papers: Dynamic Programming for Structured Continuous Mark…

Dynamic Programming for POMDP with Jointly Discrete and Continuous State-Spaces

In this work, we study dynamic programming (DP) algorithms for partially observable Markov decision processes with jointly continuous and discrete state-spaces. We consider a class of stochastic systems which have coupled discrete and…

Optimization and Control · Mathematics 2019-03-07 Donghwan Lee , Niao He , Jianghai Hu

Structural Results for Partially Observed Markov Decision Processes

This article provides an introductory tutorial on structural results in partially observed Markov decision processes (POMDPs). Typically, computing the optimal policy of a POMDP is computationally intractable. We use lattice program- ming…

Optimization and Control · Mathematics 2015-12-15 Vikram Krishnamurthy

SPUDD: Stochastic Planning using Decision Diagrams

Markov decisions processes (MDPs) are becoming increasing popular as models of decision theoretic planning. While traditional dynamic programming methods perform well for problems with small state spaces, structured methods are needed for…

Artificial Intelligence · Computer Science 2013-01-30 Jesse Hoey , Robert St-Aubin , Alan Hu , Craig Boutilier

Dynamic programming with incomplete information to overcome navigational uncertainty in a nautical environment

Using a novel toy nautical navigation environment, we show that dynamic programming can be used when only incomplete information about a partially observed Markov decision process (POMDP) is known. By incorporating uncertainty into our…

Optimization and Control · Mathematics 2022-07-20 Chris Beeler , Xinkai Li , Colin Bellinger , Mark Crowley , Maia Fraser , Isaac Tamblyn

Symbolic Dynamic Programming for Discrete and Continuous State MDPs

Many real-world decision-theoretic planning problems can be naturally modeled with discrete and continuous state Markov decision processes (DC-MDPs). While previous work has addressed automated decision-theoretic planning for DCMDPs,…

Artificial Intelligence · Computer Science 2012-02-20 Scott Sanner , Karina Valdivia Delgado , Leliane Nunes de Barros

Planning with Partially Observable Markov Decision Processes: Advances in Exact Solution Method

There is much interest in using partially observable Markov decision processes (POMDPs) as a formal model for planning in stochastic domains. This paper is concerned with finding optimal policies for POMDPs. We propose several improvements…

Artificial Intelligence · Computer Science 2013-02-01 Nevin Lianwen Zhang , Stephen S. Lee

Semilinear Dynamic Programming: Analysis, Algorithms, and Certainty Equivalence Properties

We consider a broad class of dynamic programming (DP) problems that involve a partially linear structure and some positivity properties in their system equation and cost function. We address deterministic and stochastic problems, possibly…

Optimization and Control · Mathematics 2026-04-21 Yuchao Li , Dimitri Bertsekas

Efficient Solution Algorithms for Factored MDPs

This paper addresses the problem of planning under uncertainty in large Markov Decision Processes (MDPs). Factored MDPs represent a complex state space using state variables and the transition model using a dynamic Bayesian network. This…

Artificial Intelligence · Computer Science 2011-06-10 C. Guestrin , D. Koller , R. Parr , S. Venkataraman

Incremental Pruning: A Simple, Fast, Exact Method for Partially Observable Markov Decision Processes

Most exact algorithms for general partially observable Markov decision processes (POMDPs) use a form of dynamic programming in which a piecewise-linear and convex representation of one value function is transformed into another. We examine…

Artificial Intelligence · Computer Science 2013-02-08 Anthony R. Cassandra , Michael L. Littman , Nevin Lianwen Zhang

Cost-Bounded Active Classification Using Partially Observable Markov Decision Processes

Active classification, i.e., the sequential decision-making process aimed at data acquisition for classification purposes, arises naturally in many applications, including medical diagnosis, intrusion detection, and object tracking. In this…

Systems and Control · Computer Science 2018-10-02 Bo Wu , Mohamadreza Ahmadi , Suda Bharadwaj , Ufuk Topcu

Structural Estimation of Partially Observable Markov Decision Processes

In many practical settings control decisions must be made under partial/imperfect information about the evolution of a relevant state variable. Partially Observable Markov Decision Processes (POMDPs) is a relatively well-developed framework…

Machine Learning · Computer Science 2021-12-30 Yanling Chang , Alfredo Garcia , Zhide Wang , Lu Sun

PODDP: Partially Observable Differential Dynamic Programming for Latent Belief Space Planning

Autonomous agents are limited in their ability to observe the world state. Partially observable Markov decision processes (POMDPs) formally model the problem of planning under world state uncertainty, but POMDPs with continuous actions and…

Robotics · Computer Science 2020-07-08 Dicong Qiu , Yibiao Zhao , Chris L. Baker

Solving POMDPs by Searching the Space of Finite Policies

Solving partially observable Markov decision processes (POMDPs) is highly intractable in general, at least in part because the optimal policy may be infinitely large. In this paper, we explore the problem of finding the optimal policy from…

Artificial Intelligence · Computer Science 2013-01-30 Nicolas Meuleau , Kee-Eung Kim , Leslie Pack Kaelbling , Anthony R. Cassandra

Experimental design for Partially Observed Markov Decision Processes

This paper deals with the question of how to most effectively conduct experiments in Partially Observed Markov Decision Processes so as to provide data that is most informative about a parameter of interest. Methods from Markov decision…

Other Statistics · Statistics 2018-01-31 Leifur Thorbergsson , Giles Hooker

Autonomous sPOMDP Environment Modeling With Partial Model Exploitation

A state space representation of an environment is a classic and yet powerful tool used by many autonomous robotic systems for efficient and often optimal solution planning. However, designing these representations with high performance is…

Machine Learning · Computer Science 2020-12-23 Andrew Wilhelm , Aaron Wilhelm , Garrett Fosdick

Solving Factored MDPs with Continuous and Discrete Variables

Although many real-world stochastic planning problems are more naturally formulated by hybrid models with both discrete and continuous variables, current state-of-the-art methods cannot adequately address these problems. We present the…

Artificial Intelligence · Computer Science 2012-07-19 Carlos E. Guestrin , Milos Hauskrecht , Branislav Kveton

Constrained Active Classification Using Partially Observable Markov Decision Processes

In this work, we study the problem of actively classifying the attributes of dynamical systems characterized as a finite set of Markov decision process (MDP) models. We are interested in finding strategies that actively interact with the…

Systems and Control · Electrical Eng. & Systems 2023-01-06 Bo Wu , Niklas Lauffer , Mohamadreza Ahmadi , Suda Bharadwaj , Zhe Xu , Ufuk Topcu

Decision-Theoretic Planning: Structural Assumptions and Computational Leverage

Planning under uncertainty is a central problem in the study of automated sequential decision making, and has been addressed by researchers in many different fields, including AI planning, decision analysis, operations research, control…

Artificial Intelligence · Computer Science 2011-05-30 C. Boutilier , T. Dean , S. Hanks

Linear Programming for Large-Scale Markov Decision Problems

We consider the problem of controlling a Markov decision process (MDP) with a large state space, so as to minimize average cost. Since it is intractable to compete with the optimal policy for large scale problems, we pursue the more modest…

Optimization and Control · Mathematics 2014-02-28 Yasin Abbasi-Yadkori , Peter L. Bartlett , Alan Malek

Finite-Horizon Markov Decision Processes with Sequentially-Observed Transitions

Markov Decision Processes (MDPs) have been used to formulate many decision-making problems in science and engineering. The objective is to synthesize the best decision (action selection) policies to maximize expected rewards (or minimize…

Optimization and Control · Mathematics 2015-07-07 Mahmoud El Chamie , Behcet Acikmese