English
Related papers

Related papers: Technical Report: The Policy Graph Improvement Alg…

200 papers

There is much interest in using partially observable Markov decision processes (POMDPs) as a formal model for planning in stochastic domains. This paper is concerned with finding optimal policies for POMDPs. We propose several improvements…

Artificial Intelligence · Computer Science 2013-02-01 Nevin Lianwen Zhang , Stephen S. Lee

Most exact algorithms for general partially observable Markov decision processes (POMDPs) use a form of dynamic programming in which a piecewise-linear and convex representation of one value function is transformed into another. We examine…

Artificial Intelligence · Computer Science 2013-02-08 Anthony R. Cassandra , Michael L. Littman , Nevin Lianwen Zhang

Partially observable Markov decision processes (POMDPs) are a natural model for planning problems where effects of actions are nondeterministic and the state of the world is not completely observable. It is difficult to solve POMDPs…

Artificial Intelligence · Computer Science 2009-09-25 N. L. Zhang , W. Liu

This article provides an introductory tutorial on structural results in partially observed Markov decision processes (POMDPs). Typically, computing the optimal policy of a POMDP is computationally intractable. We use lattice program- ming…

Optimization and Control · Mathematics 2015-12-15 Vikram Krishnamurthy

Solving partially observable Markov decision processes (POMDPs) is highly intractable in general, at least in part because the optimal policy may be infinitely large. In this paper, we explore the problem of finding the optimal policy from…

Artificial Intelligence · Computer Science 2013-01-30 Nicolas Meuleau , Kee-Eung Kim , Leslie Pack Kaelbling , Anthony R. Cassandra

Partially observable Markov decision processes (POMDPs) provide an elegant mathematical framework for modeling complex decision and planning problems in stochastic domains in which states of the system are observable only indirectly, via a…

Artificial Intelligence · Computer Science 2011-06-02 M. Hauskrecht

This paper addresses the problem of optimal control of robotic sensing systems aimed at autonomous information gathering in scenarios such as environmental monitoring, search and rescue, and surveillance and reconnaissance. The information…

Systems and Control · Computer Science 2016-01-28 Mikko Lauri , Nikolay Atanasov , George J. Pappas , Risto Ritala

It is well known that for any finite state Markov decision process (MDP) there is a memoryless deterministic policy that maximizes the expected reward. For partially observable Markov decision processes (POMDPs), optimal memoryless policies…

Optimization and Control · Mathematics 2016-02-16 Guido Montufar , Keyan Ghazi-Zahedi , Nihat Ay

We consider partially observable Markov decision processes (POMDPs) with a set of target states and positive integer costs associated with every transition. The traditional optimization objective (stochastic shortest path) asks to minimize…

Artificial Intelligence · Computer Science 2016-05-12 Tomáš Brázdil , Krishnendu Chatterjee , Martin Chmelík , Anchit Gupta , Petr Novotný

We consider the problem of finding the best memoryless stochastic policy for an infinite-horizon partially observable Markov decision process (POMDP) with finite state and action spaces with respect to either the discounted or mean reward…

Optimization and Control · Mathematics 2022-05-02 Johannes Müller , Guido Montúfar

Currently, large partially observable Markov decision processes (POMDPs) are often solved by sampling-based online methods which interleave planning and execution phases. However, a pre-computed offline policy is more desirable in POMDP…

Artificial Intelligence · Computer Science 2025-07-29 Yang You , Vincent Thomas , Alex Schutz , Robert Skilton , Nick Hawes , Olivier Buffet

We present a technique for speeding up the convergence of value iteration for partially observable Markov decisions processes (POMDPs). The underlying idea is similar to that behind modified policy iteration for fully observable Markov…

Artificial Intelligence · Computer Science 2013-01-30 Nevin Lianwen Zhang , Stephen S. Lee , Weihong Zhang

In this paper, we propose a policy gradient method for confounded partially observable Markov decision processes (POMDPs) with continuous state and observation spaces in the offline setting. We first establish a novel identification result…

Machine Learning · Statistics 2023-12-04 Mao Hong , Zhengling Qi , Yanxun Xu

Partially observable Markov decision processes (POMDPs) have recently become popular among many AI researchers because they serve as a natural model for planning under uncertainty. Value iteration is a well-known algorithm for finding…

Artificial Intelligence · Computer Science 2011-06-02 N. L. Zhang , W. Zhang

The synthesis problem for partially observable Markov decision processes (POMDPs) is to compute a policy that satisfies a given specification. Such policies have to take the full execution history of a POMDP into account, rendering the…

Artificial Intelligence · Computer Science 2020-07-20 Leonore Winterer , Ralf Wimmer , Nils Jansen , Bernd Becker

Strategies for partially observable Markov decision processes (POMDP) typically require memory. One way to represent this memory is via automata. We present a method to learn an automaton representation of a strategy using a modification of…

Artificial Intelligence · Computer Science 2024-10-03 Alexander Bork , Debraj Chakraborty , Kush Grover , Jan Kretinsky , Stefanie Mohr

This paper introduces algorithms for problems where a decision maker has to control a system composed of several components and has access to only partial information on the state of each component. Such problems are difficult because of…

Optimization and Control · Mathematics 2020-12-25 Victor Cohen , Axel Parmentier

Planning under uncertainty is critical to robotics. The Partially Observable Markov Decision Process (POMDP) is a mathematical framework for such planning problems. It is powerful due to its careful quantification of the non-deterministic…

Robotics · Computer Science 2021-07-19 Hanna Kurniawati

Partially Observable Markov Decision Processes (POMDPs) provide an efficient way to model real-world sequential decision making processes. Motivated by the problem of maintenance and inspection of a group of infrastructure components with…

Optimization and Control · Mathematics 2024-08-15 Manav Vora , Pranay Thangeda , Michael N. Grussing , Melkior Ornik

In many practical settings control decisions must be made under partial/imperfect information about the evolution of a relevant state variable. Partially Observable Markov Decision Processes (POMDPs) is a relatively well-developed framework…

Machine Learning · Computer Science 2021-12-30 Yanling Chang , Alfredo Garcia , Zhide Wang , Lu Sun
‹ Prev 1 2 3 10 Next ›