English
Related papers

Related papers: Point-Based POMDP Algorithms: Improved Analysis an…

200 papers

Partially Observable Markov Decision Processes (POMDPs) are powerful models for sequential decision making under transition and observation uncertainties. This paper studies the challenging yet important problem in POMDPs known as the…

Artificial Intelligence · Computer Science 2024-06-06 Qi Heng Ho , Martin S. Feather , Federico Rossi , Zachary N. Sunberg , Morteza Lahijanian

Most algorithms for solving POMDPs iteratively improve a value function that implicitly represents a policy and are said to search in value function space. This paper presents an approach to solving POMDPs that represents a policy…

Artificial Intelligence · Computer Science 2013-02-01 Eric A. Hansen

Value iteration is a popular algorithm for finding near optimal policies for POMDPs. It is inefficient due to the need to account for the entire belief space, which necessitates the solution of large numbers of linear programs. In this…

Artificial Intelligence · Computer Science 2011-07-04 N. L. Zhang , W. Zhang

We propose a new point-based method for approximate planning in Dec-POMDP which outperforms the state-of-the-art approaches in terms of solution quality. It uses a heuristic estimation of the prior probability of beliefs to choose a bounded…

Artificial Intelligence · Computer Science 2012-03-19 Gabriel Corona , Francois Charpillet

Partially observable Markov decision processes (POMDPs) form an attractive and principled framework for agent planning under uncertainty. Point-based approximate techniques for POMDPs compute a policy based on a finite set of points…

Artificial Intelligence · Computer Science 2011-09-13 M. T. J. Spaan , N. Vlassis

Solving partially observable Markov decision processes (POMDPs) typically requires reasoning about the values of exponentially many state beliefs. Towards practical performance, state-of-the-art solvers use value bounds to guide this…

Artificial Intelligence · Computer Science 2025-02-11 Merlijn Krale , Wietze Koops , Sebastian Junges , Thiago D. Simão , Nils Jansen

We consider the problem of finding good finite-horizon policies for POMDPs under the expected reward metric. The policies considered are {em free finite-memory policies with limited memory}; a policy is a mapping from the space of…

Artificial Intelligence · Computer Science 2013-01-30 Christopher Lusena , Tong Li , Shelia Sittinger , Chris Wells , Judy Goldsmith

We present a novel POMDP planning algorithm called heuristic search value iteration (HSVI).HSVI is an anytime algorithm that returns a policy and a provable bound on its regret with respect to the optimal policy. HSVI gets its power by…

Artificial Intelligence · Computer Science 2012-07-19 Trey Smith , Reid Simmons

This paper introduces algorithms for problems where a decision maker has to control a system composed of several components and has access to only partial information on the state of each component. Such problems are difficult because of…

Optimization and Control · Mathematics 2020-12-25 Victor Cohen , Axel Parmentier

This paper addresses the problem of optimal control of robotic sensing systems aimed at autonomous information gathering in scenarios such as environmental monitoring, search and rescue, and surveillance and reconnaissance. The information…

Systems and Control · Computer Science 2016-01-28 Mikko Lauri , Nikolay Atanasov , George J. Pappas , Risto Ritala

Partially observable Markov decision processes (POMDPs) provide an elegant mathematical framework for modeling complex decision and planning problems in stochastic domains in which states of the system are observable only indirectly, via a…

Artificial Intelligence · Computer Science 2011-06-02 M. Hauskrecht

We consider the problem of finding the best memoryless stochastic policy for an infinite-horizon partially observable Markov decision process (POMDP) with finite state and action spaces with respect to either the discounted or mean reward…

Optimization and Control · Mathematics 2022-05-02 Johannes Müller , Guido Montúfar

Partially Observable Markov Decision Processes (POMDPs) provide a rich framework for sequential decision-making under uncertainty in stochastic domains. However, solving a POMDP is often intractable except for small problems due to their…

Artificial Intelligence · Computer Science 2014-01-16 Stéphane Ross , Joelle Pineau , Sébastien Paquet , Brahim Chaib-draa

We present a general framework for applying learning algorithms and heuristical guidance to the verification of Markov decision processes (MDPs). The primary goal of our techniques is to improve performance by avoiding an exhaustive…

Decision-making problems in uncertain or stochastic domains are often formulated as Markov decision processes (MDPs). Policy iteration (PI) is a popular algorithm for searching over policy-space, the size of which is exponential in the…

Artificial Intelligence · Computer Science 2013-01-30 Yishay Mansour , Satinder Singh

We present a major improvement to the incremental pruning algorithm for solving partially observable Markov decision processes. Our technique targets the cross-sum step of the dynamic programming (DP) update, a key source of complexity in…

Artificial Intelligence · Computer Science 2012-07-19 Zhengzhu Feng , Shlomo Zilberstein

In this paper, we propose a new lower approximation scheme for POMDP with discounted and average cost criterion. The approximating functions are determined by their values at a finite number of belief points, and can be computed efficiently…

Artificial Intelligence · Computer Science 2012-07-19 Huizhen Yu , Dimitri Bertsekas

We consider a partially observable Markov decision problem (POMDP) that models a class of sequencing problems. Although POMDPs are typically intractable, our formulation admits tractable solution. Instead of maintaining a value function…

Artificial Intelligence · Computer Science 2013-01-14 Paat Rusmevichientong , Benjamin van Roy

Although risk awareness is fundamental to an online operating agent, it has received less attention in the challenging continuous domain and under partial observability. This paper presents a novel formulation and solution for risk-averse…

Artificial Intelligence · Computer Science 2023-02-22 Andrey Zhitnikov , Vadim Indelman

Integration is affected by the curse of dimensionality and quickly becomes intractable as the dimensionality of the problem grows. We propose a randomized algorithm that, with high probability, gives a constant-factor approximation of a…

Machine Learning · Computer Science 2013-02-28 Stefano Ermon , Carla P. Gomes , Ashish Sabharwal , Bart Selman
‹ Prev 1 2 3 10 Next ›