Related papers: Efficient Solution Algorithms for Factored MDPs

200 papers

Although many real-world stochastic planning problems are more naturally formulated by hybrid models with both discrete and continuous variables, current state-of-the-art methods cannot adequately address these problems. We present the…

Artificial Intelligence · Computer Science 2012-07-19 Carlos E. Guestrin, Milos Hauskrecht, Branislav Kveton

Factored Markov decision processes (MDPs) are a prominent paradigm within the artificial intelligence community for modeling and solving large-scale MDPs whose rewards and dynamics decompose into smaller, loosely interacting components.…

Optimization and Control · Mathematics 2024-04-03 Huikang Liu, Wolfram Wiesemann, Man-Chung Yue

Large-scale Markov decision processes (MDPs) require planning algorithms with runtime independent of the number of states of the MDP. We consider the planning problem in MDPs using linear value function approximation with only weak…

Machine Learning · Computer Science 2020-07-14 Roshan Shariff, Csaba Szepesvári

We present a method for solving implicit (factored) Markov decision processes (MDPs) with very large state spaces. We introduce a property of state space partitions which we call epsilon-homogeneity. Intuitively, an epsilon-homogeneous…

Artificial Intelligence · Computer Science 2013-02-08 Thomas L. Dean, Robert Givan, Sonia Leach

Approximate linear programming (ALP) is an efficient approach to solving large factored Markov decision processes (MDPs). The main idea of the method is to approximate the optimal value function by a set of basis functions and optimize…

Artificial Intelligence · Computer Science 2012-06-18 Branislav Kveton, Milos Hauskrecht

Markov decision processes (MDPs) are becoming increasingly popular as models of decision-theoretic planning. While traditional dynamic programming methods perform well for problems with small state spaces, structured methods are needed for…

Artificial Intelligence · Computer Science 2013-01-30 Jesse Hoey, Robert St-Aubin, Alan Hu, Craig Boutilier

We consider deterministic Markov decision processes (MDPs) and apply max-plus algebra tools to approximate the value iteration algorithm by a smaller-dimensional iteration based on a representation on dictionaries of value functions. The…

Machine Learning · Computer Science 2019-06-21 Francis Bach

Efficient representations and solutions for large decision problems with continuous and discrete variables are among the most important challenges faced by the designers of automated decision support systems. In this paper, we describe a…

Artificial Intelligence · Computer Science 2011-10-04 C. Guestrin, M. Hauskrecht, B. Kveton

Recent research in decision theoretic planning has focussed on making the solution of Markov decision processes (MDPs) more feasible. We develop a family of algorithms for structured reachability analysis of MDPs that are suitable when an…

Artificial Intelligence · Computer Science 2013-04-24 Craig Boutilier, Ronen I. Brafman, Christopher W. Geib

Many large MDPs can be represented compactly using a dynamic Bayesian network. Although the structure of the value function does not retain the structure of the process, recent work has shown that value functions in factored MDPs can often…

Artificial Intelligence · Computer Science 2013-01-18 Daphne Koller, Ron Parr

We consider the problem of controlling a fully specified Markov decision process (MDP), also known as the planning problem, when the state space is very large and calculating the optimal policy is intractable. Instead, we pursue the more…

Optimization and Control · Mathematics 2019-01-09 Yasin Abbasi-Yadkori, Peter L. Bartlett, Xi Chen, Alan Malek

We consider large-scale Markov decision processes (MDPs) with a risk measure of variability in cost, under the risk-aware MDPs paradigm. Previous studies showed that risk-aware MDPs, based on a minimax approach to handling risk, can be…

Systems and Control · Computer Science 2017-05-17 Pengqian Yu, William B. Haskell, Huan Xu

Many exact and approximate solution methods for Markov Decision Processes (MDPs) attempt to exploit structure in the problem and are based on factorization of the value function. Especially multiagent settings, however, are known to suffer…

Artificial Intelligence · Computer Science 2016-02-23 Philipp Robbel, Frans A. Oliehoek, Mykel J. Kochenderfer

We consider large-scale Markov decision processes (MDPs) with parameter uncertainty, under the robust MDP paradigm. Previous studies showed that robust MDPs, based on a minimax approach to handle uncertainty, can be solved using dynamic…

Machine Learning · Computer Science 2013-06-27 Aviv Tamar, Huan Xu, Shie Mannor

Robust Markov decision processes (r-MDPs) extend MDPs by explicitly modelling epistemic uncertainty about transition dynamics. Learning r-MDPs from interactions with an unknown environment enables the synthesis of robust policies with…

Machine Learning · Computer Science 2025-11-21 Yannik Schnitzer, Alessandro Abate, David Parker

Software-intensive systems, such as software product lines and robotics, utilise Markov decision processes (MDPs) to capture uncertainty and analyse sequential decision-making problems. Despite the usefulness of conventional policy…

Artificial Intelligence · Computer Science 2026-05-01 Alexandros Evangelidis, Gricel Vázquez, Simos Gerasimou

The Bayes-Adaptive Markov Decision Process (BAMDP) formalism pursues the Bayes-optimal solution to the exploration-exploitation trade-off in reinforcement learning. As the computation of exact solutions to Bayesian reinforcement-learning…

Machine Learning · Computer Science 2022-11-01 Dilip Arumugam, Satinder Singh

This paper presents two new approaches to decomposing and solving large Markov decision problems (MDPs), a partial decoupling method and a complete decoupling method. In these approaches, a large, stochastic decision problem is divided into…

Artificial Intelligence · Computer Science 2013-02-01 Ron Parr

Markov Decision Processes (MDPs) are a useful framework for casting optimal sequential decision-making problems. Given any MDP, the aim is to find the optimal action selection mechanism, i.e., the optimal policy. Typically, the optimal policy…

Systems and Control · Computer Science 2014-03-18 Chandrashekar Lakshminarayanan, Shalabh Bhatnagar

The key assumption underlying linear Markov Decision Processes (MDPs) is that the learner has access to a known feature map $\phi(x, a)$ that maps state-action pairs to $d$-dimensional vectors, and that the rewards and transitions are…

Machine Learning · Computer Science 2023-09-20 Noah Golowich, Ankur Moitra, Dhruv Rohatgi