Related papers: Robust Phi-Divergence MDPs

200 papers

Robust Markov decision processes (MDPs) have attracted significant interest due to their ability to protect MDPs from poor out-of-sample performance in the presence of ambiguity. In contrast to classical MDPs, which account for…

Optimization and Control · Mathematics 2026-02-06 Chin Pang Ho, Marek Petrik, Wolfram Wiesemann

Markov decision processes (MDPs) are a standard model for sequential decision-making problems and are widely used across many scientific areas, including formal methods and artificial intelligence (AI). MDPs do, however, come with the…

Artificial Intelligence · Computer Science 2024-12-11 Marnix Suilen, Thom Badings, Eline M. Bovy, David Parker, Nils Jansen

Fueled by advances in both robust optimization theory and reinforcement learning (RL), robust Markov Decision Processes (RMDPs) have garnered increasing attention due to their powerful capability for sequential decision-making under…

Optimization and Control · Mathematics 2025-07-08 Wenfan Ou, Sheng Bi

Robust Markov Decision Processes (MDPs) are receiving much attention in learning a robust policy which is less sensitive to environment changes. There are an increasing number of works analyzing sample-efficiency of robust MDPs. However,…

Machine Learning · Statistics 2023-09-13 Wenhao Yang, Han Wang, Tadashi Kozuno, Scott M. Jordan, Zhihua Zhang

Markov decision processes (MDP) are a well-established model for sequential decision-making in the presence of probabilities. In robust MDP (RMDP), every action is associated with an uncertainty set of probability distributions, modelling…

Artificial Intelligence · Computer Science 2024-12-16 Tobias Meggendorfer, Maximilian Weininger, Patrick Wienhöft

The distributionally robust Markov Decision Process (MDP) approach asks for a distributionally robust policy that achieves the maximal expected total reward under the most adversarial distribution of uncertain parameters. In this paper, we…

Systems and Control · Computer Science 2018-10-10 Zhi Chen, Pengqian Yu, William B. Haskell

We present an efficient robust value iteration for s-rectangular robust Markov Decision Processes (MDPs) with a time complexity comparable to standard (non-robust) MDPs which is significantly faster than any existing method. We do…

Machine Learning · Computer Science 2023-02-01 Navdeep Kumar, Kfir Levy, Kaixin Wang, Shie Mannor

Robust Markov decision processes (MDPs) allow to compute reliable solutions for dynamic decision problems whose evolution is modeled by rewards and partially-known transition probabilities. Unfortunately, accounting for uncertainty in the…

Machine Learning · Computer Science 2020-06-18 Chin Pang Ho, Marek Petrik, Wolfram Wiesemann

The main goal of this paper is to discuss several approaches to formulation of distributionally robust counterparts of Markov Decision Processes, where the transition kernels are not specified exactly but rather are assumed to be elements…

Optimization and Control · Mathematics 2024-05-07 Yan Li, Alexander Shapiro

We study infinite-horizon robust Markov decision processes (MDPs) on continuous state spaces with structured rectangular ambiguity set. The proposed ambiguity set falls within the convex hull of unknown generating kernels. We utilize the…

Optimization and Control · Mathematics 2026-05-28 Mengmeng Li, Yifan Hu, Daniel Kuhn, Yan Li

Markov decision processes (MDPs) are formal models commonly used in sequential decision-making. MDPs capture the stochasticity that may arise, for instance, from imprecise actuators via probabilities in the transition function. However, in…

Artificial Intelligence · Computer Science 2023-06-21 Marnix Suilen, Thiago D. Simão, David Parker, Nils Jansen

Robust Markov decision processes (MDPs) are used for applications of dynamic optimization in uncertain environments and have been studied extensively. Many of the main properties and algorithms of MDPs, such as value iteration and policy…

Optimization and Control · Mathematics 2023-12-14 Julien Grand-Clément, Marek Petrik

Robust Markov decision processes (MDPs) aim to handle changing or partially known system dynamics. To solve them, one typically resorts to robust optimization methods. However, this significantly increases computational complexity and…

Machine Learning · Computer Science 2023-03-14 Esther Derman, Yevgeniy Men, Matthieu Geist, Shie Mannor

Robustness is important for sequential decision making in a stochastic dynamic environment with uncertain probabilistic parameters. We address the problem of using robust MDPs (RMDPs) to compute policies with provable worst-case guarantees…

Machine Learning · Computer Science 2018-11-16 Reazul Hasan Russel, Marek Petrik

We consider large-scale Markov decision processes (MDPs) with parameter uncertainty, under the robust MDP paradigm. Previous studies showed that robust MDPs, based on a minimax approach to handle uncertainty, can be solved using dynamic…

Machine Learning · Computer Science 2013-06-27 Aviv Tamar, Huan Xu, Shie Mannor

We study robust Markov decision processes (RMDPs) with non-rectangular uncertainty sets, which capture interdependencies across states unlike traditional rectangular models. While non-rectangular robust policy evaluation is generally…

Artificial Intelligence · Computer Science 2025-02-14 Navdeep Kumar, Adarsh Gupta, Maxence Mohamed Elfatihi, Giorgia Ramponi, Kfir Yehuda Levy, Shie Mannor

Robust Markov decision processes (MDPs) aim to handle changing or partially known system dynamics. To solve them, one typically resorts to robust optimization methods. However, this significantly increases computational complexity and…

Machine Learning · Computer Science 2021-10-14 Esther Derman, Matthieu Geist, Shie Mannor

Robust Markov Decision Processes (MDPs) are a powerful framework for modeling sequential decision-making problems with model uncertainty. This paper proposes the first first-order framework for solving robust MDPs. Our algorithm interleaves…

Optimization and Control · Mathematics 2021-01-18 Julien Grand-Clément, Christian Kroer

In this work, we consider an online robust Markov Decision Process (MDP) where we have the information of finitely many prototypes of the underlying transition kernel. We consider an adaptively updated ambiguity set of the prototypes and…

Machine Learning · Computer Science 2024-12-20 Shuo Sun, Meng Qi, Zuo-Jun Max Shen

In this paper we investigate the tractability of robust Markov Decision Processes (RMDPs) under various structural assumptions on the uncertainty set. Surprisingly, we show that in all generality (i.e. without any assumption on the…

Optimization and Control · Mathematics 2024-11-14 Julien Grand-Clément, Nian Si, Shengbo Wang