Related papers: Robust Markov Decision Process: Beyond Rectangular…

Robust Anytime Learning of Markov Decision Processes

Markov decision processes (MDPs) are formal models commonly used in sequential decision-making. MDPs capture the stochasticity that may arise, for instance, from imprecise actuators via probabilities in the transition function. However, in…

Artificial Intelligence · Computer Science 2023-06-21 Marnix Suilen , Thiago D. Simão , David Parker , Nils Jansen

Solving Robust Markov Decision Processes: Generic, Reliable, Efficient

Markov decision processes (MDP) are a well-established model for sequential decision-making in the presence of probabilities. In robust MDP (RMDP), every action is associated with an uncertainty set of probability distributions, modelling…

Artificial Intelligence · Computer Science 2024-12-16 Tobias Meggendorfer , Maximilian Weininger , Patrick Wienhöft

Distributionally Robust Optimization for Sequential Decision Making

The distributionally robust Markov Decision Process (MDP) approach asks for a distributionally robust policy that achieves the maximal expected total reward under the most adversarial distribution of uncertain parameters. In this paper, we…

Systems and Control · Computer Science 2018-10-10 Zhi Chen , Pengqian Yu , William B. Haskell

Robust Markov Decision Processes on Continuous State Spaces

We study infinite-horizon robust Markov decision processes (MDPs) on continuous state spaces with structured rectangular ambiguity set. The proposed ambiguity set falls within the convex hull of unknown generating kernels. We utilize the…

Optimization and Control · Mathematics 2026-05-28 Mengmeng Li , Yifan Hu , Daniel Kuhn , Yan Li

Distributionally Robust Counterpart in Markov Decision Processes

This paper studies Markov Decision Processes under parameter uncertainty. We adapt the distributionally robust optimization framework, and assume that the uncertain parameters are random variables following an unknown distribution, and…

Systems and Control · Computer Science 2015-05-14 Pengqian Yu , Huan Xu

Robust Markov Decision Processes: A Place Where AI and Formal Methods Meet

Markov decision processes (MDPs) are a standard model for sequential decision-making problems and are widely used across many scientific areas, including formal methods and artificial intelligence (AI). MDPs do, however, come with the…

Artificial Intelligence · Computer Science 2024-12-11 Marnix Suilen , Thom Badings , Eline M. Bovy , David Parker , Nils Jansen

Robust Markov Decision Processes without Model Estimation

Robust Markov Decision Processes (MDPs) are receiving much attention in learning a robust policy which is less sensitive to environment changes. There are an increasing number of works analyzing sample-efficiency of robust MDPs. However,…

Machine Learning · Statistics 2023-09-13 Wenhao Yang , Han Wang , Tadashi Kozuno , Scott M. Jordan , Zhihua Zhang

Sequential Decision-Making under Uncertainty: A Robust MDPs review

Fueled by advances in both robust optimization theory and reinforcement learning (RL), robust Markov Decision Processes (RMDPs) have garnered increasing attention due to their powerful capability for sequential decision-making under…

Optimization and Control · Mathematics 2025-07-08 Wenfan Ou , Sheng Bi

Learning Robust Policies for Uncertain Parametric Markov Decision Processes

Synthesising verifiably correct controllers for dynamical systems is crucial for safety-critical problems. To achieve this, it is important to account for uncertainty in a robust manner, while at the same time it is often of interest to…

Systems and Control · Electrical Eng. & Systems 2024-05-16 Luke Rickard , Alessandro Abate , Kostas Margellos

Robust Deterministic Policies for Markov Decision Processes under Budgeted Uncertainty

This paper studies the computation of robust deterministic policies for Markov Decision Processes (MDPs) in the Lightning Does Not Strike Twice (LDST) model of Mannor, Mebel and Xu (ICML '12). In this model, designed to provide robustness…

Optimization and Control · Mathematics 2024-12-18 Fei Wu , Erik Demeulemeester , Jannik Matuschke

Robust Entropy-regularized Markov Decision Processes

Stochastic and soft optimal policies resulting from entropy-regularized Markov decision processes (ER-MDP) are desirable for exploration and imitation learning applications. Motivated by the fact that such policies are sensitive with…

Machine Learning · Computer Science 2022-01-03 Tien Mai , Patrick Jaillet

Best-Effort Policies for Robust Markov Decision Processes

We study the common generalization of Markov decision processes (MDPs) with sets of transition probabilities, known as robust MDPs (RMDPs). A standard goal in RMDPs is to compute a policy that maximizes the expected return under an…

Artificial Intelligence · Computer Science 2025-11-20 Alessandro Abate , Thom Badings , Giuseppe De Giacomo , Francesco Fabiano

Efficient Solution and Learning of Robust Factored MDPs

Robust Markov decision processes (r-MDPs) extend MDPs by explicitly modelling epistemic uncertainty about transition dynamics. Learning r-MDPs from interactions with an unknown environment enables the synthesis of robust policies with…

Machine Learning · Computer Science 2025-11-21 Yannik Schnitzer , Alessandro Abate , David Parker

Policy Gradient Algorithms for Robust MDPs with Non-Rectangular Uncertainty Sets

We propose policy gradient algorithms for robust infinite-horizon Markov decision processes (MDPs) with non-rectangular uncertainty sets, thereby addressing an open challenge in the robust MDP literature. Indeed, uncertainty sets that…

Optimization and Control · Mathematics 2025-09-30 Mengmeng Li , Daniel Kuhn , Tobias Sutter

Robustness to Modeling Errors in Risk-Sensitive Markov Decision Problems with Markov Risk Measures

We consider risk-sensitive Markov decision processes (MDPs), where the MDP model is influenced by a parameter which takes values in a compact metric space. We identify sufficient conditions under which small perturbations in the model…

Optimization and Control · Mathematics 2022-09-28 Shiping Shao , Abhishek Gupta , William B. Haskell

Robust Counterfactual Inference in Markov Decision Processes

This paper addresses a key limitation in existing counterfactual inference methods for Markov Decision Processes (MDPs). Current approaches assume a specific causal model to make counterfactuals identifiable. However, there are usually many…

Artificial Intelligence · Computer Science 2026-05-25 Jessica Lally , Milad Kazemi , Nicola Paoletti

Online MDP with Transition Prototypes: A Robust Adaptive Approach

In this work, we consider an online robust Markov Decision Process (MDP) where we have the information of finitely many prototypes of the underlying transition kernel. We consider an adaptively updated ambiguity set of the prototypes and…

Machine Learning · Computer Science 2024-12-20 Shuo Sun , Meng Qi , Zuo-Jun Max Shen

Efficient Algorithms for Robust Markov Decision Processes with $s$-Rectangular Ambiguity Sets

Robust Markov decision processes (MDPs) have attracted significant interest due to their ability to protect MDPs from poor out-of-sample performance in the presence of ambiguity. In contrast to classical MDPs, which account for…

Optimization and Control · Mathematics 2026-02-06 Chin Pang Ho , Marek Petrik , Wolfram Wiesemann

Robust Average-Reward Markov Decision Processes

In robust Markov decision processes (MDPs), the uncertainty in the transition kernel is addressed by finding a policy that optimizes the worst-case performance over an uncertainty set of MDPs. While much of the literature has focused on…

Machine Learning · Computer Science 2023-03-02 Yue Wang , Alvaro Velasquez , George Atia , Ashley Prater-Bennette , Shaofeng Zou

Multi-Objective Approaches to Markov Decision Processes with Uncertain Transition Parameters

Markov decision processes (MDPs) are a popular model for performance analysis and optimization of stochastic systems. The parameters of stochastic behavior of MDPs are estimates from empirical observations of a system; their values are not…

Artificial Intelligence · Computer Science 2017-10-26 Dimitri Scheftelowitsch , Peter Buchholz , Vahid Hashemi , Holger Hermanns