Related papers: Robust Parameter Learning for Uncertain MDPs

Robust Anytime Learning of Markov Decision Processes

Markov decision processes (MDPs) are formal models commonly used in sequential decision-making. MDPs capture the stochasticity that may arise, for instance, from imprecise actuators via probabilities in the transition function. However, in…

Artificial Intelligence · Computer Science 2023-06-21 Marnix Suilen , Thiago D. Simão , David Parker , Nils Jansen

Scenario-Based Verification of Uncertain MDPs

We consider Markov decision processes (MDPs) in which the transition probabilities and rewards belong to an uncertainty set parametrized by a collection of random variables. The probability distributions for these random parameters are…

Logic in Computer Science · Computer Science 2020-02-26 Murat Cubuktepe , Nils Jansen , Sebastian Junges , Joost-Pieter Katoen , Ufuk Topcu

Certifiably Robust Policies for Uncertain Parametric Environments

We present a data-driven approach for producing policies that are provably robust across unknown stochastic environments. Existing approaches can learn models of a single environment as an interval Markov decision processes (IMDP) and…

Machine Learning · Computer Science 2025-03-25 Yannik Schnitzer , Alessandro Abate , David Parker

Learning Robust Policies for Uncertain Parametric Markov Decision Processes

Synthesising verifiably correct controllers for dynamical systems is crucial for safety-critical problems. To achieve this, it is important to account for uncertainty in a robust manner, while at the same time it is often of interest to…

Systems and Control · Electrical Eng. & Systems 2024-05-16 Luke Rickard , Alessandro Abate , Kostas Margellos

Parameter-Independent Strategies for pMDPs via POMDPs

Markov Decision Processes (MDPs) are a popular class of models suitable for solving control decision problems in probabilistic reactive systems. We consider parametric MDPs (pMDPs) that include parameters in some of the transition…

Logic in Computer Science · Computer Science 2018-06-14 Sebastian Arming , Ezio Bartocci , Krishnendu Chatterjee , Joost-Pieter Katoen , Ana Sokolova

Robust Markov Decision Processes: A Place Where AI and Formal Methods Meet

Markov decision processes (MDPs) are a standard model for sequential decision-making problems and are widely used across many scientific areas, including formal methods and artificial intelligence (AI). MDPs do, however, come with the…

Artificial Intelligence · Computer Science 2024-12-11 Marnix Suilen , Thom Badings , Eline M. Bovy , David Parker , Nils Jansen

Scenario-Based Verification of Uncertain Parametric MDPs

We consider parametric Markov decision processes (pMDPs) that are augmented with unknown probability distributions over parameter values. The problem is to compute the probability to satisfy a temporal logic specification with any concrete…

Logic in Computer Science · Computer Science 2022-12-08 Thom Badings , Murat Cubuktepe , Nils Jansen , Sebastian Junges , Joost-Pieter Katoen , Ufuk Topcu

Multi-Objective Approaches to Markov Decision Processes with Uncertain Transition Parameters

Markov decision processes (MDPs) are a popular model for performance analysis and optimization of stochastic systems. The parameters of stochastic behavior of MDPs are estimates from empirical observations of a system; their values are not…

Artificial Intelligence · Computer Science 2017-10-26 Dimitri Scheftelowitsch , Peter Buchholz , Vahid Hashemi , Holger Hermanns

Efficient Solution and Learning of Robust Factored MDPs

Robust Markov decision processes (r-MDPs) extend MDPs by explicitly modelling epistemic uncertainty about transition dynamics. Learning r-MDPs from interactions with an unknown environment enables the synthesis of robust policies with…

Machine Learning · Computer Science 2025-11-21 Yannik Schnitzer , Alessandro Abate , David Parker

Multi-objective Robust Strategy Synthesis for Interval Markov Decision Processes

Interval Markov decision processes (IMDPs) generalise classical MDPs by having interval-valued transition probabilities. They provide a powerful modelling tool for probabilistic systems with an additional variation or uncertainty that…

Systems and Control · Computer Science 2017-07-07 Ernst Moritz Hahn , Vahid Hashemi , Holger Hermanns , Morteza Lahijanian , Andrea Turrini

Scaling Up Robust MDPs by Reinforcement Learning

We consider large-scale Markov decision processes (MDPs) with parameter uncertainty, under the robust MDP paradigm. Previous studies showed that robust MDPs, based on a minimax approach to handle uncertainty, can be solved using dynamic…

Machine Learning · Computer Science 2013-06-27 Aviv Tamar , Huan Xu , Shie Mannor

Robust Markov Decision Process: Beyond Rectangularity

We consider a robust approach to address uncertainty in model parameters in Markov Decision Processes (MDPs), which are widely used to model dynamic optimization in many applications. Most prior works consider the case where the uncertainty…

Optimization and Control · Mathematics 2021-09-02 Vineet Goyal , Julien Grand-Clément

Linear Mixture Distributionally Robust Markov Decision Processes

Many real-world decision-making problems face the off-dynamics challenge: the agent learns a policy in a source domain and deploys it in a target domain with different state transitions. The distributionally robust Markov decision process…

Machine Learning · Computer Science 2025-05-26 Zhishuai Liu , Pan Xu

Risk Aversion to Parameter Uncertainty in Markov Decision Processes with an Application to Slow-Onset Disaster Relief

In classical Markov Decision Processes (MDPs), action costs and transition probabilities are assumed to be known, although an accurate estimation of these parameters is often not possible in practice. This study addresses MDPs under cost…

Optimization and Control · Mathematics 2019-06-24 Merve Merakli , Simge Kucukyavuz

Solving Robust Markov Decision Processes: Generic, Reliable, Efficient

Markov decision processes (MDP) are a well-established model for sequential decision-making in the presence of probabilities. In robust MDP (RMDP), every action is associated with an uncertainty set of probability distributions, modelling…

Artificial Intelligence · Computer Science 2024-12-16 Tobias Meggendorfer , Maximilian Weininger , Patrick Wienhöft

Learning Algorithms for Verification of Markov Decision Processes

We present a general framework for applying learning algorithms and heuristical guidance to the verification of Markov decision processes (MDPs). The primary goal of our techniques is to improve performance by avoiding an exhaustive…

Systems and Control · Electrical Eng. & Systems 2025-04-02 Tomáš Brázdil , Krishnendu Chatterjee , Martin Chmelik , Vojtěch Forejt , Jan Křetínský , Marta Kwiatkowska , Tobias Meggendorfer , David Parker , Mateusz Ujma

Temporal Logic Control for Nonlinear Stochastic Systems Under Unknown Disturbances

In this paper, we present a novel framework to synthesize robust strategies for discrete-time nonlinear systems with random disturbances that are unknown, against temporal logic specifications. The proposed framework is data-driven and…

Systems and Control · Electrical Eng. & Systems 2025-04-29 Ibon Gracia , Luca Laurenti , Manuel Mazo , Alessandro Abate , Morteza Lahijanian

On the Complexity of Discounted Robust MDPs with $L_p$ Uncertainty Sets

A basic model in sequential decision making is the Markov decision process (MDP), which is extended to Robust MDPs (RMDPs) by allowing uncertainty in transition probabilities and optimizing against the worst-case transition probabilities…

Computational Complexity · Computer Science 2026-05-11 Ali Asadi , Krishnendu Chatterjee , Alipasha Montaseri , Ali Shafiee

On the Complexity of Robust Markov Decision Processes and Bisimulation Metrics

Robust Markov decision processes (RMDPs) extend standard Markov decision processes (MDPs) to account for uncertainty in the transition probabilities. RMDPs have an uncertainty set that defines a set of possible transition functions, each of…

Logic in Computer Science · Computer Science 2026-04-30 Marnix Suilen , Guillermo A. Pérez

Sample Complexity of Robust Reinforcement Learning with a Generative Model

The Robust Markov Decision Process (RMDP) framework focuses on designing control policies that are robust against the parameter uncertainties due to the mismatches between the simulator model and real-world settings. An RMDP problem is…

Machine Learning · Computer Science 2022-05-17 Kishan Panaganti , Dileep Kalathil