Related papers: Sample Complexity Characterization for Linear Cont…

Learning to Decide with Just Enough: Information-Theoretic Context Summarization for CMDPs

Contextual Markov Decision Processes (CMDPs) offer a framework for sequential decision-making under external signals, but existing methods often fail to generalize in high-dimensional or unstructured contexts, resulting in excessive…

Artificial Intelligence · Computer Science 2025-10-06 Peidong Liu, Junjiang Lin, Shaowen Wang, Yao Xu, Haiqing Li, Xuhao Xie, Siyi Wu, Hao Li

Tackling Decision Processes with Non-Cumulative Objectives using Reinforcement Learning

Markov decision processes (MDPs) are used to model a wide variety of applications ranging from game playing over robotics to finance. Their optimal policy typically maximizes the expected sum of rewards given at each step of the decision…

Machine Learning · Computer Science 2025-05-26 Maximilian Nägele, Jan Olle, Thomas Fösel, Remmy Zen, Florian Marquardt

Reinforcement Learning with History-Dependent Dynamic Contexts

We introduce Dynamic Contextual Markov Decision Processes (DCMDPs), a novel reinforcement learning framework for history-dependent environments that generalizes the contextual MDP framework to handle non-Markov environments, where contexts…

Machine Learning · Computer Science 2023-05-19 Guy Tennenholtz, Nadav Merlis, Lior Shani, Martin Mladenov, Craig Boutilier

Learning with Safety Constraints: Sample Complexity of Reinforcement Learning for Constrained MDPs

Many physical systems have underlying safety considerations that require that the policy employed ensures the satisfaction of a set of constraints. The analytical formulation usually takes the form of a Constrained Markov Decision Process…

Machine Learning · Computer Science 2021-03-03 Aria HasanzadeZonuzy, Archana Bura, Dileep Kalathil, Srinivas Shakkottai

Exploring and Learning in Sparse Linear MDPs without Computationally Intractable Oracles

The key assumption underlying linear Markov Decision Processes (MDPs) is that the learner has access to a known feature map $\phi(x, a)$ that maps state-action pairs to $d$-dimensional vectors, and that the rewards and transitions are…

Machine Learning · Computer Science 2023-09-20 Noah Golowich, Ankur Moitra, Dhruv Rohatgi

Primal-Dual Sample Complexity Bounds for Constrained Markov Decision Processes with Multiple Constraints

This paper addresses the challenge of solving Constrained Markov Decision Processes (CMDPs) with $d > 1$ constraints when the transition dynamics are unknown, but samples can be drawn from a generative model. We propose a model-based…

Machine Learning · Computer Science 2025-03-11 Max Buckley, Konstantinos Papathanasiou, Andreas Spanopoulos

Making Linear MDPs Practical via Contrastive Representation Learning

It is common to address the curse of dimensionality in Markov decision processes (MDPs) by exploiting low-rank representations. This motivates much of the recent theoretical study on linear MDPs. However, most approaches require a given…

Machine Learning · Computer Science 2022-12-09 Tianjun Zhang, Tongzheng Ren, Mengjiao Yang, Joseph E. Gonzalez, Dale Schuurmans, Bo Dai

On the Complexity of Robust Markov Decision Processes and Bisimulation Metrics

Robust Markov decision processes (RMDPs) extend standard Markov decision processes (MDPs) to account for uncertainty in the transition probabilities. RMDPs have an uncertainty set that defines a set of possible transition functions, each of…

Logic in Computer Science · Computer Science 2026-04-30 Marnix Suilen, Guillermo A. Pérez

Robust Markov Decision Processes: A Place Where AI and Formal Methods Meet

Markov decision processes (MDPs) are a standard model for sequential decision-making problems and are widely used across many scientific areas, including formal methods and artificial intelligence (AI). MDPs do, however, come with the…

Artificial Intelligence · Computer Science 2024-12-11 Marnix Suilen, Thom Badings, Eline M. Bovy, David Parker, Nils Jansen

On the Complexity of Reachability in Parametric Markov Decision Processes

This paper studies parametric Markov decision processes (pMDPs), an extension to Markov decision processes (MDPs) where transition probabilities are described by polynomials over a finite set of parameters. Fixing values for all parameters…

Logic in Computer Science · Computer Science 2019-04-03 Tobias Winkler, Sebastian Junges, Guillermo A. Pérez, Joost-Pieter Katoen

Achieving Instance-dependent Sample Complexity for Constrained Markov Decision Process

We consider the reinforcement learning problem for the constrained Markov decision process (CMDP), which plays a central role in satisfying safety or resource constraints in sequential learning and decision-making. In this problem, we are…

Machine Learning · Computer Science 2025-11-19 Jiashuo Jiang, Yinyu Ye

Contextual Markov Decision Processes

We consider a planning problem where the dynamics and rewards of the environment depend on a hidden static parameter referred to as the context. The objective is to learn a strategy that maximizes the accumulated reward across all contexts.…

Machine Learning · Statistics 2015-02-10 Assaf Hallak, Dotan Di Castro, Shie Mannor

Chance and Mass Interpretations of Probabilities in Markov Decision Processes (Extended Version)

Markov decision processes (MDPs) are a popular model for decision-making in the presence of uncertainty. The conventional view of MDPs in verification treats them as state transformers with probabilities defined over sequences of states and…

Formal Languages and Automata Theory · Computer Science 2025-07-25 Yun Chen Tsai, Kittiphon Phalakarn, S. Akshay, Ichiro Hasuo

Multiple-Environment Markov Decision Processes

We introduce Multi-Environment Markov Decision Processes (MEMDPs) which are MDPs with a set of probabilistic transition functions. The goal in a MEMDP is to synthesize a single controller with guaranteed performances against all…

Logic in Computer Science · Computer Science 2014-12-04 Jean-François Raskin, Ocan Sankur

Generalized Linear Markov Decision Process

The linear Markov Decision Process (MDP) framework offers a principled foundation for reinforcement learning (RL) with strong theoretical guarantees and sample efficiency. However, its restrictive assumption, that both transition dynamics…

Machine Learning · Statistics 2025-06-03 Sinian Zhang, Kaicheng Zhang, Ziping Xu, Tianxi Cai, Doudou Zhou

Confident Natural Policy Gradient for Local Planning in $q_\pi$-realizable Constrained MDPs

The constrained Markov decision process (CMDP) framework emerges as an important reinforcement learning approach for imposing safety or other critical objectives while maximizing cumulative reward. However, the current understanding of how…

Machine Learning · Computer Science 2024-12-11 Tian Tian, Lin F. Yang, Csaba Szepesvári

Learning Efficiently Function Approximation for Contextual MDP

We study learning contextual MDPs using function approximation for both the rewards and the dynamics. We consider both the case where the dynamics depend on the context and the case where they are independent of it. For both models we derive polynomial sample and…

Machine Learning · Computer Science 2022-12-01 Orin Levy, Yishay Mansour

Sample-Efficient Reinforcement Learning for Linearly-Parameterized MDPs with a Generative Model

The curse of dimensionality is a widely known issue in reinforcement learning (RL). In the tabular setting where the state space $\mathcal{S}$ and the action space $\mathcal{A}$ are both finite, to obtain a nearly optimal policy with…

Machine Learning · Computer Science 2022-10-28 Bingyan Wang, Yuling Yan, Jianqing Fan

On the Complexity of Solving Markov Decision Problems

Markov decision problems (MDPs) provide the foundations for a number of problems of interest to AI researchers studying automated planning and reinforcement learning. In this paper, we summarize results regarding the complexity of solving…

Artificial Intelligence · Computer Science 2013-02-21 Michael L. Littman, Thomas L. Dean, Leslie Pack Kaelbling

Verification of Markov Decision Processes using Learning Algorithms

We present a general framework for applying machine-learning algorithms to the verification of Markov decision processes (MDPs). The primary goal of these techniques is to improve performance by avoiding an exhaustive exploration of the…

Logic in Computer Science · Computer Science 2015-03-31 Tomáš Brázdil, Krishnendu Chatterjee, Martin Chmelík, Vojtěch Forejt, Jan Křetínský, Marta Kwiatkowska, David Parker, Mateusz Ujma