Related papers: Comparing Labelled Markov Decision Processes

Minimising the Probabilistic Bisimilarity Distance

A labelled Markov decision process (MDP) is a labelled Markov chain with nondeterminism; i.e., together with a strategy a labelled MDP induces a labelled Markov chain. The model is related to interval Markov chains. Motivated by…

Formal Languages and Automata Theory · Computer Science 2024-07-01 Stefan Kiefer , Qiyi Tang

Trace Refinement in Labelled Markov Decision Processes

Given two labelled Markov decision processes (MDPs), the trace-refinement problem asks whether for all strategies of the first MDP there exists a strategy of the second MDP such that the induced labelled Markov chains are trace-equivalent.…

Logic in Computer Science · Computer Science 2023-06-22 Nathanaël Fijalkow , Stefan Kiefer , Mahsa Shirmohammadi

CTMCs with Imprecisely Timed Observations

Labeled continuous-time Markov chains (CTMCs) describe processes subject to random timing and partial observability. In applications such as runtime monitoring, we must incorporate past observations. The timing of these observations matters…

Logic in Computer Science · Computer Science 2024-01-30 Thom Badings , Matthias Volk , Sebastian Junges , Marielle Stoelinga , Nils Jansen

Learning Mixtures of Markov Chains and MDPs

We present an algorithm for learning mixtures of Markov chains and Markov decision processes (MDPs) from short unlabeled trajectories. Specifically, our method handles mixtures of Markov chains with optional control input by going through a…

Machine Learning · Statistics 2023-02-07 Chinmaya Kausik , Kevin Tan , Ambuj Tewari

On the Total Variation Distance of Labelled Markov Chains

Labelled Markov chains (LMCs) are widely used in probabilistic verification, speech recognition, computational biology, and many other fields. Checking two LMCs for equivalence is a classical problem subject to extensive studies, while the…

Logic in Computer Science · Computer Science 2014-05-16 Taolue Chen , Stefan Kiefer

Robust Probabilistic Bisimilarity for Labelled Markov Chains

Despite its prevalence, probabilistic bisimilarity suffers from a lack of robustness under minuscule perturbations of the transition probabilities. This can lead to discontinuities in the probabilistic bisimilarity distance function,…

Logic in Computer Science · Computer Science 2025-05-22 Syyeda Zainab Fatmi , Stefan Kiefer , David Parker , Franck van Breugel

Bisimulations for Nondeterministic Labeled Markov Processes

We extend the theory of labeled Markov processes with internal nondeterminism, a fundamental concept for the further development of a process theory with abstraction on nondeterministic continuous probabilistic systems. We define…

Logic in Computer Science · Computer Science 2015-03-17 Pedro D'Argenio , Pedro Sánchez Terraf , Nicolás Wolovick

Scalable Verification of Markov Decision Processes

Markov decision processes (MDP) are useful to model concurrent process optimisation problems, but verifying them with numerical methods is often intractable. Existing approximative approaches do not scale well and are limited to memoryless…

Data Structures and Algorithms · Computer Science 2014-09-18 Axel Legay , Sean Sedwards , Louis-Marie Traonouez

Decisiveness for countable MDPs and insights for NPLCSs and POMDPs

Markov chains and Markov decision processes (MDPs) are well-established probabilistic models. While finite Markov models are well-understood, analysing their infinite counterparts remains a significant challenge. Decisiveness has proven to…

Logic in Computer Science · Computer Science 2025-04-23 Nathalie Bertrand , Patricia Bouyer , Thomas Brihaye , Paulin Fournier , Pierre Vandenhove

Distributionally robust chance-constrained Markov decision processes

Markov decision process (MDP) is a decision making framework where a decision maker is interested in maximizing the expected discounted value of a stream of rewards received at future stages at various states which are visited according to…

Optimization and Control · Mathematics 2022-12-19 Hoang Nam Nguyen , Abdel Lisser , Vikas Vikram Singh

Piecewise Deterministic Markov Processes and their invariant measures

Piecewise Deterministic Markov Processes (PDMPs) are studied in a general framework. First, different constructions are proven to be equivalent. Second, we introduce a coupling between two PDMPs following the same differential flow which…

Probability · Mathematics 2021-08-03 Alain Durmus , Arnaud Guillin , Pierre Monmarché

Near-Optimal Learning and Planning in Separated Latent MDPs

We study computational and statistical aspects of learning Latent Markov Decision Processes (LMDPs). In this model, the learner interacts with an MDP drawn at the beginning of each epoch from an unknown mixture of MDPs. To sidestep known…

Machine Learning · Computer Science 2024-06-13 Fan Chen , Constantinos Daskalakis , Noah Golowich , Alexander Rakhlin

MDPs with Unawareness

Markov decision processes (MDPs) are widely used for modeling decision-making problems in robotics, automated control, and economics. Traditional MDPs assume that the decision maker (DM) knows all states and actions. However, this may not…

Artificial Intelligence · Computer Science 2014-07-29 Joseph Y. Halpern , Nan Rong , Ashutosh Saxena

MDPs with Unawareness

Markov decision processes (MDPs) are widely used for modeling decision-making problems in robotics, automated control, and economics. Traditional MDPs assume that the decision maker (DM) knows all states and actions. However, this may not…

Artificial Intelligence · Computer Science 2010-06-14 Joseph Y. Halpern , Nan Rong , Ashutosh Saxena

Timed Comparisons of Semi-Markov Processes

Semi-Markov processes are Markovian processes in which the firing time of the transitions is modelled by probabilistic distributions over positive reals interpreted as the probability of firing a transition at a certain moment in time. In…

Formal Languages and Automata Theory · Computer Science 2017-12-04 Mathias Ruggaard Pedersen , Nathanaël Fijalkow , Giorgio Bacci , Kim Guldstrand Larsen , Radu Mardare

Distributed Markov Chains

The formal verification of large probabilistic models is important and challenging. Exploiting the concurrency that is often present is one way to address this problem. Here we study a restricted class of asynchronous distributed…

Distributed, Parallel, and Cluster Computing · Computer Science 2014-08-06 Sumit Kumar Jha , Madhavan Mukund , Ratul Saha , P S Thiagarajan

Synchronizing Objectives for Markov Decision Processes

We introduce synchronizing objectives for Markov decision processes (MDP). Intuitively, a synchronizing objective requires that eventually, at every step there is a state which concentrates almost all the probability mass. In particular, it…

Logic in Computer Science · Computer Science 2011-02-22 Laurent Doyen , Thierry Massart , Mahsa Shirmohammadi

On the Complexity of Reachability in Parametric Markov Decision Processes

This paper studies parametric Markov decision processes (pMDPs), an extension to Markov decision processes (MDPs) where transitions probabilities are described by polynomials over a finite set of parameters. Fixing values for all parameters…

Logic in Computer Science · Computer Science 2019-04-03 Tobias Winkler , Sebastian Junges , Guillermo A. Pérez , Joost-Pieter Katoen

Multi-Objective Approaches to Markov Decision Processes with Uncertain Transition Parameters

Markov decision processes (MDPs) are a popular model for performance analysis and optimization of stochastic systems. The parameters of stochastic behavior of MDPs are estimates from empirical observations of a system; their values are not…

Artificial Intelligence · Computer Science 2017-10-26 Dimitri Scheftelowitsch , Peter Buchholz , Vahid Hashemi , Holger Hermanns

Multiple-Environment Markov Decision Processes

We introduce Multi-Environment Markov Decision Processes (MEMDPs) which are MDPs with a set of probabilistic transition functions. The goal in a MEMDP is to synthesize a single controller with guaranteed performances against all…

Logic in Computer Science · Computer Science 2014-12-04 Jean-François Raskin , Ocan Sankur