Related papers: Algorithms for Game Metrics

Scalable methods for computing state similarity in deterministic Markov Decision Processes

We present new algorithms for computing and approximating bisimulation metrics in Markov Decision Processes (MDPs). Bisimulation metrics are an elegant formalism that capture behavioral equivalence between states and provide strong…

Machine Learning · Computer Science 2019-11-22 Pablo Samuel Castro

Game Refinement Relations and Metrics

We consider two-player games played over finite state spaces for an infinite number of rounds. At each state, the players simultaneously choose moves; the moves determine a successor state. It is often advantageous for players to choose…

Logic in Computer Science · Computer Science 2015-07-01 Luca de Alfaro , Rupak Majumdar , Vishwanath Raman , Mariëlle Stoelinga

Metrics for Finite Markov Decision Processes

We present metrics for measuring the similarity of states in a finite Markov decision process (MDP). The formulation of our metrics is based on the notion of bisimulation for MDPs, with an aim towards solving discounted infinite horizon…

Artificial Intelligence · Computer Science 2012-07-19 Norman Ferns , Prakash Panangaden , Doina Precup

A Generalized Bisimulation Metric of State Similarity between Markov Decision Processes: From Theoretical Propositions to Applications

The bisimulation metric (BSM) is a powerful tool for computing state similarities within a Markov decision process (MDP), revealing that states closer in BSM have more similar optimal value functions. While BSM has been successfully…

Machine Learning · Computer Science 2025-11-04 Zhenyu Tao , Wei Xu , Xiaohu You

Approximate Policy Iteration with Bisimulation Metrics

Bisimulation metrics define a distance measure between states of a Markov decision process (MDP) based on a comparison of reward sequences. Due to this property they provide theoretical guarantees in value function approximation (VFA). In…

Machine Learning · Computer Science 2022-11-15 Mete Kemertas , Allan Jepson

A Theoretical Analysis of State Similarity Between Markov Decision Processes

The bisimulation metric (BSM) is a powerful tool for analyzing state similarities within a Markov decision process (MDP), revealing that states closer in BSM have more similar optimal value functions. While BSM has been successfully…

Machine Learning · Computer Science 2025-12-22 Zhenyu Tao , Wei Xu , Xiaohu You

Distances for Markov chains from sample streams

Bisimulation metrics are powerful tools for measuring similarities between stochastic processes, and specifically Markov chains. Recent advances have uncovered that bisimulation metrics are, in fact, optimal-transport distances, which has…

Machine Learning · Computer Science 2025-05-26 Sergio Calo , Anders Jonsson , Gergely Neu , Ludovic Schwartz , Javier Segovia-Aguas

Computing Probabilistic Bisimilarity Distances for Probabilistic Automata

The probabilistic bisimilarity distance of Deng et al. has been proposed as a robust quantitative generalization of Segala and Lynch's probabilistic bisimilarity for probabilistic automata. In this paper, we present a characterization of…

Formal Languages and Automata Theory · Computer Science 2023-06-22 Giorgio Bacci , Giovanni Bacci , Kim G. Larsen , Radu Mardare , Qiyi Tang , Franck van Breugel

On the Complexity of Robust Markov Decision Processes and Bisimulation Metrics

Robust Markov decision processes (RMDPs) extend standard Markov decision processes (MDPs) to account for uncertainty in the transition probabilities. RMDPs have an uncertainty set that defines a set of possible transition functions, each of…

Logic in Computer Science · Computer Science 2026-04-30 Marnix Suilen , Guillermo A. Pérez

Concurrent Stochastic Games with Stateful-discounted and Parity Objectives: Complexity and Algorithms

We study two-player zero-sum concurrent stochastic games with finite state and action space played for an infinite number of steps. In every step, the two players simultaneously and independently choose an action. Given the current state…

Computer Science and Game Theory · Computer Science 2024-10-10 Ali Asadi , Krishnendu Chatterjee , Raimundo Saona , Jakub Svoboda

Efficient Embedding of Semantic Similarity in Control Policies via Entangled Bisimulation

Learning generalizeable policies from visual input in the presence of visual distractions is a challenging problem in reinforcement learning. Recently, there has been renewed interest in bisimulation metrics as a tool to address this issue;…

Machine Learning · Computer Science 2022-01-31 Martin Bertran , Walter Talbott , Nitish Srivastava , Joshua Susskind

Stochastic Games with Synchronizing Objectives

We consider two-player stochastic games played on a finite graph for infinitely many rounds. Stochastic games generalize both Markov decision processes (MDP) by adding an adversary player, and two-player deterministic games by adding…

Computer Science and Game Theory · Computer Science 2022-02-28 Laurent Doyen

Parity Game Reductions

Parity games play a central role in model checking and satisfiability checking. Solving parity games is computationally expensive, among others due to the size of the games, which, for model checking problems, can easily contain $10^9$…

Logic in Computer Science · Computer Science 2016-03-22 S. Cranen , J. J. A. Keiren , T. A. C. Willemse

Metrics for Markov Decision Processes with Infinite State Spaces

We present metrics for measuring state similarity in Markov decision processes (MDPs) with infinitely many states, including MDPs with continuous state spaces. Such metrics provide a stable quantitative analogue of the notion of…

Artificial Intelligence · Computer Science 2012-07-09 Norman Ferns , Prakash Panangaden , Doina Precup

Approximating Values of Generalized-Reachability Stochastic Games

Simple stochastic games are turn-based 2.5-player games with a reachability objective. The basic question asks whether one player can ensure reaching a given target with at least a given probability. A natural extension is games with a…

Computer Science and Game Theory · Computer Science 2021-02-02 Pranav Ashok , Krishnendu Chatterjee , Jan Kretinsky , Maximilian Weininger , Tobias Winkler

Termination Criteria for Solving Concurrent Safety and Reachability Games

We consider concurrent games played on graphs. At every round of a game, each player simultaneously and independently selects a move; the moves jointly determine the transition to a successor state. Two basic objectives are the safety…

Computer Science and Game Theory · Computer Science 2008-09-25 Krishnendu Chatterjee , Luca de Alfaro , Thomas A. Henzinger

Strategy Improvement for Concurrent Safety Games

We consider concurrent games played on graphs. At every round of the game, each player simultaneously and independently selects a move; the moves jointly determine the transition to a successor state. Two basic objectives are the safety…

Computer Science and Game Theory · Computer Science 2008-12-18 Krishnendu Chatterjee , Luca de Alfaro , Thomas A. Henzinger

A Stochastic Game Approach to Masking Fault-Tolerance: Bisimulation and Quantification

We introduce a formal notion of masking fault-tolerance between probabilistic transition systems based on a variant of probabilistic bisimulation (named masking simulation). We also provide the corresponding probabilistic game…

Logic in Computer Science · Computer Science 2022-07-06 Pablo F. Castro , Pedro D'Argenio , Luciano Putruele , Ramiro Demasi

Strategy Improvement for Concurrent Reachability and Safety Games

We consider concurrent games played on graphs. At every round of a game, each player simultaneously and independently selects a move; the moves jointly determine the transition to a successor state. Two basic objectives are the safety…

Computer Science and Game Theory · Computer Science 2012-07-03 Krishnendu Chatterjee , Luca de Alfaro , Thomas A. Henzinger

Quantitative Timed Simulation Functions and Refinement Metrics for Timed Systems (Full Version)

We introduce quantatitive timed refinement and timed simulation (directed) metrics, incorporating zenoness check s, for timed systems. These metrics assign positive real numbers between zero and infinity which quantify the \emph{timing…

Systems and Control · Computer Science 2015-03-19 Krishnendu Chatterjee , Vinayak S. Prabhu