Related papers: Algorithms for Game Metrics

200 papers

We present new algorithms for computing and approximating bisimulation metrics in Markov Decision Processes (MDPs). Bisimulation metrics are an elegant formalism that captures behavioral equivalence between states and provide strong…

Machine Learning · Computer Science 2019-11-22 Pablo Samuel Castro

We consider two-player games played over finite state spaces for an infinite number of rounds. At each state, the players simultaneously choose moves; the moves determine a successor state. It is often advantageous for players to choose…

Logic in Computer Science · Computer Science 2015-07-01 Luca de Alfaro, Rupak Majumdar, Vishwanath Raman, Mariëlle Stoelinga

We present metrics for measuring the similarity of states in a finite Markov decision process (MDP). The formulation of our metrics is based on the notion of bisimulation for MDPs, with an aim towards solving discounted infinite horizon…

Artificial Intelligence · Computer Science 2012-07-19 Norman Ferns, Prakash Panangaden, Doina Precup

The bisimulation metric (BSM) is a powerful tool for computing state similarities within a Markov decision process (MDP), revealing that states closer in BSM have more similar optimal value functions. While BSM has been successfully…

Machine Learning · Computer Science 2025-11-04 Zhenyu Tao, Wei Xu, Xiaohu You

Bisimulation metrics define a distance measure between states of a Markov decision process (MDP) based on a comparison of reward sequences. Due to this property they provide theoretical guarantees in value function approximation (VFA). In…

Machine Learning · Computer Science 2022-11-15 Mete Kemertas, Allan Jepson

The bisimulation metric (BSM) is a powerful tool for analyzing state similarities within a Markov decision process (MDP), revealing that states closer in BSM have more similar optimal value functions. While BSM has been successfully…

Machine Learning · Computer Science 2025-12-22 Zhenyu Tao, Wei Xu, Xiaohu You

Bisimulation metrics are powerful tools for measuring similarities between stochastic processes, and specifically Markov chains. Recent advances have uncovered that bisimulation metrics are, in fact, optimal-transport distances, which has…

Machine Learning · Computer Science 2025-05-26 Sergio Calo, Anders Jonsson, Gergely Neu, Ludovic Schwartz, Javier Segovia-Aguas

The probabilistic bisimilarity distance of Deng et al. has been proposed as a robust quantitative generalization of Segala and Lynch's probabilistic bisimilarity for probabilistic automata. In this paper, we present a characterization of…

Formal Languages and Automata Theory · Computer Science 2023-06-22 Giorgio Bacci, Giovanni Bacci, Kim G. Larsen, Radu Mardare, Qiyi Tang, Franck van Breugel

Robust Markov decision processes (RMDPs) extend standard Markov decision processes (MDPs) to account for uncertainty in the transition probabilities. RMDPs have an uncertainty set that defines a set of possible transition functions, each of…

Logic in Computer Science · Computer Science 2026-04-30 Marnix Suilen, Guillermo A. Pérez

We study two-player zero-sum concurrent stochastic games with finite state and action space played for an infinite number of steps. In every step, the two players simultaneously and independently choose an action. Given the current state…

Computer Science and Game Theory · Computer Science 2024-10-10 Ali Asadi, Krishnendu Chatterjee, Raimundo Saona, Jakub Svoboda

Learning generalizable policies from visual input in the presence of visual distractions is a challenging problem in reinforcement learning. Recently, there has been renewed interest in bisimulation metrics as a tool to address this issue;…

Machine Learning · Computer Science 2022-01-31 Martin Bertran, Walter Talbott, Nitish Srivastava, Joshua Susskind

We consider two-player stochastic games played on a finite graph for infinitely many rounds. Stochastic games generalize both Markov decision processes (MDP) by adding an adversary player, and two-player deterministic games by adding…

Computer Science and Game Theory · Computer Science 2022-02-28 Laurent Doyen

Parity games play a central role in model checking and satisfiability checking. Solving parity games is computationally expensive, among others due to the size of the games, which, for model checking problems, can easily contain $10^9$…

Logic in Computer Science · Computer Science 2016-03-22 S. Cranen, J. J. A. Keiren, T. A. C. Willemse

We present metrics for measuring state similarity in Markov decision processes (MDPs) with infinitely many states, including MDPs with continuous state spaces. Such metrics provide a stable quantitative analogue of the notion of…

Artificial Intelligence · Computer Science 2012-07-09 Norman Ferns, Prakash Panangaden, Doina Precup

Simple stochastic games are turn-based 2.5-player games with a reachability objective. The basic question asks whether one player can ensure reaching a given target with at least a given probability. A natural extension is games with a…

Computer Science and Game Theory · Computer Science 2021-02-02 Pranav Ashok, Krishnendu Chatterjee, Jan Kretinsky, Maximilian Weininger, Tobias Winkler

We consider concurrent games played on graphs. At every round of a game, each player simultaneously and independently selects a move; the moves jointly determine the transition to a successor state. Two basic objectives are the safety…

Computer Science and Game Theory · Computer Science 2008-09-25 Krishnendu Chatterjee, Luca de Alfaro, Thomas A. Henzinger

We consider concurrent games played on graphs. At every round of the game, each player simultaneously and independently selects a move; the moves jointly determine the transition to a successor state. Two basic objectives are the safety…

Computer Science and Game Theory · Computer Science 2008-12-18 Krishnendu Chatterjee, Luca de Alfaro, Thomas A. Henzinger

We introduce a formal notion of masking fault-tolerance between probabilistic transition systems based on a variant of probabilistic bisimulation (named masking simulation). We also provide the corresponding probabilistic game…

Logic in Computer Science · Computer Science 2022-07-06 Pablo F. Castro, Pedro D'Argenio, Luciano Putruele, Ramiro Demasi

We consider concurrent games played on graphs. At every round of a game, each player simultaneously and independently selects a move; the moves jointly determine the transition to a successor state. Two basic objectives are the safety…

Computer Science and Game Theory · Computer Science 2012-07-03 Krishnendu Chatterjee, Luca de Alfaro, Thomas A. Henzinger

We introduce quantitative timed refinement and timed simulation (directed) metrics, incorporating zenoness checks, for timed systems. These metrics assign positive real numbers between zero and infinity which quantify the \emph{timing…

Systems and Control · Computer Science 2015-03-19 Krishnendu Chatterjee, Vinayak S. Prabhu