Related papers: CODEX: A Cluster-Based Method for Explainable Rein…

Clustered Reinforcement Learning

Exploration strategy design is one of the challenging problems in reinforcement learning~(RL), especially when the environment contains a large state space or sparse rewards. During exploration, the agent tries to discover novel areas or…

Machine Learning · Computer Science 2019-06-07 Xiao Ma , Shen-Yi Zhao , Wu-Jun Li

Behaviour Discovery and Attribution for Explainable Reinforcement Learning

Building trust in reinforcement learning (RL) agents requires understanding why they make certain decisions, especially in high-stakes applications like robotics, healthcare, and finance. Existing explainability methods often focus on…

Artificial Intelligence · Computer Science 2025-06-18 Rishav Rishav , Somjit Nath , Vincent Michalski , Samira Ebrahimi Kahou

Textual Explanations and Their Evaluations for Reinforcement Learning Policy

Understanding a Reinforcement Learning (RL) policy is crucial for ensuring that autonomous agents behave according to human expectations. This goal can be achieved using Explainable Reinforcement Learning (XRL) techniques. Although textual…

Artificial Intelligence · Computer Science 2026-01-07 Ahmad Terra , Mohit Ahmed , Rafia Inam , Elena Fersman , Martin Törngren

GANterfactual-RL: Understanding Reinforcement Learning Agents' Strategies through Visual Counterfactual Explanations

Counterfactual explanations are a common tool to explain artificial intelligence models. For Reinforcement Learning (RL) agents, they answer "Why not?" or "What if?" questions by illustrating what minimal change to a state is needed such…

Machine Learning · Computer Science 2023-02-27 Tobias Huber , Maximilian Demmler , Silvan Mertes , Matthew L. Olson , Elisabeth André

Inherently Explainable Reinforcement Learning in Natural Language

We focus on the task of creating a reinforcement learning agent that is inherently explainable -- with the ability to produce immediate local explanations by thinking out loud while performing a task and analyzing entire trajectories…

Human-Computer Interaction · Computer Science 2022-10-10 Xiangyu Peng , Mark O. Riedl , Prithviraj Ammanabrolu

Conceptual Reinforcement Learning for Language-Conditioned Tasks

Despite the broad application of deep reinforcement learning (RL), transferring and adapting the policy to unseen but similar environments is still a significant challenge. Recently, the language-conditioned policy is proposed to facilitate…

Machine Learning · Computer Science 2023-03-10 Shaohui Peng , Xing Hu , Rui Zhang , Jiaming Guo , Qi Yi , Ruizhi Chen , Zidong Du , Ling Li , Qi Guo , Yunji Chen

Explaining Reinforcement Learning Agents Through Counterfactual Action Outcomes

Explainable reinforcement learning (XRL) methods aim to help elucidate agent policies and decision-making processes. The majority of XRL approaches focus on local explanations, seeking to shed light on the reasons an agent acts the way it…

Artificial Intelligence · Computer Science 2023-12-19 Yotam Amitai , Yael Septon , Ofra Amir

Deep Reinforcement Learning with Stacked Hierarchical Attention for Text-based Games

We study reinforcement learning (RL) for text-based games, which are interactive simulations in the context of natural language. While different methods have been developed to represent the environment information and language actions,…

Machine Learning · Computer Science 2020-12-29 Yunqiu Xu , Meng Fang , Ling Chen , Yali Du , Joey Tianyi Zhou , Chengqi Zhang

On the Unreasonable Efficiency of State Space Clustering in Personalization Tasks

In this effort we consider a reinforcement learning (RL) technique for solving personalization tasks with complex reward signals. In particular, our approach is based on state space clustering with the use of a simplistic $k$-means…

Machine Learning · Computer Science 2021-12-28 Anton Dereventsov , Ranga Raju Vatsavai , Clayton Webster

Causal State Distillation for Explainable Reinforcement Learning

Reinforcement learning (RL) is a powerful technique for training intelligent agents, but understanding why these agents make specific decisions can be quite challenging. This lack of transparency in RL models has been a long-standing…

Machine Learning · Computer Science 2024-04-02 Wenhao Lu , Xufeng Zhao , Thilo Fryen , Jae Hee Lee , Mengdi Li , Sven Magg , Stefan Wermter

Reinforcement Learning-based Application Autoscaling in the Cloud: A Survey

Reinforcement Learning (RL) has demonstrated a great potential for automatically solving decision-making problems in complex uncertain environments. RL proposes a computational approach that allows learning through interaction in an…

Distributed, Parallel, and Cluster Computing · Computer Science 2020-11-18 Yisel Garí , David A. Monge , Elina Pacini , Cristian Mateos , Carlos García Garino

A Framework for Understanding and Visualizing Strategies of RL Agents

Recent years have seen significant advances in explainable AI as the need to understand deep learning models has gained importance with the increased emphasis on trust and ethics in AI. Comprehensible models for sequential decision tasks…

Artificial Intelligence · Computer Science 2022-08-19 Pedro Sequeira , Daniel Elenius , Jesse Hostetler , Melinda Gervasio

Semantic HELM: A Human-Readable Memory for Reinforcement Learning

Reinforcement learning agents deployed in the real world often have to cope with partially observable environments. Therefore, most agents employ memory mechanisms to approximate the state of the environment. Recently, there have been…

Machine Learning · Computer Science 2023-10-30 Fabian Paischer , Thomas Adler , Markus Hofmarcher , Sepp Hochreiter

Experiential Explanations for Reinforcement Learning

Reinforcement learning (RL) systems can be complex and non-interpretable, making it challenging for non-AI experts to understand or intervene in their decisions. This is due in part to the sequential nature of RL in which actions are chosen…

Artificial Intelligence · Computer Science 2025-04-16 Amal Alabdulkarim , Madhuri Singh , Gennie Mansi , Kaely Hall , Upol Ehsan , Mark O. Riedl

Grounding Natural Language Commands to StarCraft II Game States for Narration-Guided Reinforcement Learning

While deep reinforcement learning techniques have led to agents that are successfully able to learn to perform a number of tasks that had been previously unlearnable, these techniques are still susceptible to the longstanding problem of…

Multimedia · Computer Science 2019-06-07 Nicholas Waytowich , Sean L. Barton , Vernon Lawhern , Ethan Stump , Garrett Warnell

Reward-Predictive Clustering

Recent advances in reinforcement-learning research have demonstrated impressive results in building algorithms that can out-perform humans in complex tasks. Nevertheless, creating reinforcement-learning systems that can build abstractions…

Machine Learning · Computer Science 2022-11-08 Lucas Lehnert , Michael J. Frank , Michael L. Littman

RACCER: Towards Reachable and Certain Counterfactual Explanations for Reinforcement Learning

While reinforcement learning (RL) algorithms have been successfully applied to numerous tasks, their reliance on neural networks makes their behavior difficult to understand and trust. Counterfactual explanations are human-friendly…

Artificial Intelligence · Computer Science 2023-10-11 Jasmina Gajcin , Ivana Dusparic

Explainable Reinforcement Learning Through a Causal Lens

Prevalent theories in cognitive science propose that humans understand and represent the knowledge of the world through causal relationships. In making sense of the world, we build causal models in our mind to encode cause-effect relations…

Machine Learning · Computer Science 2019-11-21 Prashan Madumal , Tim Miller , Liz Sonenberg , Frank Vetere

Counterfactual Explanations for Clustering Models

Clustering algorithms rely on complex optimisation processes that may be difficult to comprehend, especially for individuals who lack technical expertise. While many explainable artificial intelligence techniques exist for supervised…

Machine Learning · Computer Science 2024-09-20 Aurora Spagnol , Kacper Sokol , Pietro Barbiero , Marc Langheinrich , Martin Gjoreski

ReLAX: Reinforcement Learning Agent eXplainer for Arbitrary Predictive Models

Counterfactual examples (CFs) are one of the most popular methods for attaching post-hoc explanations to machine learning (ML) models. However, existing CF generation methods either exploit the internals of specific models or depend on each…

Machine Learning · Computer Science 2023-08-10 Ziheng Chen , Fabrizio Silvestri , Jia Wang , He Zhu , Hongshik Ahn , Gabriele Tolomei