Related papers: Explainable Reinforcement Learning Agents Using Wo…

Explainable Reinforcement Learning for Broad-XAI: A Conceptual Framework and Survey

Broad Explainable Artificial Intelligence moves away from interpreting individual decisions based on a single datum and aims to provide integrated explanations from multiple machine learning algorithms into a coherent explanation of an…

Artificial Intelligence · Computer Science 2021-08-23 Richard Dazeley, Peter Vamplew, Francisco Cruz

Explainable Reinforcement Learning: A Survey

Explainable Artificial Intelligence (XAI), i.e., the development of more transparent and interpretable AI models, has gained increased traction over the last few years. This is due to the fact that, in conjunction with their growth into…

Machine Learning · Computer Science 2020-05-14 Erika Puiutta, Eric MSP Veith

Experiential Explanations for Reinforcement Learning

Reinforcement learning (RL) systems can be complex and non-interpretable, making it challenging for non-AI experts to understand or intervene in their decisions. This is due in part to the sequential nature of RL in which actions are chosen…

Artificial Intelligence · Computer Science 2025-04-16 Amal Alabdulkarim, Madhuri Singh, Gennie Mansi, Kaely Hall, Upol Ehsan, Mark O. Riedl

Explainability in Deep Reinforcement Learning

A large set of the explainable Artificial Intelligence (XAI) literature is emerging on feature relevance techniques to explain a deep neural network (DNN) output or explaining models that ingest image source data. However, assessing how XAI…

Artificial Intelligence · Computer Science 2020-12-21 Alexandre Heuillet, Fabien Couthouis, Natalia Díaz-Rodríguez

A Survey of Explainable Reinforcement Learning: Targets, Methods and Needs

The success of recent Artificial Intelligence (AI) models has been accompanied by the opacity of their internal mechanisms, due notably to the use of deep neural networks. In order to understand these internal mechanisms and explain the…

Artificial Intelligence · Computer Science 2025-07-18 Léo Saulières

Explainable Artificial Intelligence (XAI) for Increasing User Trust in Deep Reinforcement Learning Driven Autonomous Systems

We consider the problem of providing users of deep Reinforcement Learning (RL) based systems with a better understanding of when their output can be trusted. We offer an explainable artificial intelligence (XAI) framework that provides a…

Artificial Intelligence · Computer Science 2021-06-08 Jeff Druce, Michael Harradon, James Tittle

Demystifying Reinforcement Learning in Production Scheduling via Explainable AI

Deep Reinforcement Learning (DRL) is a frequently employed technique to solve scheduling problems. Although DRL agents ace at delivering viable results in short computing times, their reasoning remains opaque. We conduct a case study where…

Artificial Intelligence · Computer Science 2024-09-02 Daniel Fischer, Hannah M. Hüsener, Felix Grumbach, Lukas Vollenkemper, Arthur Müller, Pascal Reusch

Explainable Reinforcement Learning via a Causal World Model

Generating explanations for reinforcement learning (RL) is challenging as actions may produce long-term effects on the future. In this paper, we develop a novel framework for explainable RL by learning a causal world model without prior…

Machine Learning · Computer Science 2024-01-19 Zhongwei Yu, Jingqing Ruan, Dengpeng Xing

GANterfactual-RL: Understanding Reinforcement Learning Agents' Strategies through Visual Counterfactual Explanations

Counterfactual explanations are a common tool to explain artificial intelligence models. For Reinforcement Learning (RL) agents, they answer "Why not?" or "What if?" questions by illustrating what minimal change to a state is needed such…

Machine Learning · Computer Science 2023-02-27 Tobias Huber, Maximilian Demmler, Silvan Mertes, Matthew L. Olson, Elisabeth André

Reversing the Lens: Using Explainable AI to Understand Human Expertise

Both humans and machine learning models learn from experience, particularly in safety- and reliability-critical domains. While psychology seeks to understand human cognition, the field of Explainable AI (XAI) develops methods to interpret…

Human-Computer Interaction · Computer Science 2025-11-25 Roussel Rahman, Aashwin Ananda Mishra, Wan-Lin Hu

Explanation of Reinforcement Learning Model in Dynamic Multi-Agent System

Recently, there has been increasing interest in transparency and interpretability in Deep Reinforcement Learning (DRL) systems. Verbal explanations, as the most natural way of communication in our daily life, deserve more attention, since…

Artificial Intelligence · Computer Science 2020-12-25 Xinzhi Wang, Huao Li, Hui Zhang, Michael Lewis, Katia Sycara

Explaining Agent Behavior with Large Language Models

Intelligent agents such as robots are increasingly deployed in real-world, safety-critical settings. It is vital that these agents are able to explain the reasoning behind their decisions to human counterparts, however, their behavior is…

Machine Learning · Computer Science 2023-09-20 Xijia Zhang, Yue Guo, Simon Stepputtis, Katia Sycara, Joseph Campbell

TalkToAgent: A Human-centric Explanation of Reinforcement Learning Agents with Large Language Models

Explainable Reinforcement Learning (XRL) has emerged as a promising approach in improving the transparency of Reinforcement Learning (RL) agents. However, there remains a gap between complex RL policies and domain experts, due to the…

Artificial Intelligence · Computer Science 2025-09-09 Haechang Kim, Hao Chen, Can Li, Jong Min Lee

Feature-Based Interpretable Reinforcement Learning based on State-Transition Models

Growing concerns regarding the operational usage of AI models in the real-world has caused a surge of interest in explaining AI models' decisions to humans. Reinforcement Learning is not an exception in this regard. In this work, we propose…

Machine Learning · Computer Science 2023-10-06 Omid Davoodi, Majid Komeili

Explain To Decide: A Human-Centric Review on the Role of Explainable Artificial Intelligence in AI-assisted Decision Making

The unprecedented performance of machine learning models in recent years, particularly Deep Learning and transformer models, has resulted in their application in various domains such as finance, healthcare, and education. However, the…

Human-Computer Interaction · Computer Science 2023-12-20 Milad Rogha

Complementary reinforcement learning towards explainable agents

Reinforcement learning (RL) algorithms allow agents to learn skills and strategies to perform complex tasks without detailed instructions or expensive labelled training examples. That is, RL agents can learn, as we learn. Given the…

Machine Learning · Computer Science 2019-01-25 Jung Hoon Lee

Understanding Your Agent: Leveraging Large Language Models for Behavior Explanation

Intelligent agents such as robots are increasingly deployed in real-world, safety-critical settings. It is vital that these agents are able to explain the reasoning behind their decisions to human counterparts; however, their behavior is…

Machine Learning · Computer Science 2023-12-01 Xijia Zhang, Yue Guo, Simon Stepputtis, Katia Sycara, Joseph Campbell

Contrastive Explanations for Reinforcement Learning in terms of Expected Consequences

Machine Learning models become increasingly proficient in complex tasks. However, even for experts in the field, it can be difficult to understand what the model learned. This hampers trust and acceptance, and it obstructs the possibility…

Machine Learning · Computer Science 2018-07-24 Jasper van der Waa, Jurriaan van Diggelen, Karel van den Bosch, Mark Neerincx

Explainable Reinforcement Learning Through a Causal Lens

Prevalent theories in cognitive science propose that humans understand and represent the knowledge of the world through causal relationships. In making sense of the world, we build causal models in our mind to encode cause-effect relations…

Machine Learning · Computer Science 2019-11-21 Prashan Madumal, Tim Miller, Liz Sonenberg, Frank Vetere

Counterfactual State Explanations for Reinforcement Learning Agents via Generative Deep Learning

Counterfactual explanations, which deal with "why not?" scenarios, can provide insightful explanations to an AI agent's behavior. In this work, we focus on generating counterfactual explanations for deep reinforcement learning (RL) agents…

Artificial Intelligence · Computer Science 2021-02-01 Matthew L. Olson, Roli Khanna, Lawrence Neal, Fuxin Li, Weng-Keen Wong