Related papers: Concept Learning for Interpretable Multi-Agent Rei…

Interpretable Reinforcement Learning with Multilevel Subgoal Discovery

We propose a novel Reinforcement Learning model for discrete environments, which is inherently interpretable and supports the discovery of deep subgoal hierarchies. In the model, an agent learns information about environment in the form of…

Artificial Intelligence · Computer Science 2022-02-16 Alexander Demin , Denis Ponomaryov

Perspectives for Direct Interpretability in Multi-Agent Deep Reinforcement Learning

Multi-Agent Deep Reinforcement Learning (MADRL) was proven efficient in solving complex problems in robotics or games, yet most of the trained models are hard to interpret. While learning intrinsically interpretable models remains a…

Artificial Intelligence · Computer Science 2025-02-04 Yoann Poupart , Aurélie Beynier , Nicolas Maudet

Modeling Conceptual Understanding in Image Reference Games

An agent who interacts with a wide population of other agents needs to be aware that there may be variations in their understanding of the world. Furthermore, the machinery which they use to perceive may be inherently different, as is the…

Artificial Intelligence · Computer Science 2019-11-20 Rodolfo Corona , Stephan Alaniz , Zeynep Akata

Learning Interpretable Concept-Based Models with Human Feedback

Machine learning models that first learn a representation of a domain in terms of human-understandable concepts, then use it to make predictions, have been proposed to facilitate interpretation and interaction with models trained on…

Machine Learning · Computer Science 2020-12-08 Isaac Lage , Finale Doshi-Velez

Deep Interpretable Models of Theory of Mind

When developing AI systems that interact with humans, it is essential to design both a system that can understand humans, and a system that humans can understand. Most deep network based agent-modeling approaches are 1) not interpretable…

Machine Learning · Computer Science 2021-07-14 Ini Oguntola , Dana Hughes , Katia Sycara

Learning Interpretable Concepts: Unifying Causal Representation Learning and Foundation Models

To build intelligent machine learning systems, there are two broad approaches. One approach is to build inherently interpretable models, as endeavored by the growing field of causal representation learning. The other approach is to build…

Machine Learning · Computer Science 2024-12-10 Goutham Rajendran , Simon Buchholz , Bryon Aragam , Bernhard Schölkopf , Pradeep Ravikumar

"So, Tell Me About Your Policy...": Distillation of interpretable policies from Deep Reinforcement Learning agents

Recent advances in Reinforcement Learning (RL) largely benefit from the inclusion of Deep Neural Networks, boosting the number of novel approaches proposed in the field of Deep Reinforcement Learning (DRL). These techniques demonstrate the…

Machine Learning · Computer Science 2025-07-30 Giovanni Dispoto , Paolo Bonetti , Marcello Restelli

Meaningful Models: Utilizing Conceptual Structure to Improve Machine Learning Interpretability

The last decade has seen huge progress in the development of advanced machine learning models; however, those models are powerless unless human users can interpret them. Here we show how the mind's construction of concepts and meaning can…

Machine Learning · Statistics 2016-07-04 Nick Condry

Exploiting Interpretable Capabilities with Concept-Enhanced Diffusion and Prototype Networks

Concept-based machine learning methods have increasingly gained importance due to the growing interest in making neural networks interpretable. However, concept annotations are generally challenging to obtain, making it crucial to leverage…

Machine Learning · Computer Science 2024-11-06 Alba Carballo-Castro , Sonia Laguna , Moritz Vandenhirtz , Julia E. Vogt

Interpretable to Whom? A Role-based Model for Analyzing Interpretable Machine Learning Systems

Several researchers have argued that a machine learning system's interpretability should be defined in relation to a specific agent or task: we should not ask if the system is interpretable, but to whom is it interpretable. We describe a…

Artificial Intelligence · Computer Science 2018-06-21 Richard Tomsett , Dave Braines , Dan Harborne , Alun Preece , Supriyo Chakraborty

Towards an Interpretable Hierarchical Agent Framework using Semantic Goals

Learning to solve long horizon temporally extended tasks with reinforcement learning has been a challenge for several years now. We believe that it is important to leverage both the hierarchical structure of complex tasks and to use expert…

Machine Learning · Computer Science 2022-10-18 Bharat Prakash , Nicholas Waytowich , Tim Oates , Tinoosh Mohsenin

Reinforcement Learning in an Adaptable Chess Environment for Detecting Human-understandable Concepts

Self-trained autonomous agents developed using machine learning are showing great promise in a variety of control settings, perhaps most remarkably in applications involving autonomous vehicles. The main challenge associated with…

Machine Learning · Computer Science 2022-11-11 Patrik Hammersborg , Inga Strümke

Evaluating Interpretable Reinforcement Learning by Distilling Policies into Programs

There exist applications of reinforcement learning like medicine where policies need to be ''interpretable'' by humans. User studies have shown that some policy classes might be more interpretable than others. However, it is costly to…

Machine Learning · Computer Science 2025-03-12 Hector Kohler , Quentin Delfosse , Waris Radji , Riad Akrour , Philippe Preux

Mingling Foresight with Imagination: Model-Based Cooperative Multi-Agent Reinforcement Learning

Recently, model-based agents have achieved better performance than model-free ones using the same computational budget and training time in single-agent environments. However, due to the complexity of multi-agent systems, it is tough to…

Multiagent Systems · Computer Science 2022-12-08 Zhiwei Xu , Dapeng Li , Bin Zhang , Yuan Zhan , Yunpeng Bai , Guoliang Fan

Responsible Emergent Multi-Agent Behavior

Responsible AI has risen to the forefront of the AI research community. As neural network-based learning algorithms continue to permeate real-world applications, the field of Responsible AI has played a large role in ensuring that such…

Artificial Intelligence · Computer Science 2023-11-06 Niko A. Grupen

Multiagent Reinforcement Learning with Neighbor Action Estimation

Multiagent reinforcement learning, as a prominent intelligent paradigm, enables collaborative decision-making within complex systems. However, existing approaches often rely on explicit action exchange between agents to evaluate action…

Robotics · Computer Science 2026-01-09 Zhenglong Luo , Zhiyong Chen , Aoxiang Liu

Continuous Action Reinforcement Learning from a Mixture of Interpretable Experts

Reinforcement learning (RL) has demonstrated its ability to solve high dimensional tasks by leveraging non-linear function approximators. However, these successes are mostly achieved by 'black-box' policies in simulated domains. When…

Machine Learning · Computer Science 2021-11-19 Riad Akrour , Davide Tateo , Jan Peters

Discovering Latent States for Model Learning: Applying Sensorimotor Contingencies Theory and Predictive Processing to Model Context

Autonomous robots need to be able to adapt to unforeseen situations and to acquire new skills through trial and error. Reinforcement learning in principle offers a suitable methodological framework for this kind of autonomous learning.…

Robotics · Computer Science 2016-08-02 Nikolas J. Hemion

Attentional Policies for Cross-Context Multi-Agent Reinforcement Learning

Many potential applications of reinforcement learning in the real world involve interacting with other agents whose numbers vary over time. We propose new neural policy architectures for these multi-agent problems. In contrast to other…

Machine Learning · Computer Science 2019-06-03 Matthew A. Wright , Roberto Horowitz

Explaining Agent's Decision-making in a Hierarchical Reinforcement Learning Scenario

Reinforcement learning is a machine learning approach based on behavioral psychology. It is focused on learning agents that can acquire knowledge and learn to carry out new tasks by interacting with the environment. However, a problem…

Artificial Intelligence · Computer Science 2022-12-15 Hugo Muñoz , Ernesto Portugal , Angel Ayala , Bruno Fernandes , Francisco Cruz