Related papers: Learn to Interpret Atari Agents

Explaining Deep Reinforcement Learning Agents In The Atari Domain through a Surrogate Model

One major barrier to applications of deep Reinforcement Learning (RL) both inside and outside of games is the lack of explainability. In this paper, we describe a lightweight and effective method to derive explanations for deep RL agents,…

Machine Learning · Computer Science 2021-10-08 Alexander Sieusahai , Matthew Guzdial

Self-Supervised Discovering of Interpretable Features for Reinforcement Learning

Deep reinforcement learning (RL) has recently led to many breakthroughs on a range of complex control tasks. However, the agent's decision-making process is generally not transparent. The lack of interpretability hinders the applicability…

Computer Vision and Pattern Recognition · Computer Science 2021-03-22 Wenjie Shi , Gao Huang , Shiji Song , Zhuoyuan Wang , Tingyu Lin , Cheng Wu

Visualizing and Understanding Atari Agents

While deep reinforcement learning (deep RL) agents are effective at maximizing rewards, it is often unclear what strategies they use to do so. In this paper, we take a step toward explaining deep RL agents through a case study using Atari…

Artificial Intelligence · Computer Science 2018-09-12 Sam Greydanus , Anurag Koul , Jonathan Dodge , Alan Fern

Investigating the Properties of Neural Network Representations in Reinforcement Learning

In this paper we investigate the properties of representations learned by deep reinforcement learning systems. Much of the early work on representations for reinforcement learning focused on designing fixed-basis architectures to achieve…

Machine Learning · Computer Science 2023-05-08 Han Wang , Erfan Miahi , Martha White , Marlos C. Machado , Zaheer Abbas , Raksha Kumaraswamy , Vincent Liu , Adam White

Visual Rationalizations in Deep Reinforcement Learning for Atari Games

Due to the capability of deep learning to perform well in high dimensional problems, deep reinforcement learning agents perform well in challenging tasks such as Atari 2600 games. However, clearly explaining why a certain action is taken by…

Machine Learning · Computer Science 2019-02-05 Laurens Weitkamp , Elise van der Pol , Zeynep Akata

Is Deep Reinforcement Learning Really Superhuman on Atari? Leveling the playing field

Consistent and reproducible evaluation of Deep Reinforcement Learning (DRL) is not straightforward. In the Arcade Learning Environment (ALE), small changes in environment parameters such as stochasticity or the maximum allowed play time can…

Artificial Intelligence · Computer Science 2019-11-11 Marin Toromanoff , Emilie Wirbel , Fabien Moutarde

Machine versus Human Attention in Deep Reinforcement Learning Tasks

Deep reinforcement learning (RL) algorithms are powerful tools for solving visuomotor decision tasks. However, the trained models are often difficult to interpret, because they are represented as end-to-end deep neural networks. In this…

Machine Learning · Computer Science 2021-11-04 Sihang Guo , Ruohan Zhang , Bo Liu , Yifeng Zhu , Mary Hayhoe , Dana Ballard , Peter Stone

Action Q-Transformer: Visual Explanation in Deep Reinforcement Learning with Encoder-Decoder Model using Action Query

The excellent performance of Transformer in supervised learning has led to growing interest in its potential application to deep reinforcement learning (DRL) to achieve high performance on a wide variety of problems. However, the decision…

Machine Learning · Computer Science 2023-06-27 Hidenori Itaya , Tsubasa Hirakawa , Takayoshi Yamashita , Hironobu Fujiyoshi , Komei Sugiura

Interpretable end-to-end Neurosymbolic Reinforcement Learning agents

Deep reinforcement learning (RL) agents rely on shortcut learning, preventing them from generalizing to slightly different environments. To address this problem, symbolic method, that use object-centric states, have been developed. However,…

Artificial Intelligence · Computer Science 2025-11-04 Nils Grandien , Quentin Delfosse , Kristian Kersting

Deep Reinforcement Learning With Macro-Actions

Deep reinforcement learning has been shown to be a powerful framework for learning policies from complex high-dimensional sensory inputs to actions in complex tasks, such as the Atari domain. In this paper, we explore output representation…

Machine Learning · Computer Science 2016-06-16 Ishan P. Durugkar , Clemens Rosenbaum , Stefan Dernbach , Sridhar Mahadevan

Beyond The Rainbow: High Performance Deep Reinforcement Learning on a Desktop PC

Rainbow Deep Q-Network (DQN) demonstrated combining multiple independent enhancements could significantly boost a reinforcement learning (RL) agent's performance. In this paper, we present "Beyond The Rainbow" (BTR), a novel algorithm that…

Artificial Intelligence · Computer Science 2025-05-22 Tyler Clark , Mark Towers , Christine Evers , Jonathon Hare

Explainable Deep Reinforcement Learning Using Introspection in a Non-episodic Task

Explainable reinforcement learning allows artificial agents to explain their behavior in a human-like manner aiming at non-expert end-users. An efficient alternative of creating explanations is to use an introspection-based method that…

Machine Learning · Computer Science 2021-08-23 Angel Ayala , Francisco Cruz , Bruno Fernandes , Richard Dazeley

Learning to Perceive in Deep Model-Free Reinforcement Learning

This work proposes a novel model-free Reinforcement Learning (RL) agent that is able to learn how to complete an unknown task having access to only a part of the input observation. We take inspiration from the concepts of visual attention…

Machine Learning · Computer Science 2023-01-16 Gonçalo Querido , Alberto Sardinha , Francisco S. Melo

Neural architecture impact on identifying temporally extended Reinforcement Learning tasks

Inspired by recent developments in attention models for image classification and natural language processing, we present various Attention based architectures in reinforcement learning (RL) domain, capable of performing well on OpenAI Gym…

Machine Learning · Computer Science 2023-10-06 Victor Vadakechirayath George

Perspectives for Direct Interpretability in Multi-Agent Deep Reinforcement Learning

Multi-Agent Deep Reinforcement Learning (MADRL) was proven efficient in solving complex problems in robotics or games, yet most of the trained models are hard to interpret. While learning intrinsically interpretable models remains a…

Artificial Intelligence · Computer Science 2025-02-04 Yoann Poupart , Aurélie Beynier , Nicolas Maudet

Resource Management in Wireless Networks via Multi-Agent Deep Reinforcement Learning

We propose a mechanism for distributed resource management and interference mitigation in wireless networks using multi-agent deep reinforcement learning (RL). We equip each transmitter in the network with a deep RL agent that receives…

Machine Learning · Computer Science 2021-01-12 Navid Naderializadeh , Jaroslaw Sydir , Meryem Simsek , Hosein Nikopour

Interpret Policies in Deep Reinforcement Learning using SILVER with RL-Guided Labeling: A Model-level Approach to High-dimensional and Multi-action Environments

Deep reinforcement learning (RL) achieves remarkable performance but lacks interpretability, limiting trust in policy behavior. The existing SILVER framework (Li, Siddique, and Cao 2025) explains RL policy via Shapley-based regression but…

Machine Learning · Computer Science 2025-10-27 Yiyu Qian , Su Nguyen , Chao Chen , Qinyue Zhou , Liyuan Zhao

BlendRL: A Framework for Merging Symbolic and Neural Policy Learning

Humans can leverage both symbolic reasoning and intuitive reactions. In contrast, reinforcement learning policies are typically encoded in either opaque systems like neural networks or symbolic systems that rely on predefined symbols and…

Machine Learning · Computer Science 2025-04-22 Hikaru Shindo , Quentin Delfosse , Devendra Singh Dhami , Kristian Kersting

Playing Atari with Deep Reinforcement Learning

We present the first deep learning model to successfully learn control policies directly from high-dimensional sensory input using reinforcement learning. The model is a convolutional neural network, trained with a variant of Q-learning,…

Machine Learning · Computer Science 2013-12-20 Volodymyr Mnih , Koray Kavukcuoglu , David Silver , Alex Graves , Ioannis Antonoglou , Daan Wierstra , Martin Riedmiller

Retrieval-Augmented Reinforcement Learning

Most deep reinforcement learning (RL) algorithms distill experience into parametric behavior policies or value functions via gradient updates. While effective, this approach has several disadvantages: (1) it is computationally expensive,…

Machine Learning · Computer Science 2022-05-25 Anirudh Goyal , Abram L. Friesen , Andrea Banino , Theophane Weber , Nan Rosemary Ke , Adria Puigdomenech Badia , Arthur Guez , Mehdi Mirza , Peter C. Humphreys , Ksenia Konyushkova , Laurent Sifre , Michal Valko , Simon Osindero , Timothy Lillicrap , Nicolas Heess , Charles Blundell