Related papers: Programmatically Interpretable Reinforcement Learn…

Synthesizing Programmatic Policies with Actor-Critic Algorithms and ReLU Networks

Programmatically Interpretable Reinforcement Learning (PIRL) encodes policies in human-readable computer programs. Novel algorithms were recently introduced with the goal of handling the lack of gradient signal to guide the search in the…

Machine Learning · Computer Science 2023-08-08 Spyros Orfanos, Levi H. S. Lelis

Neural Logic Reinforcement Learning

Deep reinforcement learning (DRL) has achieved significant breakthroughs in various tasks. However, most DRL algorithms suffer from a problem of generalizing the learned policy, which makes the learning performance largely affected even by minor…

Machine Learning · Computer Science 2019-07-11 Zhengyao Jiang, Shan Luo

Scheduling That Speaks: An Interpretable Programmatic Reinforcement Learning Framework

Deep reinforcement learning (DRL) has recently emerged as a promising approach to solve combinatorial optimization problems such as job shop scheduling. However, the policies learned by DRL are typically represented by deep neural networks…

Machine Learning · Computer Science 2026-05-19 Chengpeng Hu, Yingqian Zhang, Hendrik Baier

Interpretable Modeling of Deep Reinforcement Learning Driven Scheduling

In the field of high-performance computing (HPC), there has been recent exploration into the use of deep reinforcement learning for cluster scheduling (DRL scheduling), which has demonstrated promising outcomes. However, a significant…

Machine Learning · Computer Science 2024-03-26 Boyang Li, Zhiling Lan, Michael E. Papka

Social Interpretable Reinforcement Learning

Reinforcement Learning (RL) bears the promise of being a game-changer in many applications. However, since most of the literature in the field is currently focused on opaque models, the use of RL in high-stakes scenarios, where…

Machine Learning · Computer Science 2025-01-22 Leonardo Lucio Custode, Giovanni Iacca

"So, Tell Me About Your Policy...": Distillation of interpretable policies from Deep Reinforcement Learning agents

Recent advances in Reinforcement Learning (RL) largely benefit from the inclusion of Deep Neural Networks, boosting the number of novel approaches proposed in the field of Deep Reinforcement Learning (DRL). These techniques demonstrate the…

Machine Learning · Computer Science 2025-07-30 Giovanni Dispoto, Paolo Bonetti, Marcello Restelli

Prompt Informed Reinforcement Learning for Visual Coverage Path Planning

Visual coverage path planning with unmanned aerial vehicles (UAVs) requires agents to strategically coordinate UAV motion and camera control to maximize coverage, minimize redundancy, and maintain battery efficiency. Traditional…

Robotics · Computer Science 2025-07-15 Venkat Margapuri

Deep Explainable Relational Reinforcement Learning: A Neuro-Symbolic Approach

Despite numerous successes in Deep Reinforcement Learning (DRL), the learned policies are not interpretable. Moreover, since DRL does not exploit symbolic relational representations, it has difficulties in coping with structural changes in…

Artificial Intelligence · Computer Science 2023-07-17 Rishi Hazra, Luc De Raedt

CIRL: Controllable Imitative Reinforcement Learning for Vision-based Self-driving

Autonomous urban driving navigation with complex multi-agent dynamics is under-explored due to the difficulty of learning an optimal driving policy. The traditional modular pipeline heavily relies on hand-designed rules and the…

Computer Vision and Pattern Recognition · Computer Science 2018-07-11 Xiaodan Liang, Tairui Wang, Luona Yang, Eric Xing

Interpretable Policies for Reinforcement Learning by Genetic Programming

The search for interpretable reinforcement learning policies is of high academic and industrial interest. Especially for industrial systems, domain experts are more likely to deploy autonomously learned controllers if they are…

Artificial Intelligence · Computer Science 2018-04-05 Daniel Hein, Steffen Udluft, Thomas A. Runkler

rePIRL: Learn PRM with Inverse RL for LLM Reasoning

Process rewards have been widely used in deep reinforcement learning to improve training efficiency, reduce variance, and prevent reward hacking. In LLM reasoning, existing works also explore various solutions for learning effective process…

Machine Learning · Computer Science 2026-05-21 Xian Wu, Kaijie Zhu, Ying Zhang, Lun Wang, Wenbo Guo

A Practical Introduction to Deep Reinforcement Learning

Deep reinforcement learning (DRL) has emerged as a powerful framework for solving sequential decision-making problems, achieving remarkable success in a wide range of applications, including game AI, autonomous driving, biomedicine, and…

Machine Learning · Computer Science 2025-05-14 Yinghan Sun, Hongxi Wang, Hua Chen, Wei Zhang

Learning to Synthesize Programs as Interpretable and Generalizable Policies

Recently, deep reinforcement learning (DRL) methods have achieved impressive performance on tasks in a variety of domains. However, neural network policies produced with DRL methods are not human-interpretable and often have difficulty…

Machine Learning · Computer Science 2022-02-02 Dweep Trivedi, Jesse Zhang, Shao-Hua Sun, Joseph J. Lim

Analyzing sequential activity and travel decisions with interpretable deep inverse reinforcement learning

Travel demand modeling has shifted from aggregated trip-based models to behavior-oriented activity-based models because daily trips are essentially driven by human activities. To analyze the sequential activity-travel decisions, deep…

Artificial Intelligence · Computer Science 2025-03-18 Yuebing Liang, Shenhao Wang, Jiangbo Yu, Zhan Zhao, Jinhua Zhao, Sandy Pentland

Learning Interpretable Policies in Hindsight-Observable POMDPs through Partially Supervised Reinforcement Learning

Deep reinforcement learning has demonstrated remarkable achievements across diverse domains such as video games, robotic control, autonomous driving, and drug discovery. Common methodologies in partially-observable domains largely lean on…

Machine Learning · Computer Science 2024-02-15 Michael Lanier, Ying Xu, Nathan Jacobs, Chongjie Zhang, Yevgeniy Vorobeychik

Deep Inductive Logic Programming meets Reinforcement Learning

One approach to explaining the hierarchical levels of understanding within a machine learning model is the symbolic method of inductive logic programming (ILP), which is data efficient and capable of learning first-order logic rules that…

Machine Learning · Computer Science 2023-09-01 Andreas Bueff, Vaishak Belle

Learning Vehicle Routing Problems using Policy Optimisation

Deep reinforcement learning (DRL) has been used to learn effective heuristics for solving complex combinatorial optimisation problems via policy networks and has demonstrated promising performance. Existing works have focused on solving…

Machine Learning · Computer Science 2020-12-25 Nasrin Sultana, Jeffrey Chan, A. K. Qin, Tabinda Sarwar

Human-Readable Programs as Actors of Reinforcement Learning Agents Using Critic-Moderated Evolution

With Deep Reinforcement Learning (DRL) being increasingly considered for the control of real-world systems, the lack of transparency of the neural network at the core of RL becomes a concern. Programmatic Reinforcement Learning (PRL) is…

Machine Learning · Computer Science 2024-10-30 Senne Deproost, Denis Steckelmacher, Ann Nowé

Direct Preference-based Policy Optimization without Reward Modeling

Preference-based reinforcement learning (PbRL) is an approach that enables RL agents to learn from preference, which is particularly useful when formulating a reward function is challenging. Existing PbRL methods generally involve a…

Machine Learning · Computer Science 2023-10-30 Gaon An, Junhyeok Lee, Xingdong Zuo, Norio Kosaka, Kyung-Min Kim, Hyun Oh Song

Imitation-Projected Programmatic Reinforcement Learning

We study the problem of programmatic reinforcement learning, in which policies are represented as short programs in a symbolic language. Programmatic policies can be more interpretable, generalizable, and amenable to formal verification…

Machine Learning · Computer Science 2021-01-21 Abhinav Verma, Hoang M. Le, Yisong Yue, Swarat Chaudhuri