Related papers: Programmatically Interpretable Reinforcement Learn…

200 papers

Programmatically Interpretable Reinforcement Learning (PIRL) encodes policies in human-readable computer programs. Novel algorithms were recently introduced with the goal of handling the lack of gradient signal to guide the search in the…

Machine Learning · Computer Science 2023-08-08 Spyros Orfanos, Levi H. S. Lelis

Deep reinforcement learning (DRL) has achieved significant breakthroughs in various tasks. However, most DRL algorithms suffer from a problem of generalizing the learned policy, which makes the learning performance largely affected even by minor…

Machine Learning · Computer Science 2019-07-11 Zhengyao Jiang, Shan Luo

Deep reinforcement learning (DRL) has recently emerged as a promising approach to solve combinatorial optimization problems such as job shop scheduling. However, the policies learned by DRL are typically represented by deep neural networks…

Machine Learning · Computer Science 2026-05-19 Chengpeng Hu, Yingqian Zhang, Hendrik Baier

In the field of high-performance computing (HPC), there has been recent exploration into the use of deep reinforcement learning for cluster scheduling (DRL scheduling), which has demonstrated promising outcomes. However, a significant…

Machine Learning · Computer Science 2024-03-26 Boyang Li, Zhiling Lan, Michael E. Papka

Reinforcement Learning (RL) bears the promise of being a game-changer in many applications. However, since most of the literature in the field is currently focused on opaque models, the use of RL in high-stakes scenarios, where…

Machine Learning · Computer Science 2025-01-22 Leonardo Lucio Custode, Giovanni Iacca

Recent advances in Reinforcement Learning (RL) largely benefit from the inclusion of Deep Neural Networks, boosting the number of novel approaches proposed in the field of Deep Reinforcement Learning (DRL). These techniques demonstrate the…

Machine Learning · Computer Science 2025-07-30 Giovanni Dispoto, Paolo Bonetti, Marcello Restelli

Visual coverage path planning with unmanned aerial vehicles (UAVs) requires agents to strategically coordinate UAV motion and camera control to maximize coverage, minimize redundancy, and maintain battery efficiency. Traditional…

Robotics · Computer Science 2025-07-15 Venkat Margapuri

Despite numerous successes in Deep Reinforcement Learning (DRL), the learned policies are not interpretable. Moreover, since DRL does not exploit symbolic relational representations, it has difficulties in coping with structural changes in…

Artificial Intelligence · Computer Science 2023-07-17 Rishi Hazra, Luc De Raedt

Autonomous urban driving navigation with complex multi-agent dynamics is under-explored due to the difficulty of learning an optimal driving policy. The traditional modular pipeline heavily relies on hand-designed rules and the…

Computer Vision and Pattern Recognition · Computer Science 2018-07-11 Xiaodan Liang, Tairui Wang, Luona Yang, Eric Xing

The search for interpretable reinforcement learning policies is of high academic and industrial interest. Especially for industrial systems, domain experts are more likely to deploy autonomously learned controllers if they are…

Artificial Intelligence · Computer Science 2018-04-05 Daniel Hein, Steffen Udluft, Thomas A. Runkler

Process rewards have been widely used in deep reinforcement learning to improve training efficiency, reduce variance, and prevent reward hacking. In LLM reasoning, existing works also explore various solutions for learning effective process…

Machine Learning · Computer Science 2026-05-21 Xian Wu, Kaijie Zhu, Ying Zhang, Lun Wang, Wenbo Guo

Deep reinforcement learning (DRL) has emerged as a powerful framework for solving sequential decision-making problems, achieving remarkable success in a wide range of applications, including game AI, autonomous driving, biomedicine, and…

Machine Learning · Computer Science 2025-05-14 Yinghan Sun, Hongxi Wang, Hua Chen, Wei Zhang

Recently, deep reinforcement learning (DRL) methods have achieved impressive performance on tasks in a variety of domains. However, neural network policies produced with DRL methods are not human-interpretable and often have difficulty…

Machine Learning · Computer Science 2022-02-02 Dweep Trivedi, Jesse Zhang, Shao-Hua Sun, Joseph J. Lim

Travel demand modeling has shifted from aggregated trip-based models to behavior-oriented activity-based models because daily trips are essentially driven by human activities. To analyze the sequential activity-travel decisions, deep…

Artificial Intelligence · Computer Science 2025-03-18 Yuebing Liang, Shenhao Wang, Jiangbo Yu, Zhan Zhao, Jinhua Zhao, Sandy Pentland

Deep reinforcement learning has demonstrated remarkable achievements across diverse domains such as video games, robotic control, autonomous driving, and drug discovery. Common methodologies in partially-observable domains largely lean on…

Machine Learning · Computer Science 2024-02-15 Michael Lanier, Ying Xu, Nathan Jacobs, Chongjie Zhang, Yevgeniy Vorobeychik

One approach to explaining the hierarchical levels of understanding within a machine learning model is the symbolic method of inductive logic programming (ILP), which is data efficient and capable of learning first-order logic rules that…

Machine Learning · Computer Science 2023-09-01 Andreas Bueff, Vaishak Belle

Deep reinforcement learning (DRL) has been used to learn effective heuristics for solving complex combinatorial optimisation problems via policy networks and has demonstrated promising performance. Existing works have focused on solving…

Machine Learning · Computer Science 2020-12-25 Nasrin Sultana, Jeffrey Chan, A. K. Qin, Tabinda Sarwar

With Deep Reinforcement Learning (DRL) being increasingly considered for the control of real-world systems, the lack of transparency of the neural network at the core of RL becomes a concern. Programmatic Reinforcement Learning (PRL) is…

Machine Learning · Computer Science 2024-10-30 Senne Deproost, Denis Steckelmacher, Ann Nowé

Preference-based reinforcement learning (PbRL) is an approach that enables RL agents to learn from preference, which is particularly useful when formulating a reward function is challenging. Existing PbRL methods generally involve a…

Machine Learning · Computer Science 2023-10-30 Gaon An, Junhyeok Lee, Xingdong Zuo, Norio Kosaka, Kyung-Min Kim, Hyun Oh Song

We study the problem of programmatic reinforcement learning, in which policies are represented as short programs in a symbolic language. Programmatic policies can be more interpretable, generalizable, and amenable to formal verification…

Machine Learning · Computer Science 2021-01-21 Abhinav Verma, Hoang M. Le, Yisong Yue, Swarat Chaudhuri