Related papers: Dual Behavior Regularized Reinforcement Learning

Online Reinforcement Learning with Passive Memory

This paper considers an online reinforcement learning algorithm that leverages pre-collected data (passive memory) from the environment for online interaction. We show that using passive memory improves performance and further provide…

Machine Learning · Computer Science 2024-10-21 Anay Pattanaik , Lav R. Varshney

Competitive Multi-Agent Deep Reinforcement Learning with Counterfactual Thinking

Counterfactual thinking describes a psychological phenomenon that people re-infer the possible results with different solutions about things that have already happened. It helps people to gain more experience from mistakes and thus to…

Machine Learning · Computer Science 2019-08-19 Yue Wang , Yao Wan , Chenwei Zhang , Lixin Cui , Lu Bai , Philip S. Yu

Reinforcement Learning in Economics and Finance

Reinforcement learning algorithms describe how an agent can learn an optimal action policy in a sequential decision process, through repeated experience. In a given environment, the agent policy provides him some running and terminal…

Theoretical Economics · Economics 2020-03-24 Arthur Charpentier , Romuald Elie , Carl Remlinger

Robust Deep Reinforcement Learning against Adversarial Behavior Manipulation

This study investigates behavior-targeted attacks on reinforcement learning and their countermeasures. Behavior-targeted attacks aim to manipulate the victim's behavior as desired by the adversary through adversarial interventions in state…

Machine Learning · Computer Science 2026-02-18 Shojiro Yamabe , Kazuto Fukuchi , Jun Sakuma

Reinforcement Learning in Education: A Multi-Armed Bandit Approach

Advances in reinforcement learning research have demonstrated the ways in which different agent-based models can learn how to optimally perform a task within a given environment. Reinforcement leaning solves unsupervised problems where…

Machine Learning · Computer Science 2022-11-03 Herkulaas Combrink , Vukosi Marivate , Benjamin Rosman

Performative Reinforcement Learning

We introduce the framework of performative reinforcement learning where the policy chosen by the learner affects the underlying reward and transition dynamics of the environment. Following the recent literature on performative…

Machine Learning · Computer Science 2023-06-08 Debmalya Mandal , Stelios Triantafyllou , Goran Radanovic

Keep Doing What Worked: Behavioral Modelling Priors for Offline Reinforcement Learning

Off-policy reinforcement learning algorithms promise to be applicable in settings where only a fixed data-set (batch) of environment interactions is available and no new experience can be acquired. This property makes these algorithms…

Machine Learning · Computer Science 2020-06-18 Noah Y. Siegel , Jost Tobias Springenberg , Felix Berkenkamp , Abbas Abdolmaleki , Michael Neunert , Thomas Lampe , Roland Hafner , Nicolas Heess , Martin Riedmiller

Open-Ended Diverse Solution Discovery with Regulated Behavior Patterns for Cross-Domain Adaptation

While Reinforcement Learning can achieve impressive results for complex tasks, the learned policies are generally prone to fail in downstream tasks with even minor model mismatch or unexpected perturbations. Recent works have demonstrated…

Machine Learning · Computer Science 2023-05-23 Kang Xu , Yan Ma , Bingsheng Wei , Wei Li

Regret Minimization for Partially Observable Deep Reinforcement Learning

Deep reinforcement learning algorithms that estimate state and state-action value functions have been shown to be effective in a variety of challenging domains, including learning control strategies from raw image pixels. However,…

Machine Learning · Computer Science 2018-10-26 Peter Jin , Kurt Keutzer , Sergey Levine

Reinforcement Learning with an Abrupt Model Change

The problem of reinforcement learning is considered where the environment or the model undergoes a change. An algorithm is proposed that an agent can apply in such a problem to achieve the optimal long-time discounted reward. The algorithm…

Systems and Control · Electrical Eng. & Systems 2023-04-25 Wuxia Chen , Taposh Banerjee , Jemin George , Carl Busart

Explaining Agent's Decision-making in a Hierarchical Reinforcement Learning Scenario

Reinforcement learning is a machine learning approach based on behavioral psychology. It is focused on learning agents that can acquire knowledge and learn to carry out new tasks by interacting with the environment. However, a problem…

Artificial Intelligence · Computer Science 2022-12-15 Hugo Muñoz , Ernesto Portugal , Angel Ayala , Bruno Fernandes , Francisco Cruz

Efficient Adaptation of Reinforcement Learning Agents to Sudden Environmental Change

Real-world autonomous decision-making systems, from robots to recommendation engines, must operate in environments that change over time. While deep reinforcement learning (RL) has shown an impressive ability to learn optimal policies in…

Machine Learning · Computer Science 2025-05-16 Jonathan Clifford Balloch

Reinforcement Learning with Brain-Inspired Modulation can Improve Adaptation to Environmental Changes

Developments in reinforcement learning (RL) have allowed algorithms to achieve impressive performance in highly complex, but largely static problems. In contrast, biological learning seems to value efficiency of adaptation to a…

Artificial Intelligence · Computer Science 2022-05-20 Eric Chalmers , Artur Luczak

Doubly Optimal Policy Evaluation for Reinforcement Learning

Policy evaluation estimates the performance of a policy by (1) collecting data from the environment and (2) processing raw data into a meaningful estimate. Due to the sequential nature of reinforcement learning, any improper data-collecting…

Machine Learning · Computer Science 2025-03-21 Shuze Daniel Liu , Claire Chen , Shangtong Zhang

Importance Weighted Policy Learning and Adaptation

The ability to exploit prior experience to solve novel problems rapidly is a hallmark of biological learning systems and of great practical importance for artificial ones. In the meta reinforcement learning literature much recent work has…

Machine Learning · Computer Science 2021-06-07 Alexandre Galashov , Jakub Sygnowski , Guillaume Desjardins , Jan Humplik , Leonard Hasenclever , Rae Jeong , Yee Whye Teh , Nicolas Heess

Adversarial Imitation via Variational Inverse Reinforcement Learning

We consider a problem of learning the reward and policy from expert examples under unknown dynamics. Our proposed method builds on the framework of generative adversarial networks and introduces the empowerment-regularized maximum-entropy…

Machine Learning · Computer Science 2019-02-26 Ahmed H. Qureshi , Byron Boots , Michael C. Yip

Why Online Reinforcement Learning is Causal

Reinforcement learning (RL) and causal modelling naturally complement each other. The goal of causal modelling is to predict the effects of interventions in an environment, while the goal of reinforcement learning is to select interventions…

Machine Learning · Computer Science 2024-07-12 Oliver Schulte , Pascal Poupart

Reinforcement Learning via Fenchel-Rockafellar Duality

We review basic concepts of convex duality, focusing on the very general and supremely useful Fenchel-Rockafellar duality. We summarize how this duality may be applied to a variety of reinforcement learning (RL) settings, including policy…

Machine Learning · Computer Science 2020-01-13 Ofir Nachum , Bo Dai

Deep Reinforcement Learning for Resource Allocation in Business Processes

Assigning resources in business processes execution is a repetitive task that can be effectively automated. However, different automation methods may give varying results that may not be optimal. Proper resource allocation is crucial as it…

Machine Learning · Computer Science 2021-04-02 Kamil Żbikowski , Michał Ostapowicz , Piotr Gawrysiak

A Survey of In-Context Reinforcement Learning

Reinforcement learning (RL) agents typically optimize their policies by performing expensive backward passes to update their network parameters. However, some agents can solve new tasks without updating any parameters by simply conditioning…

Machine Learning · Computer Science 2025-02-13 Amir Moeini , Jiuqi Wang , Jacob Beck , Ethan Blaser , Shimon Whiteson , Rohan Chandra , Shangtong Zhang