English
Related papers

Related papers: Dual Behavior Regularized Reinforcement Learning

200 papers

This paper considers an online reinforcement learning algorithm that leverages pre-collected data (passive memory) from the environment for online interaction. We show that using passive memory improves performance and further provide…

Machine Learning · Computer Science 2024-10-21 Anay Pattanaik , Lav R. Varshney

Counterfactual thinking describes a psychological phenomenon that people re-infer the possible results with different solutions about things that have already happened. It helps people to gain more experience from mistakes and thus to…

Machine Learning · Computer Science 2019-08-19 Yue Wang , Yao Wan , Chenwei Zhang , Lixin Cui , Lu Bai , Philip S. Yu

Reinforcement learning algorithms describe how an agent can learn an optimal action policy in a sequential decision process, through repeated experience. In a given environment, the agent policy provides him some running and terminal…

Theoretical Economics · Economics 2020-03-24 Arthur Charpentier , Romuald Elie , Carl Remlinger

This study investigates behavior-targeted attacks on reinforcement learning and their countermeasures. Behavior-targeted attacks aim to manipulate the victim's behavior as desired by the adversary through adversarial interventions in state…

Machine Learning · Computer Science 2026-02-18 Shojiro Yamabe , Kazuto Fukuchi , Jun Sakuma

Advances in reinforcement learning research have demonstrated the ways in which different agent-based models can learn how to optimally perform a task within a given environment. Reinforcement leaning solves unsupervised problems where…

Machine Learning · Computer Science 2022-11-03 Herkulaas Combrink , Vukosi Marivate , Benjamin Rosman

We introduce the framework of performative reinforcement learning where the policy chosen by the learner affects the underlying reward and transition dynamics of the environment. Following the recent literature on performative…

Machine Learning · Computer Science 2023-06-08 Debmalya Mandal , Stelios Triantafyllou , Goran Radanovic

Off-policy reinforcement learning algorithms promise to be applicable in settings where only a fixed data-set (batch) of environment interactions is available and no new experience can be acquired. This property makes these algorithms…

While Reinforcement Learning can achieve impressive results for complex tasks, the learned policies are generally prone to fail in downstream tasks with even minor model mismatch or unexpected perturbations. Recent works have demonstrated…

Machine Learning · Computer Science 2023-05-23 Kang Xu , Yan Ma , Bingsheng Wei , Wei Li

Deep reinforcement learning algorithms that estimate state and state-action value functions have been shown to be effective in a variety of challenging domains, including learning control strategies from raw image pixels. However,…

Machine Learning · Computer Science 2018-10-26 Peter Jin , Kurt Keutzer , Sergey Levine

The problem of reinforcement learning is considered where the environment or the model undergoes a change. An algorithm is proposed that an agent can apply in such a problem to achieve the optimal long-time discounted reward. The algorithm…

Systems and Control · Electrical Eng. & Systems 2023-04-25 Wuxia Chen , Taposh Banerjee , Jemin George , Carl Busart

Reinforcement learning is a machine learning approach based on behavioral psychology. It is focused on learning agents that can acquire knowledge and learn to carry out new tasks by interacting with the environment. However, a problem…

Artificial Intelligence · Computer Science 2022-12-15 Hugo Muñoz , Ernesto Portugal , Angel Ayala , Bruno Fernandes , Francisco Cruz

Real-world autonomous decision-making systems, from robots to recommendation engines, must operate in environments that change over time. While deep reinforcement learning (RL) has shown an impressive ability to learn optimal policies in…

Machine Learning · Computer Science 2025-05-16 Jonathan Clifford Balloch

Developments in reinforcement learning (RL) have allowed algorithms to achieve impressive performance in highly complex, but largely static problems. In contrast, biological learning seems to value efficiency of adaptation to a…

Artificial Intelligence · Computer Science 2022-05-20 Eric Chalmers , Artur Luczak

Policy evaluation estimates the performance of a policy by (1) collecting data from the environment and (2) processing raw data into a meaningful estimate. Due to the sequential nature of reinforcement learning, any improper data-collecting…

Machine Learning · Computer Science 2025-03-21 Shuze Daniel Liu , Claire Chen , Shangtong Zhang

The ability to exploit prior experience to solve novel problems rapidly is a hallmark of biological learning systems and of great practical importance for artificial ones. In the meta reinforcement learning literature much recent work has…

We consider a problem of learning the reward and policy from expert examples under unknown dynamics. Our proposed method builds on the framework of generative adversarial networks and introduces the empowerment-regularized maximum-entropy…

Machine Learning · Computer Science 2019-02-26 Ahmed H. Qureshi , Byron Boots , Michael C. Yip

Reinforcement learning (RL) and causal modelling naturally complement each other. The goal of causal modelling is to predict the effects of interventions in an environment, while the goal of reinforcement learning is to select interventions…

Machine Learning · Computer Science 2024-07-12 Oliver Schulte , Pascal Poupart

We review basic concepts of convex duality, focusing on the very general and supremely useful Fenchel-Rockafellar duality. We summarize how this duality may be applied to a variety of reinforcement learning (RL) settings, including policy…

Machine Learning · Computer Science 2020-01-13 Ofir Nachum , Bo Dai

Assigning resources in business processes execution is a repetitive task that can be effectively automated. However, different automation methods may give varying results that may not be optimal. Proper resource allocation is crucial as it…

Machine Learning · Computer Science 2021-04-02 Kamil Żbikowski , Michał Ostapowicz , Piotr Gawrysiak

Reinforcement learning (RL) agents typically optimize their policies by performing expensive backward passes to update their network parameters. However, some agents can solve new tasks without updating any parameters by simply conditioning…

Machine Learning · Computer Science 2025-02-13 Amir Moeini , Jiuqi Wang , Jacob Beck , Ethan Blaser , Shimon Whiteson , Rohan Chandra , Shangtong Zhang
‹ Prev 1 2 3 10 Next ›