Related papers: Task Scheduling & Forgetting in Multi-Task Reinfor…

Retaining by Doing: The Role of On-Policy Data in Mitigating Forgetting

Adapting language models (LMs) to new tasks via post-training carries the risk of degrading existing capabilities -- a phenomenon classically known as catastrophic forgetting. In this paper, toward identifying guidelines for mitigating this…

Machine Learning · Computer Science 2025-12-04 Howard Chen , Noam Razin , Karthik Narasimhan , Danqi Chen

Multi-task Reinforcement Learning in Reproducing Kernel Hilbert Spaces via Cross-learning

Reinforcement learning (RL) is a framework to optimize a control policy using rewards that are revealed by the system as a response to a control action. In its standard form, RL involves a single agent that uses its policy to accomplish a…

Systems and Control · Electrical Eng. & Systems 2021-11-24 Juan Cervino , Juan Andres Bazerque , Miguel Calvo-Fullana , Alejandro Ribeiro

Task-Agnostic Continual Reinforcement Learning: Gaining Insights and Overcoming Challenges

Continual learning (CL) enables the development of models and agents that learn from a sequence of tasks while addressing the limitations of standard deep learning approaches, such as catastrophic forgetting. In this work, we investigate…

Machine Learning · Computer Science 2023-05-19 Massimo Caccia , Jonas Mueller , Taesup Kim , Laurent Charlin , Rasool Fakoor

Machine versus Human Attention in Deep Reinforcement Learning Tasks

Deep reinforcement learning (RL) algorithms are powerful tools for solving visuomotor decision tasks. However, the trained models are often difficult to interpret, because they are represented as end-to-end deep neural networks. In this…

Machine Learning · Computer Science 2021-11-04 Sihang Guo , Ruohan Zhang , Bo Liu , Yifeng Zhu , Mary Hayhoe , Dana Ballard , Peter Stone

Learning to Modulate pre-trained Models in RL

Reinforcement Learning (RL) has been successful in various domains like robotics, game playing, and simulation. While RL agents have shown impressive capabilities in their specific tasks, they insufficiently adapt to new tasks. In…

Machine Learning · Computer Science 2023-10-30 Thomas Schmied , Markus Hofmarcher , Fabian Paischer , Razvan Pascanu , Sepp Hochreiter

Memory Retention Is Not Enough to Master Memory Tasks in Reinforcement Learning

Effective decision-making in the real world depends on memory that is both stable and adaptive: environments change over time, and agents must retain relevant information over long horizons while also updating or overwriting outdated…

Machine Learning · Computer Science 2026-01-22 Oleg Shchendrigin , Egor Cherepanov , Alexey K. Kovalev , Aleksandr I. Panov

Think Before You Act: Decision Transformers with Working Memory

Decision Transformer-based decision-making agents have shown the ability to generalize across multiple tasks. However, their performance relies on massive data and computation. We argue that this inefficiency stems from the forgetting…

Machine Learning · Computer Science 2024-05-30 Jikun Kang , Romain Laroche , Xingdi Yuan , Adam Trischler , Xue Liu , Jie Fu

Improving generalization to new environments and removing catastrophic forgetting in Reinforcement Learning by using an eco-system of agents

Adapting a Reinforcement Learning (RL) agent to an unseen environment is a difficult task due to typical over-fitting on the training environment. RL agents are often capable of solving environments very close to the trained environment,…

Artificial Intelligence · Computer Science 2022-07-04 Olivier Moulin , Vincent Francois-Lavet , Paul Elbers , Mark Hoogendoorn

Can you see how I learn? Human observers' inferences about Reinforcement Learning agents' learning processes

Reinforcement Learning (RL) agents often exhibit learning behaviors that are not intuitively interpretable by human observers, which can result in suboptimal feedback in collaborative teaching settings. Yet, how humans perceive and…

Human-Computer Interaction · Computer Science 2025-06-17 Bernhard Hilpert , Muhan Hou , Kim Baraka , Joost Broekens

Forgetting in Language Models: Capacity, Optimization, and Self-Generated Replay

Models trained on a new task typically degrade on prior tasks, a phenomenon known as forgetting. Traditionally, mitigating forgetting has required replaying stored exemplars from prior tasks, which is often impractical. By contrast,…

Machine Learning · Computer Science 2026-05-26 Martin Marek , Dongkyu Cho , Shikai Qiu , Rumi Chunara , Pavel Izmailov , Andrew Gordon Wilson

What Training Data Teaches RL Memory Agents: An Empirical Study of Curriculum Effects in Memory-Augmented QA

Reinforcement learning (RL) has emerged as a viable recipe for training LLM agents to reason over external memory banks in multi-session dialogue. Existing work trains exclusively on a single benchmark, leaving open how the composition of…

Computation and Language · Computer Science 2026-05-25 Xinjie He , Zhiyuan Lin , Su Liu , Jialun Wu , Qiyang Xie , Weikai Zhou , Shuai Xiao

Unraveling the Complexity of Memory in RL Agents: an Approach for Classification and Evaluation

The incorporation of memory into agents is essential for numerous tasks within the domain of Reinforcement Learning (RL). In particular, memory is paramount for tasks that require the use of past information, adaptation to novel…

Machine Learning · Computer Science 2026-03-05 Egor Cherepanov , Nikita Kachaev , Artem Zholus , Alexey K. Kovalev , Aleksandr I. Panov

On Memory Mechanism in Multi-Agent Reinforcement Learning

Multi-agent reinforcement learning (MARL) extends (single-agent) reinforcement learning (RL) by introducing additional agents and (potentially) partial observability of the environment. Consequently, algorithms for solving MARL problems…

Multiagent Systems · Computer Science 2019-09-12 Yilun Zhou , Derrik E. Asher , Nicholas R. Waytowich , Julie A. Shah

Expert-Free Online Transfer Learning in Multi-Agent Reinforcement Learning

Reinforcement Learning (RL) enables an intelligent agent to optimise its performance in a task by continuously taking action from an observed state and receiving a feedback from the environment in form of rewards. RL typically uses tables…

Artificial Intelligence · Computer Science 2025-01-28 Alberto Castagna

Hierarchical Reinforcement Learning as a Model of Human Task Interleaving

How do people decide how long to continue in a task, when to switch, and to which other task? Understanding the mechanisms that underpin task interleaving is a long-standing goal in the cognitive sciences. Prior work suggests greedy…

Artificial Intelligence · Computer Science 2020-01-08 Christoph Gebhardt , Antti Oulasvirta , Otmar Hilliges

Contrastive Explanations for Comparing Preferences of Reinforcement Learning Agents

In complex tasks where the reward function is not straightforward and consists of a set of objectives, multiple reinforcement learning (RL) policies that perform task adequately, but employ different strategies can be trained by adjusting…

Artificial Intelligence · Computer Science 2021-12-20 Jasmina Gajcin , Rahul Nair , Tejaswini Pedapati , Radu Marinescu , Elizabeth Daly , Ivana Dusparic

AMAGO-2: Breaking the Multi-Task Barrier in Meta-Reinforcement Learning with Transformers

Language models trained on diverse datasets unlock generalization by in-context learning. Reinforcement Learning (RL) policies can achieve a similar effect by meta-learning within the memory of a sequence model. However, meta-RL research…

Machine Learning · Computer Science 2024-11-19 Jake Grigsby , Justin Sasek , Samyak Parajuli , Daniel Adebi , Amy Zhang , Yuke Zhu

A Task-Efficient Reinforcement Learning Task-Motion Planner for Safe Human-Robot Cooperation

In a Human-Robot Cooperation (HRC) environment, safety and efficiency are the two core properties to evaluate robot performance. However, safety mechanisms usually hinder task efficiency since human intervention will cause backup motions…

Robotics · Computer Science 2025-10-15 Gaoyuan Liu , Joris de Winter , Kelly Merckaert , Denis Steckelmacher , Ann Nowe , Bram Vanderborght

What is Going on Inside Recurrent Meta Reinforcement Learning Agents?

Recurrent meta reinforcement learning (meta-RL) agents are agents that employ a recurrent neural network (RNN) for the purpose of "learning a learning algorithm". After being trained on a pre-specified task distribution, the learned weights…

Machine Learning · Computer Science 2021-05-03 Safa Alver , Doina Precup

Unveiling Over-Memorization in Finetuning LLMs for Reasoning Tasks

The pretrained large language models (LLMs) are finetuned with labeled data for better instruction following ability and alignment with human values. In this paper, we study the learning dynamics of LLM finetuning on reasoning tasks and…

Computation and Language · Computer Science 2025-09-30 Zhiwen Ruan , Yun Chen , Yutao Hou , Peng Li , Yang Liu , Guanhua Chen