Related papers: Two-Memory Reinforcement Learning

Efficient Replay Memory Architectures in Multi-Agent Reinforcement Learning for Traffic Congestion Control

Episodic control, inspired by the role of episodic memory in the human brain, has been shown to improve the sample inefficiency of model-free reinforcement learning by reusing high-return past experiences. However, the memory growth of…

Systems and Control · Electrical Eng. & Systems 2024-07-24 Mukul Chodhary , Kevin Octavian , SooJean Han

Neural Episodic Control

Deep reinforcement learning methods attain super-human performance in a wide range of environments. Such methods are grossly inefficient, often taking orders of magnitudes more data than humans to achieve reasonable performance. We propose…

Machine Learning · Computer Science 2017-03-07 Alexander Pritzel , Benigno Uria , Sriram Srinivasan , Adrià Puigdomènech , Oriol Vinyals , Demis Hassabis , Daan Wierstra , Charles Blundell

REMem: Reasoning with Episodic Memory in Language Agent

Humans excel at remembering concrete experiences along spatiotemporal contexts and performing reasoning across those events, i.e., the capacity for episodic memory. In contrast, memory in language agents remains mainly semantic, and current…

Artificial Intelligence · Computer Science 2026-03-03 Yiheng Shu , Saisri Padmaja Jonnalagedda , Xiang Gao , Bernal Jiménez Gutiérrez , Weijian Qi , Kamalika Das , Huan Sun , Yu Su

Generalizable Episodic Memory for Deep Reinforcement Learning

Episodic memory-based methods can rapidly latch onto past successful strategies by a non-parametric memory and improve sample efficiency of traditional reinforcement learning. However, little effort is put into the continuous domain, where…

Machine Learning · Computer Science 2021-06-14 Hao Hu , Jianing Ye , Guangxiang Zhu , Zhizhou Ren , Chongjie Zhang

Memento 2: Learning by Stateful Reflective Memory

We present a theoretical study of continual and experiential learning in large language model agents that combine episodic memory with reinforcement learning. We argue that the key mechanism for continual adaptation, without updating model…

Artificial Intelligence · Computer Science 2026-01-30 Jun Wang

Model-Based Episodic Memory Induces Dynamic Hybrid Controls

Episodic control enables sample efficiency in reinforcement learning by recalling past experiences from an episodic memory. We propose a new model-based episodic memory of trajectories addressing current limitations of episodic control. Our…

Machine Learning · Computer Science 2021-11-09 Hung Le , Thommen Karimpanal George , Majid Abdolshah , Truyen Tran , Svetha Venkatesh

Solving Continuous Control with Episodic Memory

Episodic memory lets reinforcement learning algorithms remember and exploit promising experience from the past to improve agent performance. Previous works on memory mechanisms show benefits of using episodic-based data structures for…

Machine Learning · Computer Science 2021-06-17 Igor Kuznetsov , Andrey Filchenkov

Generating Explanations from Deep Reinforcement Learning Using Episodic Memory

Deep Reinforcement Learning (RL) involves the use of Deep Neural Networks (DNNs) to make sequential decisions in order to maximize reward. For many tasks the resulting sequence of actions produced by a Deep RL policy can be long and…

Artificial Intelligence · Computer Science 2022-07-26 Sam Blakeman , Denis Mareschal

MemRL: Self-Evolving Agents via Runtime Reinforcement Learning on Episodic Memory

The hallmark of human intelligence is the self-evolving ability to master new skills by learning from past experiences. However, current AI agents struggle to emulate this self-evolution: fine-tuning is computationally expensive and prone…

Computation and Language · Computer Science 2026-02-13 Shengtao Zhang , Jiaqian Wang , Ruiwen Zhou , Junwei Liao , Yuchen Feng , Zhuo Li , Yujie Zheng , Weinan Zhang , Ying Wen , Zhiyu Li , Feiyu Xiong , Yutao Qi , Bo Tang , Muning Wen

A Machine with Short-Term, Episodic, and Semantic Memory Systems

Inspired by the cognitive science theory of the explicit human memory systems, we have modeled an agent with short-term, episodic, and semantic memory systems, each of which is modeled with a knowledge graph. To evaluate this system and…

Artificial Intelligence · Computer Science 2026-05-19 Taewoon Kim , Michael Cochez , Vincent François-Lavet , Mark Neerincx , Piek Vossen

Efficient Episodic Memory Utilization of Cooperative Multi-Agent Reinforcement Learning

In cooperative multi-agent reinforcement learning (MARL), agents aim to achieve a common goal, such as defeating enemies or scoring a goal. Existing MARL algorithms are effective but still require significant learning time and often get…

Machine Learning · Computer Science 2024-03-08 Hyungho Na , Yunkyeong Seo , Il-chul Moon

Retrieval-Augmented Reinforcement Learning

Most deep reinforcement learning (RL) algorithms distill experience into parametric behavior policies or value functions via gradient updates. While effective, this approach has several disadvantages: (1) it is computationally expensive,…

Machine Learning · Computer Science 2022-05-25 Anirudh Goyal , Abram L. Friesen , Andrea Banino , Theophane Weber , Nan Rosemary Ke , Adria Puigdomenech Badia , Arthur Guez , Mehdi Mirza , Peter C. Humphreys , Ksenia Konyushkova , Laurent Sifre , Michal Valko , Simon Osindero , Timothy Lillicrap , Nicolas Heess , Charles Blundell

Generalization of Reinforcement Learners with Working and Episodic Memory

Memory is an important aspect of intelligence and plays a role in many deep reinforcement learning models. However, little progress has been made in understanding when specific memory systems help more than others and how well they…

Machine Learning · Computer Science 2020-02-20 Meire Fortunato , Melissa Tan , Ryan Faulkner , Steven Hansen , Adrià Puigdomènech Badia , Gavin Buttimore , Charlie Deck , Joel Z Leibo , Charles Blundell

Integrating Episodic Memory into a Reinforcement Learning Agent using Reservoir Sampling

Episodic memory is a psychology term which refers to the ability to recall specific events from the past. We suggest one advantage of this particular type of memory is the ability to easily assign credit to a specific state when remembered…

Machine Learning · Computer Science 2018-06-05 Kenny J. Young , Richard S. Sutton , Shuo Yang

Continuous Episodic Control

Non-parametric episodic memory can be used to quickly latch onto high-rewarded experience in reinforcement learning tasks. In contrast to parametric deep reinforcement learning approaches in which reward signals need to be back-propagated…

Machine Learning · Computer Science 2023-04-25 Zhao Yang , Thomas M. Moerland , Mike Preuss , Aske Plaat

Position: Episodic Memory is the Missing Piece for Long-Term LLM Agents

As Large Language Models (LLMs) evolve from text-completion tools into fully fledged agents operating in dynamic environments, they must address the challenge of continually learning and retaining long-term knowledge. Many biological…

Artificial Intelligence · Computer Science 2025-02-12 Mathis Pink , Qinyuan Wu , Vy Ai Vo , Javier Turek , Jianing Mu , Alexander Huth , Mariya Toneva

Continual and Multi-task Reinforcement Learning With Shared Episodic Memory

Episodic memory plays an important role in the behavior of animals and humans. It allows the accumulation of information about current state of the environment in a task-agnostic way. This episodic representation can be later accessed by…

Neural and Evolutionary Computing · Computer Science 2019-05-08 Artyom Y. Sorokin , Mikhail S. Burtsev

Multi-Agent Deep Reinforcement Learning with Human Strategies

Deep learning has enabled traditional reinforcement learning methods to deal with high-dimensional problems. However, one of the disadvantages of deep reinforcement learning methods is the limited exploration capacity of learning agents. In…

Machine Learning · Computer Science 2019-07-30 Thanh Nguyen , Ngoc Duy Nguyen , Saeid Nahavandi

DeltaMem: Towards Agentic Memory Management via Reinforcement Learning

Recent advances in persona-centric memory have revealed the powerful capability of multi-agent systems in managing persona memory, especially in conversational scenarios. However, these complex frameworks often suffer from information loss…

Computation and Language · Computer Science 2026-04-03 Qi Zhang , Shen Huang , Chu Liu , Shouqing Yang , Junbo Zhao , Haobo Wang , Pengjun Xie

Episodic Memory Deep Q-Networks

Reinforcement learning (RL) algorithms have made huge progress in recent years by leveraging the power of deep neural networks (DNN). Despite the success, deep RL algorithms are known to be sample inefficient, often requiring many rounds of…

Machine Learning · Computer Science 2018-05-22 Zichuan Lin , Tianqi Zhao , Guangwen Yang , Lintao Zhang