Related papers: Sequential memory improves sample and memory effic…

Continuous Episodic Control

Non-parametric episodic memory can be used to quickly latch onto high-rewarded experience in reinforcement learning tasks. In contrast to parametric deep reinforcement learning approaches in which reward signals need to be back-propagated…

Machine Learning · Computer Science 2023-04-25 Zhao Yang, Thomas M. Moerland, Mike Preuss, Aske Plaat

Model-Free Episodic Control

State of the art deep reinforcement learning algorithms take many millions of interactions to attain human-level performance. Humans, on the other hand, can very quickly exploit highly rewarding nuances of an environment upon first…

Machine Learning · Statistics 2016-06-15 Charles Blundell, Benigno Uria, Alexander Pritzel, Yazhe Li, Avraham Ruderman, Joel Z Leibo, Jack Rae, Daan Wierstra, Demis Hassabis

Memory-Efficient Episodic Control Reinforcement Learning with Dynamic Online k-means

Recently, neuro-inspired episodic control (EC) methods have been developed to overcome the data-inefficiency of standard deep reinforcement learning approaches. Using non-/semi-parametric models to estimate the value function, they learn…

Machine Learning · Computer Science 2019-11-22 Andrea Agostinelli, Kai Arulkumaran, Marta Sarrico, Pierre Richemond, Anil Anthony Bharath

Towards sample-efficient episodic control with DAC-ML

The sample-inefficiency problem in Artificial Intelligence refers to the inability of current Deep Reinforcement Learning models to optimize action policies within a small number of episodes. Recent studies have tried to overcome this…

Artificial Intelligence · Computer Science 2020-12-29 Ismael T. Freire, Adrián F. Amil, Vasiliki Vouloutsi, Paul F. M. J. Verschure

Model-Based Episodic Memory Induces Dynamic Hybrid Controls

Episodic control enables sample efficiency in reinforcement learning by recalling past experiences from an episodic memory. We propose a new model-based episodic memory of trajectories addressing current limitations of episodic control. Our…

Machine Learning · Computer Science 2021-11-09 Hung Le, Thommen Karimpanal George, Majid Abdolshah, Truyen Tran, Svetha Venkatesh

High-fidelity social learning via shared episodic memories enhances collaborative foraging through mnemonic convergence

Social learning, a cornerstone of cultural evolution, enables individuals to acquire knowledge by observing and imitating others. At the heart of its efficacy lies episodic memory, which encodes specific behavioral sequences to facilitate…

Artificial Intelligence · Computer Science 2024-12-31 Ismael T. Freire, Paul Verschure

State-based Episodic Memory for Multi-Agent Reinforcement Learning

Multi-agent reinforcement learning (MARL) algorithms have made promising progress in recent years by leveraging the centralized training and decentralized execution (CTDE) paradigm. However, existing MARL algorithms still suffer from the…

Machine Learning · Computer Science 2021-10-20 Xiao Ma, Wu-Jun Li

ESRL: Efficient Sampling-based Reinforcement Learning for Sequence Generation

Applying Reinforcement Learning (RL) to sequence generation models enables the direct optimization of long-term rewards (e.g., BLEU and human feedback), but typically requires large-scale sampling over a space of action sequences.…

Computation and Language · Computer Science 2023-08-07 Chenglong Wang, Hang Zhou, Yimin Hu, Yifu Huo, Bei Li, Tongran Liu, Tong Xiao, Jingbo Zhu

Continual and Multi-task Reinforcement Learning With Shared Episodic Memory

Episodic memory plays an important role in the behavior of animals and humans. It allows the accumulation of information about the current state of the environment in a task-agnostic way. This episodic representation can be later accessed by…

Neural and Evolutionary Computing · Computer Science 2019-05-08 Artyom Y. Sorokin, Mikhail S. Burtsev

Episodic Reinforcement Learning with Expanded State-reward Space

Empowered by deep neural networks, deep reinforcement learning (DRL) has demonstrated tremendous empirical successes in various domains, including games, health care, and autonomous driving. Despite these advancements, DRL is still…

Machine Learning · Computer Science 2024-01-22 Dayang Liang, Yaru Zhang, Yunlong Liu

Solving Continuous Control with Episodic Memory

Episodic memory lets reinforcement learning algorithms remember and exploit promising experience from the past to improve agent performance. Previous works on memory mechanisms show benefits of using episodic-based data structures for…

Machine Learning · Computer Science 2021-06-17 Igor Kuznetsov, Andrey Filchenkov

Efficient Replay Memory Architectures in Multi-Agent Reinforcement Learning for Traffic Congestion Control

Episodic control, inspired by the role of episodic memory in the human brain, has been shown to improve the sample inefficiency of model-free reinforcement learning by reusing high-return past experiences. However, the memory growth of…

Systems and Control · Electrical Eng. & Systems 2024-07-24 Mukul Chodhary, Kevin Octavian, SooJean Han

Is prioritized sweeping the better episodic control?

Episodic control has been proposed as a third approach to reinforcement learning, besides model-free and model-based control, by analogy with the three types of human memory, i.e., episodic, procedural and semantic memory. But the…

Artificial Intelligence · Computer Science 2018-08-13 Johanni Brea

Continual Learning with Strong Experience Replay

Continual Learning (CL) aims at incrementally learning new tasks without forgetting the knowledge acquired from old ones. Experience Replay (ER) is a simple and effective rehearsal-based strategy, which optimizes the model with current…

Computer Vision and Pattern Recognition · Computer Science 2023-12-05 Tao Zhuo, Zhiyong Cheng, Zan Gao, Hehe Fan, Mohan Kankanhalli

Integrating Episodic Memory into a Reinforcement Learning Agent using Reservoir Sampling

Episodic memory is a psychology term which refers to the ability to recall specific events from the past. We suggest one advantage of this particular type of memory is the ability to easily assign credit to a specific state when remembered…

Machine Learning · Computer Science 2018-06-05 Kenny J. Young, Richard S. Sutton, Shuo Yang

Episodic memory governs choices: An RNN-based reinforcement learning model for decision-making task

Typical methods to study cognitive function are to record the electrical activities of animal neurons during the training of animals performing behavioral tasks. A key problem is that they fail to record all the relevant neurons in the…

Neurons and Cognition · Quantitative Biology 2021-03-08 Xiaohan Zhang, Lu Liu, Guodong Long, Jing Jiang, Shenquan Liu

TEAL: New Selection Strategy for Small Buffers in Experience Replay Class Incremental Learning

Continual Learning is an unresolved challenge, whose relevance increases when considering modern applications. Unlike the human brain, trained deep neural networks suffer from a phenomenon called catastrophic forgetting, wherein they…

Machine Learning · Computer Science 2025-02-18 Shahar Shaul-Ariel, Daphna Weinshall

Sample-Efficient Reinforcement Learning with Maximum Entropy Mellowmax Episodic Control

Deep networks have enabled reinforcement learning to scale to more complex and challenging domains, but these methods typically require large quantities of training data. An alternative is to use sample-efficient episodic control methods:…

Machine Learning · Computer Science 2019-11-22 Marta Sarrico, Kai Arulkumaran, Andrea Agostinelli, Pierre Richemond, Anil Anthony Bharath

Agentic Episodic Control

Reinforcement learning (RL) has driven breakthroughs in AI, from game-play to scientific discovery and AI alignment. However, its broader applicability remains limited by challenges such as low data efficiency and poor generalizability.…

Artificial Intelligence · Computer Science 2025-06-03 Xidong Yang, Wenhao Li, Junjie Sheng, Chuyun Shen, Yun Hua, Xiangfeng Wang

Model-Free Episodic Control with State Aggregation

Episodic control provides a highly sample-efficient method for reinforcement learning while enforcing high memory and computational requirements. This work proposes a simple heuristic for reducing these requirements, and an application to…

Machine Learning · Computer Science 2020-08-25 Rafael Pinto