Related papers: Continuous Episodic Control

Memory-Efficient Episodic Control Reinforcement Learning with Dynamic Online k-means

Recently, neuro-inspired episodic control (EC) methods have been developed to overcome the data-inefficiency of standard deep reinforcement learning approaches. Using non-/semi-parametric models to estimate the value function, they learn…

Machine Learning · Computer Science 2019-11-22 Andrea Agostinelli , Kai Arulkumaran , Marta Sarrico , Pierre Richemond , Anil Anthony Bharath

Random Projection in Neural Episodic Control

End-to-end deep reinforcement learning has enabled agents to learn with little preprocessing by humans. However, it is still difficult to learn stably and efficiently because the learning method usually uses a nonlinear function…

Machine Learning · Computer Science 2019-04-16 Daichi Nishio , Satoshi Yamane

Sequential memory improves sample and memory efficiency in Episodic Control

State of the art deep reinforcement learning algorithms are sample inefficient due to the large number of episodes they require to achieve asymptotic performance. Episodic Reinforcement Learning (ERL) algorithms, inspired by the mammalian…

Machine Learning · Computer Science 2024-06-07 Ismael T. Freire , Adrián F. Amil , Paul F. M. J. Verschure

Solving Continuous Control with Episodic Memory

Episodic memory lets reinforcement learning algorithms remember and exploit promising experience from the past to improve agent performance. Previous works on memory mechanisms show benefits of using episodic-based data structures for…

Machine Learning · Computer Science 2021-06-17 Igor Kuznetsov , Andrey Filchenkov

Agentic Episodic Control

Reinforcement learning (RL) has driven breakthroughs in AI, from game-play to scientific discovery and AI alignment. However, its broader applicability remains limited by challenges such as low data efficiency and poor generalizability.…

Artificial Intelligence · Computer Science 2025-06-03 Xidong Yang , Wenhao Li , Junjie Sheng , Chuyun Shen , Yun Hua , Xiangfeng Wang

Model-Free Episodic Control with State Aggregation

Episodic control provides a highly sample-efficient method for reinforcement learning while enforcing high memory and computational requirements. This work proposes a simple heuristic for reducing these requirements, and an application to…

Machine Learning · Computer Science 2020-08-25 Rafael Pinto

Model-Based Episodic Memory Induces Dynamic Hybrid Controls

Episodic control enables sample efficiency in reinforcement learning by recalling past experiences from an episodic memory. We propose a new model-based episodic memory of trajectories addressing current limitations of episodic control. Our…

Machine Learning · Computer Science 2021-11-09 Hung Le , Thommen Karimpanal George , Majid Abdolshah , Truyen Tran , Svetha Venkatesh

Neural Episodic Control

Deep reinforcement learning methods attain super-human performance in a wide range of environments. Such methods are grossly inefficient, often taking orders of magnitudes more data than humans to achieve reasonable performance. We propose…

Machine Learning · Computer Science 2017-03-07 Alexander Pritzel , Benigno Uria , Sriram Srinivasan , Adrià Puigdomènech , Oriol Vinyals , Demis Hassabis , Daan Wierstra , Charles Blundell

Model-Free Episodic Control

State of the art deep reinforcement learning algorithms take many millions of interactions to attain human-level performance. Humans, on the other hand, can very quickly exploit highly rewarding nuances of an environment upon first…

Machine Learning · Statistics 2016-06-15 Charles Blundell , Benigno Uria , Alexander Pritzel , Yazhe Li , Avraham Ruderman , Joel Z Leibo , Jack Rae , Daan Wierstra , Demis Hassabis

Asynchronous Episodic Deep Deterministic Policy Gradient: Towards Continuous Control in Computationally Complex Environments

Deep Deterministic Policy Gradient (DDPG) has been proved to be a successful reinforcement learning (RL) algorithm for continuous control tasks. However, DDPG still suffers from data insufficiency and training inefficiency, especially in…

Machine Learning · Computer Science 2019-03-05 Zhizheng Zhang , Jiale Chen , Zhibo Chen , Weiping Li

Continual and Multi-task Reinforcement Learning With Shared Episodic Memory

Episodic memory plays an important role in the behavior of animals and humans. It allows the accumulation of information about current state of the environment in a task-agnostic way. This episodic representation can be later accessed by…

Neural and Evolutionary Computing · Computer Science 2019-05-08 Artyom Y. Sorokin , Mikhail S. Burtsev

Cross-Episodic Curriculum for Transformer Agents

We present a new algorithm, Cross-Episodic Curriculum (CEC), to boost the learning efficiency and generalization of Transformer agents. Central to CEC is the placement of cross-episodic experiences into a Transformer's context, which forms…

Machine Learning · Computer Science 2023-10-13 Lucy Xiaoyang Shi , Yunfan Jiang , Jake Grigsby , Linxi "Jim" Fan , Yuke Zhu

Sample-Efficient Reinforcement Learning with Maximum Entropy Mellowmax Episodic Control

Deep networks have enabled reinforcement learning to scale to more complex and challenging domains, but these methods typically require large quantities of training data. An alternative is to use sample-efficient episodic control methods:…

Machine Learning · Computer Science 2019-11-22 Marta Sarrico , Kai Arulkumaran , Andrea Agostinelli , Pierre Richemond , Anil Anthony Bharath

Efficient Replay Memory Architectures in Multi-Agent Reinforcement Learning for Traffic Congestion Control

Episodic control, inspired by the role of episodic memory in the human brain, has been shown to improve the sample inefficiency of model-free reinforcement learning by reusing high-return past experiences. However, the memory growth of…

Systems and Control · Electrical Eng. & Systems 2024-07-24 Mukul Chodhary , Kevin Octavian , SooJean Han

Is prioritized sweeping the better episodic control?

Episodic control has been proposed as a third approach to reinforcement learning, besides model-free and model-based control, by analogy with the three types of human memory. i.e. episodic, procedural and semantic memory. But the…

Artificial Intelligence · Computer Science 2018-08-13 Johanni Brea

An Efficient Continuous Control Perspective for Reinforcement-Learning-based Sequential Recommendation

Sequential recommendation, where user preference is dynamically inferred from sequential historical behaviors, is a critical task in recommender systems (RSs). To further optimize long-term user engagement, offline…

Machine Learning · Computer Science 2024-08-16 Jun Wang , Likang Wu , Qi Liu , Yu Yang

Generalizable Episodic Memory for Deep Reinforcement Learning

Episodic memory-based methods can rapidly latch onto past successful strategies by a non-parametric memory and improve sample efficiency of traditional reinforcement learning. However, little effort is put into the continuous domain, where…

Machine Learning · Computer Science 2021-06-14 Hao Hu , Jianing Ye , Guangxiang Zhu , Zhizhou Ren , Chongjie Zhang

On Tiny Episodic Memories in Continual Learning

In continual learning (CL), an agent learns from a stream of tasks leveraging prior experience to transfer knowledge to future tasks. It is an ideal framework to decrease the amount of supervision in the existing learning algorithms. But…

Machine Learning · Computer Science 2019-06-05 Arslan Chaudhry , Marcus Rohrbach , Mohamed Elhoseiny , Thalaiyasingam Ajanthan , Puneet K. Dokania , Philip H. S. Torr , Marc'Aurelio Ranzato

Episodic Self-Imitation Learning with Hindsight

Episodic self-imitation learning, a novel self-imitation algorithm with a trajectory selection module and an adaptive loss function, is proposed to speed up reinforcement learning. Compared to the original self-imitation learning algorithm,…

Artificial Intelligence · Computer Science 2020-11-30 Tianhong Dai , Hengyan Liu , Anil Anthony Bharath

Towards sample-efficient episodic control with DAC-ML

The sample-inefficiency problem in Artificial Intelligence refers to the inability of current Deep Reinforcement Learning models to optimize action policies within a small number of episodes. Recent studies have tried to overcome this…

Artificial Intelligence · Computer Science 2020-12-29 Ismael T. Freire , Adrián F. Amil , Vasiliki Vouloutsi , Paul F. M. J. Verschure