Related papers: Continuous Episodic Control

200 papers

Recently, neuro-inspired episodic control (EC) methods have been developed to overcome the data-inefficiency of standard deep reinforcement learning approaches. Using non-/semi-parametric models to estimate the value function, they learn…

Machine Learning · Computer Science 2019-11-22 Andrea Agostinelli, Kai Arulkumaran, Marta Sarrico, Pierre Richemond, Anil Anthony Bharath

End-to-end deep reinforcement learning has enabled agents to learn with little preprocessing by humans. However, it is still difficult to learn stably and efficiently because the learning method usually uses a nonlinear function…

Machine Learning · Computer Science 2019-04-16 Daichi Nishio, Satoshi Yamane

State of the art deep reinforcement learning algorithms are sample inefficient due to the large number of episodes they require to achieve asymptotic performance. Episodic Reinforcement Learning (ERL) algorithms, inspired by the mammalian…

Machine Learning · Computer Science 2024-06-07 Ismael T. Freire, Adrián F. Amil, Paul F. M. J. Verschure

Episodic memory lets reinforcement learning algorithms remember and exploit promising experience from the past to improve agent performance. Previous works on memory mechanisms show benefits of using episodic-based data structures for…

Machine Learning · Computer Science 2021-06-17 Igor Kuznetsov, Andrey Filchenkov

Reinforcement learning (RL) has driven breakthroughs in AI, from game-play to scientific discovery and AI alignment. However, its broader applicability remains limited by challenges such as low data efficiency and poor generalizability.…

Artificial Intelligence · Computer Science 2025-06-03 Xidong Yang, Wenhao Li, Junjie Sheng, Chuyun Shen, Yun Hua, Xiangfeng Wang

Episodic control provides a highly sample-efficient method for reinforcement learning while enforcing high memory and computational requirements. This work proposes a simple heuristic for reducing these requirements, and an application to…

Machine Learning · Computer Science 2020-08-25 Rafael Pinto

Episodic control enables sample efficiency in reinforcement learning by recalling past experiences from an episodic memory. We propose a new model-based episodic memory of trajectories addressing current limitations of episodic control. Our…

Machine Learning · Computer Science 2021-11-09 Hung Le, Thommen Karimpanal George, Majid Abdolshah, Truyen Tran, Svetha Venkatesh

Deep reinforcement learning methods attain super-human performance in a wide range of environments. Such methods are grossly inefficient, often taking orders of magnitude more data than humans to achieve reasonable performance. We propose…

State of the art deep reinforcement learning algorithms take many millions of interactions to attain human-level performance. Humans, on the other hand, can very quickly exploit highly rewarding nuances of an environment upon first…

Deep Deterministic Policy Gradient (DDPG) has proven to be a successful reinforcement learning (RL) algorithm for continuous control tasks. However, DDPG still suffers from data insufficiency and training inefficiency, especially in…

Machine Learning · Computer Science 2019-03-05 Zhizheng Zhang, Jiale Chen, Zhibo Chen, Weiping Li

Episodic memory plays an important role in the behavior of animals and humans. It allows the accumulation of information about the current state of the environment in a task-agnostic way. This episodic representation can be later accessed by…

Neural and Evolutionary Computing · Computer Science 2019-05-08 Artyom Y. Sorokin, Mikhail S. Burtsev

We present a new algorithm, Cross-Episodic Curriculum (CEC), to boost the learning efficiency and generalization of Transformer agents. Central to CEC is the placement of cross-episodic experiences into a Transformer's context, which forms…

Machine Learning · Computer Science 2023-10-13 Lucy Xiaoyang Shi, Yunfan Jiang, Jake Grigsby, Linxi "Jim" Fan, Yuke Zhu

Deep networks have enabled reinforcement learning to scale to more complex and challenging domains, but these methods typically require large quantities of training data. An alternative is to use sample-efficient episodic control methods:…

Machine Learning · Computer Science 2019-11-22 Marta Sarrico, Kai Arulkumaran, Andrea Agostinelli, Pierre Richemond, Anil Anthony Bharath

Episodic control, inspired by the role of episodic memory in the human brain, has been shown to improve the sample efficiency of model-free reinforcement learning by reusing high-return past experiences. However, the memory growth of…

Systems and Control · Electrical Eng. & Systems 2024-07-24 Mukul Chodhary, Kevin Octavian, SooJean Han

Episodic control has been proposed as a third approach to reinforcement learning, besides model-free and model-based control, by analogy with the three types of human memory, i.e., episodic, procedural and semantic memory. But the…

Artificial Intelligence · Computer Science 2018-08-13 Johanni Brea

Sequential recommendation, where user preference is dynamically inferred from sequential historical behaviors, is a critical task in recommender systems (RSs). To further optimize long-term user engagement, offline…

Machine Learning · Computer Science 2024-08-16 Jun Wang, Likang Wu, Qi Liu, Yu Yang

Episodic memory-based methods can rapidly latch onto past successful strategies by a non-parametric memory and improve sample efficiency of traditional reinforcement learning. However, little effort is put into the continuous domain, where…

Machine Learning · Computer Science 2021-06-14 Hao Hu, Jianing Ye, Guangxiang Zhu, Zhizhou Ren, Chongjie Zhang

In continual learning (CL), an agent learns from a stream of tasks leveraging prior experience to transfer knowledge to future tasks. It is an ideal framework to decrease the amount of supervision in the existing learning algorithms. But…

Episodic self-imitation learning, a novel self-imitation algorithm with a trajectory selection module and an adaptive loss function, is proposed to speed up reinforcement learning. Compared to the original self-imitation learning algorithm,…

Artificial Intelligence · Computer Science 2020-11-30 Tianhong Dai, Hengyan Liu, Anil Anthony Bharath

The sample-inefficiency problem in Artificial Intelligence refers to the inability of current Deep Reinforcement Learning models to optimize action policies within a small number of episodes. Recent studies have tried to overcome this…

Artificial Intelligence · Computer Science 2020-12-29 Ismael T. Freire, Adrián F. Amil, Vasiliki Vouloutsi, Paul F. M. J. Verschure