English

Solving Continuous Control with Episodic Memory

Machine Learning 2021-06-17 v1

Abstract

Episodic memory lets reinforcement learning algorithms remember and exploit promising experience from the past to improve agent performance. Previous works on memory mechanisms show benefits of using episodic-based data structures for discrete action problems in terms of sample-efficiency. The application of episodic memory for continuous control with a large action space is not trivial. Our study aims to answer the question: can episodic memory be used to improve agent's performance in continuous control? Our proposed algorithm combines episodic memory with Actor-Critic architecture by modifying critic's objective. We further improve performance by introducing episodic-based replay buffer prioritization. We evaluate our algorithm on OpenAI gym domains and show greater sample-efficiency compared with the state-of-the art model-free off-policy algorithms.

Keywords

Cite

@article{arxiv.2106.08832,
  title  = {Solving Continuous Control with Episodic Memory},
  author = {Igor Kuznetsov and Andrey Filchenkov},
  journal= {arXiv preprint arXiv:2106.08832},
  year   = {2021}
}

Comments

To appear in the 30th International Joint Conference on Artificial Intelligence (IJCAI 2021)

R2 v1 2026-06-24T03:16:14.829Z