Related papers: Virtual Replay Cache

Reconciling $\lambda$-Returns with Experience Replay

Modern deep reinforcement learning methods have departed from the incremental learning required for eligibility traces, rendering the implementation of the $\lambda$-return difficult in this context. In particular, off-policy methods that…

Machine Learning · Computer Science 2020-01-15 Brett Daley , Christopher Amato

Deep Reinforcement Learning With Macro-Actions

Deep reinforcement learning has been shown to be a powerful framework for learning policies from complex high-dimensional sensory inputs to actions in complex tasks, such as the Atari domain. In this paper, we explore output representation…

Machine Learning · Computer Science 2016-06-16 Ishan P. Durugkar , Clemens Rosenbaum , Stefan Dernbach , Sridhar Mahadevan

Value-Consistent Representation Learning for Data-Efficient Reinforcement Learning

Deep reinforcement learning (RL) algorithms suffer severe performance degradation when the interaction data is scarce, which limits their real-world application. Recently, visual representation learning has been shown to be effective and…

Machine Learning · Computer Science 2022-08-17 Yang Yue , Bingyi Kang , Zhongwen Xu , Gao Huang , Shuicheng Yan

RT-Cache: Training-Free Retrieval for Real-Time Manipulation

Real robots are expected to repeat the same behavior in new environments with very little new data, yet modern controllers either incur heavy per-step inference or require deployment-time fine-tuning. We propose RT-Cache, a training-free…

Robotics · Computer Science 2025-08-26 Owen Kwon , Abraham George , Alison Bartsch , Amir Barati Farimani

Pseudo-Rehearsal: Achieving Deep Reinforcement Learning without Catastrophic Forgetting

Neural networks can achieve excellent results in a wide variety of applications. However, when they attempt to sequentially learn, they tend to learn the new task while catastrophically forgetting previous ones. We propose a model that…

Machine Learning · Computer Science 2020-12-18 Craig Atkinson , Brendan McCane , Lech Szymanski , Anthony Robins

Reverse Experience Replay

This paper describes an improvement in Deep Q-learning called Reverse Experience Replay (also RER) that solves the problem of sparse rewards and helps to deal with reward maximizing tasks by sampling transitions successively in reverse…

Machine Learning · Computer Science 2019-10-24 Egor Rotinov

Prioritized Experience Replay

Experience replay lets online reinforcement learning agents remember and reuse experiences from the past. In prior work, experience transitions were uniformly sampled from a replay memory. However, this approach simply replays transitions…

Machine Learning · Computer Science 2016-02-26 Tom Schaul , John Quan , Ioannis Antonoglou , David Silver

The Effectiveness of Memory Replay in Large Scale Continual Learning

We study continual learning in the large scale setting where tasks in the input sequence are not limited to classification, and the outputs can be of high dimension. Among multiple state-of-the-art methods, we found vanilla experience…

Machine Learning · Computer Science 2020-10-07 Yogesh Balaji , Mehrdad Farajtabar , Dong Yin , Alex Mott , Ang Li

VLCache: Computing 2% Vision Tokens and Reusing 98% for Vision-Language Inference

This paper presents VLCache, a cache reuse framework that exploits both Key-Value (KV) cache and encoder cache from prior multimodal inputs to eliminate costly recomputation when the same multimodal inputs recur. Unlike previous heuristic…

Computer Vision and Pattern Recognition · Computer Science 2025-12-19 Shengling Qin , Hao Yu , Chenxin Wu , Zheng Li , Yizhong Cao , Zhengyang Zhuge , Yuxin Zhou , Wentao Yao , Yi Zhang , Zhengheng Wang , Shuai Bai , Jianwei Zhang , Junyang Lin

Inverse reinforcement learning for video games

Deep reinforcement learning achieves superhuman performance in a range of video game environments, but requires that a designer manually specify a reward function. It is often easier to provide demonstrations of a target behavior than to…

Machine Learning · Computer Science 2018-10-26 Aaron Tucker , Adam Gleave , Stuart Russell

Distributional Reinforcement Learning for Efficient Exploration

In distributional reinforcement learning (RL), the estimated distribution of value function models both the parametric and intrinsic uncertainties. We propose a novel and efficient exploration method for deep RL that has two components. The…

Machine Learning · Computer Science 2019-05-16 Borislav Mavrin , Shangtong Zhang , Hengshuai Yao , Linglong Kong , Kaiwen Wu , Yaoliang Yu

Variance Reduction via Resampling and Experience Replay

Experience replay is a foundational technique in reinforcement learning that enhances learning stability by storing past experiences in a replay buffer and reusing them during training. Despite its practical success, its theoretical…

Machine Learning · Statistics 2025-11-14 Jiale Han , Xiaowu Dai , Yuhua Zhu

Return-Based Contrastive Representation Learning for Reinforcement Learning

Recently, various auxiliary tasks have been proposed to accelerate representation learning and improve sample efficiency in deep reinforcement learning (RL). However, existing auxiliary tasks do not take the characteristics of RL problems…

Machine Learning · Computer Science 2021-02-23 Guoqing Liu , Chuheng Zhang , Li Zhao , Tao Qin , Jinhua Zhu , Jian Li , Nenghai Yu , Tie-Yan Liu

Map-based Experience Replay: A Memory-Efficient Solution to Catastrophic Forgetting in Reinforcement Learning

Deep Reinforcement Learning agents often suffer from catastrophic forgetting, forgetting previously found solutions in parts of the input space when training on new data. Replay Memories are a common solution to the problem, decorrelating…

Machine Learning · Computer Science 2023-08-29 Muhammad Burhan Hafez , Tilman Immisch , Tom Weber , Stefan Wermter

Experience Replay with Random Reshuffling

Experience replay is a key component in reinforcement learning for stabilizing learning and improving sample efficiency. Its typical implementation samples transitions with replacement from a replay buffer. In contrast, in supervised…

Machine Learning · Computer Science 2025-12-05 Yasuhiro Fujita

Viewport-Aware Deep Reinforcement Learning Approach for 360$^o$ Video Caching

360$^o$ video is an essential component of VR/AR/MR systems that provides immersive experience to the users. However, 360$^o$ video is associated with high bandwidth requirements. The required bandwidth can be reduced by exploiting the fact…

Multimedia · Computer Science 2020-04-14 Pantelis Maniotis , Nikolaos Thomos

Towards VM Rescheduling Optimization Through Deep Reinforcement Learning

Modern industry-scale data centers need to manage a large number of virtual machines (VMs). Due to the continual creation and release of VMs, many small resource fragments are scattered across physical machines (PMs). To handle these…

Machine Learning · Computer Science 2025-05-26 Xianzhong Ding , Yunkai Zhang , Binbin Chen , Donghao Ying , Tieying Zhang , Jianjun Chen , Lei Zhang , Alberto Cerpa , Wan Du

Memory-efficient Reinforcement Learning with Value-based Knowledge Consolidation

Artificial neural networks are promising for general function approximation but challenging to train on non-independent or non-identically distributed data due to catastrophic forgetting. The experience replay buffer, a standard component…

Machine Learning · Computer Science 2023-04-12 Qingfeng Lan , Yangchen Pan , Jun Luo , A. Rupam Mahmood

Stratified Experience Replay: Correcting Multiplicity Bias in Off-Policy Reinforcement Learning

Deep Reinforcement Learning (RL) methods rely on experience replay to approximate the minibatched supervised learning setting; however, unlike supervised learning where access to lots of training data is crucial to generalization,…

Machine Learning · Computer Science 2021-02-24 Brett Daley , Cameron Hickert , Christopher Amato

Value Prediction Network

This paper proposes a novel deep reinforcement learning (RL) architecture, called Value Prediction Network (VPN), which integrates model-free and model-based RL methods into a single neural network. In contrast to typical model-based RL…

Artificial Intelligence · Computer Science 2017-11-08 Junhyuk Oh , Satinder Singh , Honglak Lee