Related papers: Measuring Progress in Deep Reinforcement Learning …

Search-Based Adversarial Estimates for Improving Sample Efficiency in Off-Policy Reinforcement Learning

Sample inefficiency is a long-standing challenge in deep reinforcement learning (DRL). Although dramatic improvements have been made, the problem is far from being solved and is especially challenging in environments with sparse or delayed…

Machine Learning · Computer Science 2025-06-17 Federico Malato, Ville Hautamaki

Posterior Sampling for Deep Reinforcement Learning

Despite remarkable successes, deep reinforcement learning algorithms remain sample inefficient: they require an enormous amount of trial and error to find good policies. Model-based algorithms promise sample efficiency by building an…

Machine Learning · Computer Science 2023-05-19 Remo Sasso, Michelangelo Conserva, Paulo Rauber

Sample and Computationally Efficient Continuous-Time Reinforcement Learning with General Function Approximation

Continuous-time reinforcement learning (CTRL) provides a principled framework for sequential decision-making in environments where interactions evolve continuously over time. Despite its empirical success, the theoretical understanding of…

Machine Learning · Computer Science 2025-05-22 Runze Zhao, Yue Yu, Adams Yiyue Zhu, Chen Yang, Dongruo Zhou

Mastering Atari Games with Limited Data

Reinforcement learning has achieved great success in many applications. However, sample efficiency remains a key challenge, with prominent methods requiring millions (or even billions) of environment steps to train. Recently, there has been…

Machine Learning · Computer Science 2021-12-14 Weirui Ye, Shaohuai Liu, Thanard Kurutach, Pieter Abbeel, Yang Gao

Symmetric Replay Training: Enhancing Sample Efficiency in Deep Reinforcement Learning for Combinatorial Optimization

Deep reinforcement learning (DRL) has significantly advanced the field of combinatorial optimization (CO). However, its practicality is hindered by the necessity for a large number of reward evaluations, especially in scenarios involving…

Machine Learning · Computer Science 2024-07-18 Hyeonah Kim, Minsu Kim, Sungsoo Ahn, Jinkyoo Park

Sample-Efficient Neurosymbolic Deep Reinforcement Learning

Reinforcement Learning (RL) is a well-established framework for sequential decision-making in complex environments. However, state-of-the-art Deep RL (DRL) algorithms typically require large training datasets and often struggle to…

Artificial Intelligence · Computer Science 2026-04-13 Celeste Veronese, Alessandro Farinelli, Daniele Meli

Deep Reinforcement Learning at the Edge of the Statistical Precipice

Deep reinforcement learning (RL) algorithms are predominantly evaluated by comparing their relative performance on a large suite of tasks. Most published results on deep RL benchmarks compare point estimates of aggregate performance such as…

Machine Learning · Computer Science 2022-01-06 Rishabh Agarwal, Max Schwarzer, Pablo Samuel Castro, Aaron Courville, Marc G. Bellemare

Accelerated Methods for Deep Reinforcement Learning

Deep reinforcement learning (RL) has achieved many recent successes, yet experiment turn-around time remains a key bottleneck in research and in practice. We investigate how to optimize existing deep RL algorithms for modern computers,…

Machine Learning · Computer Science 2019-01-14 Adam Stooke, Pieter Abbeel

Towards More Sample Efficiency in Reinforcement Learning with Data Augmentation

Deep reinforcement learning (DRL) is a promising approach for adaptive robot control, but its application to robotics is currently hindered by high sample requirements. We propose two novel data augmentation techniques for DRL in…

Artificial Intelligence · Computer Science 2019-11-18 Yijiong Lin, Jiancong Huang, Matthieu Zimmer, Juan Rojas, Paul Weng

Greener Deep Reinforcement Learning: Analysis of Energy and Carbon Efficiency Across Atari Benchmarks

The growing computational demands of deep reinforcement learning (DRL) have raised concerns about the environmental and economic costs of training large-scale models. While algorithmic efficiency in terms of learning performance has been…

Machine Learning · Computer Science 2025-09-08 Jason Gardner, Ayan Dutta, Swapnoneel Roy, O. Patrick Kreidl, Ladislau Boloni

Higher Replay Ratio Empowers Sample-Efficient Multi-Agent Reinforcement Learning

One of the notorious issues for Reinforcement Learning (RL) is poor sample efficiency. Compared to single-agent RL, achieving sample efficiency in Multi-Agent Reinforcement Learning (MARL) is more challenging because of its inherent partial…

Machine Learning · Computer Science 2024-04-16 Linjie Xu, Zichuan Liu, Alexander Dockhorn, Diego Perez-Liebana, Jinyu Wang, Lei Song, Jiang Bian

Deep Reinforcement Learning: Framework, Applications, and Embedded Implementations

The recent breakthroughs of the deep reinforcement learning (DRL) technique in AlphaGo and in playing Atari have set a good example in handling the large state and action spaces of complicated control problems. The DRL technique is comprised of (i)…

Artificial Intelligence · Computer Science 2017-10-12 Hongjia Li, Tianshu Wei, Ao Ren, Qi Zhu, Yanzhi Wang

Adaptive Action Duration with Contextual Bandits for Deep Reinforcement Learning in Dynamic Environments

Deep Reinforcement Learning (DRL) has achieved remarkable success in complex sequential decision-making tasks, such as playing Atari 2600 games and mastering board games. A critical yet underexplored aspect of DRL is the temporal scale of…

Machine Learning · Computer Science 2025-07-02 Abhishek Verma, Nallarasan V, Balaraman Ravindran

Experience Replay with Random Reshuffling

Experience replay is a key component in reinforcement learning for stabilizing learning and improving sample efficiency. Its typical implementation samples transitions with replacement from a replay buffer. In contrast, in supervised…

Machine Learning · Computer Science 2025-12-05 Yasuhiro Fujita

Dynamic Sparse Training for Deep Reinforcement Learning

Deep reinforcement learning (DRL) agents are trained through trial-and-error interactions with the environment. This leads to a long training time for dense neural networks to achieve good performance. Hence, prohibitive computation and…

Machine Learning · Computer Science 2022-05-09 Ghada Sokar, Elena Mocanu, Decebal Constantin Mocanu, Mykola Pechenizkiy, Peter Stone

Combining Trained Models in Reinforcement Learning

Deep reinforcement learning (DRL) has delivered strong results in domains such as Atari and Go, but it still suffers from high sample cost and weak transfer beyond the training setting. A common response is to reuse information from…

Machine Learning · Computer Science 2026-05-05 Ujjwal Patil, Javad Ghofrani

Towards Sample-Efficiency and Generalization of Transfer and Inverse Reinforcement Learning: A Comprehensive Literature Review

Reinforcement learning (RL) is a sub-domain of machine learning, mainly concerned with solving sequential decision-making problems by a learning agent that interacts with the decision environment to improve its behavior through the reward…

Machine Learning · Computer Science 2025-09-23 Hossein Hassani, Ehsan Hallaji, Roozbeh Razavi-Far, Mehrdad Saif, Liang Lin

Pretraining Representations for Data-Efficient Reinforcement Learning

Data efficiency is a key challenge for deep reinforcement learning. We address this problem by using unlabeled data to pretrain an encoder which is then finetuned on a small amount of task-specific data. To encourage learning…

Machine Learning · Computer Science 2021-06-10 Max Schwarzer, Nitarshan Rajkumar, Michael Noukhovitch, Ankesh Anand, Laurent Charlin, Devon Hjelm, Philip Bachman, Aaron Courville

Deep Reinforcement Learning With Macro-Actions

Deep reinforcement learning has been shown to be a powerful framework for learning policies from complex high-dimensional sensory inputs to actions in complex tasks, such as the Atari domain. In this paper, we explore output representation…

Machine Learning · Computer Science 2016-06-16 Ishan P. Durugkar, Clemens Rosenbaum, Stefan Dernbach, Sridhar Mahadevan

Stable deep reinforcement learning method by predicting uncertainty in rewards as a subtask

In recent years, a variety of tasks have been accomplished by deep reinforcement learning (DRL). However, when applying DRL to tasks in a real-world environment, designing an appropriate reward is difficult. Rewards obtained via actual…

Machine Learning · Computer Science 2023-10-04 Kanata Suzuki, Tetsuya Ogata