Related papers: Data-Efficient Reinforcement Learning with Self-Pr…

Pretraining Representations for Data-Efficient Reinforcement Learning

Data efficiency is a key challenge for deep reinforcement learning. We address this problem by using unlabeled data to pretrain an encoder which is then finetuned on a small amount of task-specific data. To encourage learning…

Machine Learning · Computer Science 2021-06-10 Max Schwarzer , Nitarshan Rajkumar , Michael Noukhovitch , Ankesh Anand , Laurent Charlin , Devon Hjelm , Philip Bachman , Aaron Courville

Pre-training with Non-expert Human Demonstration for Deep Reinforcement Learning

Deep reinforcement learning (deep RL) has achieved superior performance in complex sequential tasks by using deep neural networks as function approximators to learn directly from raw input images. However, learning directly from raw images…

Machine Learning · Computer Science 2019-07-31 Gabriel V. de la Cruz , Yunshu Du , Matthew E. Taylor

Reinforcement Learning with Unsupervised Auxiliary Tasks

Deep reinforcement learning agents have achieved state-of-the-art results by directly maximising cumulative reward. However, environments contain a much wider variety of possible training signals. In this paper, we introduce an agent that…

Machine Learning · Computer Science 2016-11-17 Max Jaderberg , Volodymyr Mnih , Wojciech Marian Czarnecki , Tom Schaul , Joel Z Leibo , David Silver , Koray Kavukcuoglu

Efficient Deep Reinforcement Learning with Predictive Processing Proximal Policy Optimization

Advances in reinforcement learning (RL) often rely on massive compute resources and remain notoriously sample inefficient. In contrast, the human brain is able to efficiently learn effective control strategies using limited resources. This…

Machine Learning · Computer Science 2024-01-30 Burcu Küçükoğlu , Walraaf Borkent , Bodo Rueckauer , Nasir Ahmad , Umut Güçlü , Marcel van Gerven

Ego-Foresight: Self-supervised Learning of Agent-Aware Representations for Improved RL

Despite the significant advances in Deep Reinforcement Learning (RL) observed in the last decade, the amount of training experience necessary to learn effective policies remains one of the primary concerns in both simulated and real…

Robotics · Computer Science 2026-04-02 Manuel Serra Nunes , Atabak Dehban , Yiannis Demiris , José Santos-Victor

An Empirical Study on the Power of Future Prediction in Partially Observable Environments

Learning good representations of historical contexts is one of the core challenges of reinforcement learning (RL) in partially observable environments. While self-predictive auxiliary tasks have been shown to improve performance in fully…

Machine Learning · Computer Science 2025-03-11 Jeongyeol Kwon , Liu Yang , Robert Nowak , Josiah Hanna

Reward prediction for representation learning and reward shaping

One of the fundamental challenges in reinforcement learning (RL) is the one of data efficiency: modern algorithms require a very large number of training samples, especially compared to humans, for solving environments with high-dimensional…

Machine Learning · Computer Science 2021-05-10 Hlynur Davíð Hlynsson , Laurenz Wiskott

Value-Consistent Representation Learning for Data-Efficient Reinforcement Learning

Deep reinforcement learning (RL) algorithms suffer severe performance degradation when the interaction data is scarce, which limits their real-world application. Recently, visual representation learning has been shown to be effective and…

Machine Learning · Computer Science 2022-08-17 Yang Yue , Bingyi Kang , Zhongwen Xu , Gao Huang , Shuicheng Yan

Frustratingly Easy Regularization on Representation Can Boost Deep Reinforcement Learning

Deep reinforcement learning (DRL) gives the promise that an agent learns good policy from high-dimensional information, whereas representation learning removes irrelevant and redundant information and retains pertinent information. In this…

Machine Learning · Computer Science 2023-04-25 Qiang He , Huangyuan Su , Jieyu Zhang , Xinwen Hou

Sample-Efficient Reinforcement Learning through Transfer and Architectural Priors

Recent work in deep reinforcement learning has allowed algorithms to learn complex tasks such as Atari 2600 games just from the reward provided by the game, but these algorithms presently require millions of training steps in order to…

Machine Learning · Computer Science 2018-01-09 Benjamin Spector , Serge Belongie

Learning sparse representations in reinforcement learning

Reinforcement learning (RL) algorithms allow artificial agents to improve their selection of actions to increase rewarding experiences in their environments. Temporal Difference (TD) Learning -- a model-free RL method -- is a leading…

Machine Learning · Computer Science 2019-09-05 Jacob Rafati , David C. Noelle

Autonomous Reinforcement Learning via Subgoal Curricula

Reinforcement learning (RL) promises to enable autonomous acquisition of complex behaviors for diverse agents. However, the success of current reinforcement learning algorithms is predicated on an often under-emphasised requirement -- each…

Machine Learning · Computer Science 2021-10-29 Archit Sharma , Abhishek Gupta , Sergey Levine , Karol Hausman , Chelsea Finn

Reinforcement Learning with Prototypical Representations

Learning effective representations in image-based environments is crucial for sample efficient Reinforcement Learning (RL). Unfortunately, in RL, representation learning is confounded with the exploratory experience of the agent -- learning…

Machine Learning · Computer Science 2021-07-21 Denis Yarats , Rob Fergus , Alessandro Lazaric , Lerrel Pinto

Simplifying Deep Reinforcement Learning via Self-Supervision

Supervised regression to demonstrations has been demonstrated to be a stable way to train deep policy networks. We are motivated to study how we can take full advantage of supervised loss functions for stably training deep reinforcement…

Machine Learning · Computer Science 2021-06-11 Daochen Zha , Kwei-Herng Lai , Kaixiong Zhou , Xia Hu

Unsupervised State Representation Learning in Atari

State representation learning, or the ability to capture latent generative factors of an environment, is crucial for building intelligent agents that can perform a wide variety of tasks. Learning such representations without supervision…

Machine Learning · Computer Science 2020-11-09 Ankesh Anand , Evan Racah , Sherjil Ozair , Yoshua Bengio , Marc-Alexandre Côté , R Devon Hjelm

Model-Based Reinforcement Learning for Atari

Model-free reinforcement learning (RL) can be used to learn effective policies for complex tasks, such as Atari games, even from image observations. However, this typically requires very large amounts of interaction -- substantially more,…

Machine Learning · Computer Science 2024-04-04 Lukasz Kaiser , Mohammad Babaeizadeh , Piotr Milos , Blazej Osinski , Roy H Campbell , Konrad Czechowski , Dumitru Erhan , Chelsea Finn , Piotr Kozakowski , Sergey Levine , Afroz Mohiuddin , Ryan Sepassi , George Tucker , Henryk Michalewski

Multi-scale Predictive Representations for Goal-conditioned Reinforcement Learning

This paper investigates robust representation learning in offline goal-conditioned reinforcement learning (GCRL). Particularly in sparse reward scenarios, learning representations that align state and goal latents is a challenge that…

Machine Learning · Computer Science 2026-05-12 Valliappan Chidambaram Adaikkappan , David Meger , Sai Rajeswar , Pietro Mazzaglia

Dynamics-aware Embeddings

In this paper we consider self-supervised representation learning to improve sample efficiency in reinforcement learning (RL). We propose a forward prediction objective for simultaneously learning embeddings of states and action sequences.…

Machine Learning · Computer Science 2020-01-15 William Whitney , Rajat Agarwal , Kyunghyun Cho , Abhinav Gupta

Playing FPS Games with Deep Reinforcement Learning

Advances in deep reinforcement learning have allowed autonomous agents to perform well on Atari games, often outperforming humans, using only raw pixels to make their decisions. However, most of these games take place in 2D environments…

Artificial Intelligence · Computer Science 2018-01-30 Guillaume Lample , Devendra Singh Chaplot

Toward Training Superintelligent Software Agents through Self-Play SWE-RL

While current software agents powered by large language models (LLMs) and agentic reinforcement learning (RL) can boost programmer productivity, their training data (e.g., GitHub issues and pull requests) and environments (e.g.,…

Software Engineering · Computer Science 2026-05-20 Yuxiang Wei , Zhiqing Sun , Emily McMilin , Jonas Gehring , David Zhang , Gabriel Synnaeve , Daniel Fried , Lingming Zhang , Sida Wang