Related papers: Transformers are Sample-Efficient World Models

Learning to Play Atari in a World of Tokens

Model-based reinforcement learning agents utilizing transformers have shown improved sample efficiency due to their ability to model extended context, resulting in more accurate world models. However, for complex reasoning and planning…

Machine Learning · Computer Science 2024-06-04 Pranav Agarwal, Sheldon Andrews, Samira Ebrahimi Kahou

Efficient World Models with Context-Aware Tokenization

Scaling up deep Reinforcement Learning (RL) methods presents a significant challenge. Following developments in generative modelling, model-based RL positions itself as a strong contender. Recent advances in sequence modelling have led to…

Machine Learning · Computer Science 2024-06-28 Vincent Micheli, Eloi Alonso, François Fleuret

Mastering Atari with Discrete World Models

Intelligent agents need to generalize from past experience to achieve goals in complex environments. World models facilitate such generalization and allow learning behaviors from imagined outcomes to increase sample-efficiency. While…

Machine Learning · Computer Science 2022-02-15 Danijar Hafner, Timothy Lillicrap, Mohammad Norouzi, Jimmy Ba

Transformer-based World Models Are Happy With 100k Interactions

Deep neural networks have been successful in many reinforcement learning settings. However, compared to human learners they are overly data hungry. To build a sample-efficient world model, we apply a transformer to real-world episodes in an…

Machine Learning · Computer Science 2023-03-14 Jan Robine, Marc Höftmann, Tobias Uelwer, Stefan Harmeling

STORM: Efficient Stochastic Transformer based World Models for Reinforcement Learning

Recently, model-based reinforcement learning algorithms have demonstrated remarkable efficacy in visual input environments. These approaches begin by constructing a parameterized simulation world model of the real environment through…

Machine Learning · Computer Science 2023-12-27 Weipu Zhang, Gang Wang, Jian Sun, Yetian Yuan, Gao Huang

Improving Sample Efficiency of Value Based Models Using Attention and Vision Transformers

Much of recent Deep Reinforcement Learning success is owed to the neural architecture's potential to learn and use effective internal representations of the world. While many current algorithms access a simulator to train with a large…

Artificial Intelligence · Computer Science 2022-02-03 Amir Ardalan Kalantari, Mohammad Amini, Sarath Chandar, Doina Precup

Data-Efficient Reinforcement Learning with Self-Predictive Representations

While deep reinforcement learning excels at solving tasks where large amounts of data can be collected through virtually unlimited interaction with the environment, learning from limited interaction remains a key challenge. We posit that an…

Machine Learning · Computer Science 2021-05-21 Max Schwarzer, Ankesh Anand, Rishab Goel, R Devon Hjelm, Aaron Courville, Philip Bachman

Sample-Efficient Reinforcement Learning through Transfer and Architectural Priors

Recent work in deep reinforcement learning has allowed algorithms to learn complex tasks such as Atari 2600 games just from the reward provided by the game, but these algorithms presently require millions of training steps in order to…

Machine Learning · Computer Science 2018-01-09 Benjamin Spector, Serge Belongie

Measuring Progress in Deep Reinforcement Learning Sample Efficiency

Sampled environment transitions are a critical input to deep reinforcement learning (DRL) algorithms. Current DRL benchmarks often allow for the cheap and easy generation of large amounts of samples such that perceived progress in DRL does…

Machine Learning · Computer Science 2021-02-10 Florian E. Dorner

Reinforcement Learning with Unsupervised Auxiliary Tasks

Deep reinforcement learning agents have achieved state-of-the-art results by directly maximising cumulative reward. However, environments contain a much wider variety of possible training signals. In this paper, we introduce an agent that…

Machine Learning · Computer Science 2016-11-17 Max Jaderberg, Volodymyr Mnih, Wojciech Marian Czarnecki, Tom Schaul, Joel Z Leibo, David Silver, Koray Kavukcuoglu

Hieros: Hierarchical Imagination on Structured State Space Sequence World Models

One of the biggest challenges to modern deep reinforcement learning (DRL) algorithms is sample efficiency. Many approaches learn a world model in order to train an agent entirely in imagination, eliminating the need for direct environment…

Artificial Intelligence · Computer Science 2024-02-20 Paul Mattes, Rainer Schlosser, Ralf Herbrich

Learning Transformer-based World Models with Contrastive Predictive Coding

The DreamerV3 algorithm recently obtained remarkable performance across diverse environment domains by learning an accurate world model based on Recurrent Neural Networks (RNNs). Following the success of model-based reinforcement learning…

Machine Learning · Computer Science 2025-05-27 Maxime Burchi, Radu Timofte

Model-Based Reinforcement Learning for Atari

Model-free reinforcement learning (RL) can be used to learn effective policies for complex tasks, such as Atari games, even from image observations. However, this typically requires very large amounts of interaction -- substantially more,…

Machine Learning · Computer Science 2024-04-04 Lukasz Kaiser, Mohammad Babaeizadeh, Piotr Milos, Blazej Osinski, Roy H Campbell, Konrad Czechowski, Dumitru Erhan, Chelsea Finn, Piotr Kozakowski, Sergey Levine, Afroz Mohiuddin, Ryan Sepassi, George Tucker, Henryk Michalewski

Multi-Game Decision Transformers

A longstanding goal of the field of AI is a method for learning a highly capable, generalist agent from diverse experience. In the subfields of vision and language, this was largely achieved by scaling up transformer-based models and…

Artificial Intelligence · Computer Science 2022-10-18 Kuang-Huei Lee, Ofir Nachum, Mengjiao Yang, Lisa Lee, Daniel Freeman, Winnie Xu, Sergio Guadarrama, Ian Fischer, Eric Jang, Henryk Michalewski, Igor Mordatch

What Do World Models Learn in RL? Probing Latent Representations in Learned Environment Simulators

World models learn to simulate environment dynamics from experience, enabling sample-efficient reinforcement learning. But what do these models actually represent internally? We apply interpretability techniques--including linear and…

Machine Learning · Computer Science 2026-03-24 Xinyu Zhang

Perceiving the World: Question-guided Reinforcement Learning for Text-based Games

Text-based games provide an interactive way to study natural language processing. While deep reinforcement learning has shown effectiveness in developing the game playing agent, the low sample efficiency and the large action space remain to…

Computation and Language · Computer Science 2022-04-22 Yunqiu Xu, Meng Fang, Ling Chen, Yali Du, Joey Tianyi Zhou, Chengqi Zhang

Interactive Differentiable Simulation

Intelligent agents need a physical understanding of the world to predict the impact of their actions in the future. While learning-based models of the environment dynamics have contributed to significant improvements in sample efficiency…

Machine Learning · Computer Science 2020-05-20 Eric Heiden, David Millard, Hejia Zhang, Gaurav S. Sukhatme

Sample Efficient Actor-Critic with Experience Replay

This paper presents an actor-critic deep reinforcement learning agent with experience replay that is stable, sample efficient, and performs remarkably well on challenging environments, including the discrete 57-game Atari domain and several…

Machine Learning · Computer Science 2017-07-11 Ziyu Wang, Victor Bapst, Nicolas Heess, Volodymyr Mnih, Remi Munos, Koray Kavukcuoglu, Nando de Freitas

Design of Artificial Intelligence Agents for Games using Deep Reinforcement Learning

In order to perform a large variety of tasks and to achieve human-level performance in complex real-world environments, Artificial Intelligence (AI) Agents must be able to learn from their past experiences and gain both knowledge and an…

Machine Learning · Computer Science 2019-05-13 Andrei Claudiu Roibu

Pretraining Representations for Data-Efficient Reinforcement Learning

Data efficiency is a key challenge for deep reinforcement learning. We address this problem by using unlabeled data to pretrain an encoder which is then finetuned on a small amount of task-specific data. To encourage learning…

Machine Learning · Computer Science 2021-06-10 Max Schwarzer, Nitarshan Rajkumar, Michael Noukhovitch, Ankesh Anand, Laurent Charlin, Devon Hjelm, Philip Bachman, Aaron Courville