English
Related papers

Related papers: Transformers are Sample-Efficient World Models

200 papers

Model-based reinforcement learning agents utilizing transformers have shown improved sample efficiency due to their ability to model extended context, resulting in more accurate world models. However, for complex reasoning and planning…

Machine Learning · Computer Science 2024-06-04 Pranav Agarwal , Sheldon Andrews , Samira Ebrahimi Kahou

Scaling up deep Reinforcement Learning (RL) methods presents a significant challenge. Following developments in generative modelling, model-based RL positions itself as a strong contender. Recent advances in sequence modelling have led to…

Machine Learning · Computer Science 2024-06-28 Vincent Micheli , Eloi Alonso , François Fleuret

Intelligent agents need to generalize from past experience to achieve goals in complex environments. World models facilitate such generalization and allow learning behaviors from imagined outcomes to increase sample-efficiency. While…

Machine Learning · Computer Science 2022-02-15 Danijar Hafner , Timothy Lillicrap , Mohammad Norouzi , Jimmy Ba

Deep neural networks have been successful in many reinforcement learning settings. However, compared to human learners they are overly data hungry. To build a sample-efficient world model, we apply a transformer to real-world episodes in an…

Machine Learning · Computer Science 2023-03-14 Jan Robine , Marc Höftmann , Tobias Uelwer , Stefan Harmeling

Recently, model-based reinforcement learning algorithms have demonstrated remarkable efficacy in visual input environments. These approaches begin by constructing a parameterized simulation world model of the real environment through…

Machine Learning · Computer Science 2023-12-27 Weipu Zhang , Gang Wang , Jian Sun , Yetian Yuan , Gao Huang

Much of recent Deep Reinforcement Learning success is owed to the neural architecture's potential to learn and use effective internal representations of the world. While many current algorithms access a simulator to train with a large…

Artificial Intelligence · Computer Science 2022-02-03 Amir Ardalan Kalantari , Mohammad Amini , Sarath Chandar , Doina Precup

While deep reinforcement learning excels at solving tasks where large amounts of data can be collected through virtually unlimited interaction with the environment, learning from limited interaction remains a key challenge. We posit that an…

Machine Learning · Computer Science 2021-05-21 Max Schwarzer , Ankesh Anand , Rishab Goel , R Devon Hjelm , Aaron Courville , Philip Bachman

Recent work in deep reinforcement learning has allowed algorithms to learn complex tasks such as Atari 2600 games just from the reward provided by the game, but these algorithms presently require millions of training steps in order to…

Machine Learning · Computer Science 2018-01-09 Benjamin Spector , Serge Belongie

Sampled environment transitions are a critical input to deep reinforcement learning (DRL) algorithms. Current DRL benchmarks often allow for the cheap and easy generation of large amounts of samples such that perceived progress in DRL does…

Machine Learning · Computer Science 2021-02-10 Florian E. Dorner

Deep reinforcement learning agents have achieved state-of-the-art results by directly maximising cumulative reward. However, environments contain a much wider variety of possible training signals. In this paper, we introduce an agent that…

One of the biggest challenges to modern deep reinforcement learning (DRL) algorithms is sample efficiency. Many approaches learn a world model in order to train an agent entirely in imagination, eliminating the need for direct environment…

Artificial Intelligence · Computer Science 2024-02-20 Paul Mattes , Rainer Schlosser , Ralf Herbrich

The DreamerV3 algorithm recently obtained remarkable performance across diverse environment domains by learning an accurate world model based on Recurrent Neural Networks (RNNs). Following the success of model-based reinforcement learning…

Machine Learning · Computer Science 2025-05-27 Maxime Burchi , Radu Timofte

Model-free reinforcement learning (RL) can be used to learn effective policies for complex tasks, such as Atari games, even from image observations. However, this typically requires very large amounts of interaction -- substantially more,…

A longstanding goal of the field of AI is a method for learning a highly capable, generalist agent from diverse experience. In the subfields of vision and language, this was largely achieved by scaling up transformer-based models and…

World models learn to simulate environment dynamics from experience, enabling sample-efficient reinforcement learning. But what do these models actually represent internally? We apply interpretability techniques--including linear and…

Machine Learning · Computer Science 2026-03-24 Xinyu Zhang

Text-based games provide an interactive way to study natural language processing. While deep reinforcement learning has shown effectiveness in developing the game playing agent, the low sample efficiency and the large action space remain to…

Computation and Language · Computer Science 2022-04-22 Yunqiu Xu , Meng Fang , Ling Chen , Yali Du , Joey Tianyi Zhou , Chengqi Zhang

Intelligent agents need a physical understanding of the world to predict the impact of their actions in the future. While learning-based models of the environment dynamics have contributed to significant improvements in sample efficiency…

Machine Learning · Computer Science 2020-05-20 Eric Heiden , David Millard , Hejia Zhang , Gaurav S. Sukhatme

This paper presents an actor-critic deep reinforcement learning agent with experience replay that is stable, sample efficient, and performs remarkably well on challenging environments, including the discrete 57-game Atari domain and several…

Machine Learning · Computer Science 2017-07-11 Ziyu Wang , Victor Bapst , Nicolas Heess , Volodymyr Mnih , Remi Munos , Koray Kavukcuoglu , Nando de Freitas

In order perform a large variety of tasks and to achieve human-level performance in complex real-world environments, Artificial Intelligence (AI) Agents must be able to learn from their past experiences and gain both knowledge and an…

Machine Learning · Computer Science 2019-05-13 Andrei Claudiu Roibu

Data efficiency is a key challenge for deep reinforcement learning. We address this problem by using unlabeled data to pretrain an encoder which is then finetuned on a small amount of task-specific data. To encourage learning…

‹ Prev 1 2 3 10 Next ›