Related papers: Learning Transformer-based World Models with Contr…

Mastering Atari with Discrete World Models

Intelligent agents need to generalize from past experience to achieve goals in complex environments. World models facilitate such generalization and allow learning behaviors from imagined outcomes to increase sample-efficiency. While…

Machine Learning · Computer Science 2022-02-15 Danijar Hafner, Timothy Lillicrap, Mohammad Norouzi, Jimmy Ba

MuDreamer: Learning Predictive World Models without Reconstruction

The DreamerV3 agent recently demonstrated state-of-the-art performance in diverse domains, learning powerful world models in latent space using a pixel reconstruction loss. However, while the reconstruction loss is essential to Dreamer's…

Artificial Intelligence · Computer Science 2024-05-27 Maxime Burchi, Radu Timofte

Transformer-based World Models Are Happy With 100k Interactions

Deep neural networks have been successful in many reinforcement learning settings. However, compared to human learners they are overly data hungry. To build a sample-efficient world model, we apply a transformer to real-world episodes in an…

Machine Learning · Computer Science 2023-03-14 Jan Robine, Marc Höftmann, Tobias Uelwer, Stefan Harmeling

STORM: Efficient Stochastic Transformer based World Models for Reinforcement Learning

Recently, model-based reinforcement learning algorithms have demonstrated remarkable efficacy in visual input environments. These approaches begin by constructing a parameterized simulation world model of the real environment through…

Machine Learning · Computer Science 2023-12-27 Weipu Zhang, Gang Wang, Jian Sun, Yetian Yuan, Gao Huang

CURLing the Dream: Contrastive Representations for World Modeling in Reinforcement Learning

In this work, we present Curled-Dreamer, a novel reinforcement learning algorithm that integrates contrastive learning into the DreamerV3 framework to enhance performance in visual reinforcement learning tasks. By incorporating the…

Machine Learning · Computer Science 2024-09-04 Victor Augusto Kich, Jair Augusto Bottega, Raul Steinmetz, Ricardo Bedin Grando, Ayano Yorozu, Akihisa Ohya

DreamerPro: Reconstruction-Free Model-Based Reinforcement Learning with Prototypical Representations

Top-performing Model-Based Reinforcement Learning (MBRL) agents, such as Dreamer, learn the world model by reconstructing the image observations. Hence, they often fail to discard task-irrelevant details and struggle to handle visual…

Machine Learning · Computer Science 2021-10-28 Fei Deng, Ingook Jang, Sungjin Ahn

Transformers are Sample-Efficient World Models

Deep reinforcement learning agents are notoriously sample inefficient, which considerably limits their application to real-world problems. Recently, many model-based methods have been designed to address this issue, with learning in the…

Machine Learning · Computer Science 2023-03-02 Vincent Micheli, Eloi Alonso, François Fleuret

World Model Robustness via Surprise Recognition

AI systems deployed in the real world must contend with distractions and out-of-distribution (OOD) noise that can destabilize their policies and lead to unsafe behavior. While robust training can reduce sensitivity to some forms of noise,…

Machine Learning · Computer Science 2025-12-02 Geigh Zollicoffer, Tanush Chopra, Mingkuan Yan, Xiaoxu Ma, Kenneth Eaton, Mark Riedl

TransDreamer: Reinforcement Learning with Transformer World Models

The Dreamer agent provides various benefits of Model-Based Reinforcement Learning (MBRL) such as sample efficiency, reusable knowledge, and safe planning. However, its world model and policy networks inherit the limitations of recurrent…

Machine Learning · Computer Science 2024-11-20 Chang Chen, Yi-Fu Wu, Jaesik Yoon, Sungjin Ahn

DayDreamer: World Models for Physical Robot Learning

To solve tasks in complex environments, robots need to learn from experience. Deep reinforcement learning is a common approach to robot learning but requires a large amount of trial and error to learn, limiting its deployment in the…

Robotics · Computer Science 2022-06-29 Philipp Wu, Alejandro Escontrela, Danijar Hafner, Ken Goldberg, Pieter Abbeel

Improving Transformer World Models for Data-Efficient RL

We present three improvements to the standard model-based RL paradigm based on transformers: (a) "Dyna with warmup", which trains the policy on real and imaginary data, but only starts using imaginary data after the world model has been…

Machine Learning · Computer Science 2025-07-18 Antoine Dedieu, Joseph Ortiz, Xinghua Lou, Carter Wendelken, Wolfgang Lehrach, J Swaroop Guntupalli, Miguel Lazaro-Gredilla, Kevin Patrick Murphy

The Effectiveness of World Models for Continual Reinforcement Learning

World models power some of the most efficient reinforcement learning algorithms. In this work, we showcase that they can be harnessed for continual learning, a setting in which the agent faces changing environments. World models typically…

Machine Learning · Computer Science 2023-07-14 Samuel Kessler, Mateusz Ostaszewski, Michał Bortkiewicz, Mateusz Żarski, Maciej Wołczyk, Jack Parker-Holder, Stephen J. Roberts, Piotr Miłoś

TREND: Unsupervised 3D Representation Learning via Temporal Forecasting for LiDAR Perception

Labeling LiDAR point clouds is notoriously time-and-energy-consuming, which spurs recent unsupervised 3D representation learning methods to alleviate the labeling burden in LiDAR perception via pretrained weights. Almost all existing work…

Computer Vision and Pattern Recognition · Computer Science 2026-03-02 Runjian Chen, Hyoungseob Park, Bo Zhang, Wenqi Shao, Ping Luo, Alex Wong

RLVR-World: Training World Models with Reinforcement Learning

World models predict state transitions in response to actions and are increasingly developed across diverse modalities. However, standard training objectives such as maximum likelihood estimation (MLE) often misalign with task-specific…

Machine Learning · Computer Science 2025-10-28 Jialong Wu, Shaofeng Yin, Ningya Feng, Mingsheng Long

Task Aware Dreamer for Task Generalization in Reinforcement Learning

A long-standing goal of reinforcement learning is to acquire agents that can learn on training tasks and generalize well on unseen tasks that may share a similar dynamic but with different reward functions. The ability to generalize across…

Machine Learning · Computer Science 2026-01-26 Chengyang Ying, Xinning Zhou, Zhongkai Hao, Hang Su, Songming Liu, Dong Yan, Jun Zhu

Dream to Control: Learning Behaviors by Latent Imagination

Learned world models summarize an agent's experience to facilitate learning complex behaviors. While learning world models from high-dimensional sensory inputs is becoming feasible through deep learning, there are many potential ways for…

Machine Learning · Computer Science 2020-03-18 Danijar Hafner, Timothy Lillicrap, Jimmy Ba, Mohammad Norouzi

DreamerV3 for Traffic Signal Control: Hyperparameter Tuning and Performance

Reinforcement learning (RL) has evolved into a widely investigated technology for the development of smart TSC strategies. However, current RL algorithms necessitate excessive interaction with the environment to learn effective policies,…

Machine Learning · Computer Science 2025-03-05 Qiang Li, Yinhan Lin, Qin Luo, Lina Yu

Zero-shot World Models via Search in Memory

World Models have vastly permeated the field of Reinforcement Learning. Their ability to model the transition dynamics of an environment has greatly improved sample efficiency in online RL. Among them, the most notorious example is…

Machine Learning · Computer Science 2025-10-21 Federico Malato, Ville Hautamäki

DreamingV2: Reinforcement Learning with Discrete World Models without Reconstruction

The present paper proposes a novel reinforcement learning method with world models, DreamingV2, a collaborative extension of DreamerV2 and Dreaming. DreamerV2 is a cutting-edge model-based reinforcement learning from pixels that uses…

Machine Learning · Computer Science 2022-03-02 Masashi Okada, Tadahiro Taniguchi

ReCoRe: Regularized Contrastive Representation Learning of World Model

While recent model-free Reinforcement Learning (RL) methods have demonstrated human-level effectiveness in gaming environments, their success in everyday tasks like visual navigation has been limited, particularly under significant…

Machine Learning · Computer Science 2024-04-04 Rudra P. K. Poudel, Harit Pandya, Stephan Liwicki, Roberto Cipolla