Related papers: Discrete Codebook World Models for Continuous Cont…

Discrete World Models via Regularization

World models aim to capture the states and dynamics of an environment in a compact latent space. Moreover, using Boolean state representations is particularly useful for search heuristics and symbolic reasoning and planning. Existing…

Machine Learning · Computer Science 2026-03-03 Davide Bizzaro , Luciano Serafini

Harnessing Discrete Representations For Continual Reinforcement Learning

Reinforcement learning (RL) agents make decisions using nothing but observations from the environment, and consequently, heavily rely on the representations of those observations. Though some recent breakthroughs have used vector-based…

Machine Learning · Computer Science 2024-07-16 Edan Meyer , Adam White , Marlos C. Machado

Cycle-Consistent World Models for Domain Independent Latent Imagination

End-to-end autonomous driving seeks to solve the perception, decision, and control problems in an integrated way, which can be easier to generalize at scale and be more adapting to new scenarios. However, high costs and risks make it very…

Machine Learning · Computer Science 2022-06-08 Sidney Bender , Tim Joseph , Marius Zoellner

TD-MPC2: Scalable, Robust World Models for Continuous Control

TD-MPC is a model-based reinforcement learning (RL) algorithm that performs local trajectory optimization in the latent space of a learned implicit (decoder-free) world model. In this work, we present TD-MPC2: a series of improvements upon…

Machine Learning · Computer Science 2024-03-22 Nicklas Hansen , Hao Su , Xiaolong Wang

When Object-Centric World Models Meet Policy Learning: From Pixels to Policies, and Where It Breaks

Object-centric world models (OCWM) aim to decompose visual scenes into object-level representations, providing structured abstractions that could improve compositional generalization and data efficiency in reinforcement learning. We…

Artificial Intelligence · Computer Science 2025-11-12 Stefano Ferraro , Akihiro Nakano , Masahiro Suzuki , Yutaka Matsuo

DDP-WM: Disentangled Dynamics Prediction for Efficient World Models

World models are essential for autonomous robotic planning. However, the substantial computational overhead of existing dense Transformerbased models significantly hinders real-time deployment. To address this efficiency-performance…

Computer Vision and Pattern Recognition · Computer Science 2026-03-06 Shicheng Yin , Kaixuan Yin , Weixing Chen , Yang Liu , Guanbin Li , Liang Lin

ResWM: Residual-Action World Model for Visual RL

Learning predictive world models from raw visual observations is a central challenge in reinforcement learning (RL), especially for robotics and continuous control. Conventional model-based RL frameworks directly condition future…

Robotics · Computer Science 2026-03-13 Jseen Zhang , Gabriel Adineera , Jinzhou Tan , Jinoh Kim

Pre-training Contextualized World Models with In-the-wild Videos for Reinforcement Learning

Unsupervised pre-training methods utilizing large and diverse datasets have achieved tremendous success across a range of domains. Recent work has investigated such unsupervised pre-training methods for model-based reinforcement learning…

Computer Vision and Pattern Recognition · Computer Science 2023-10-30 Jialong Wu , Haoyu Ma , Chaoyi Deng , Mingsheng Long

Latent Action World Models for Control with Unlabeled Trajectories

Inspired by how humans combine direct interaction with action-free experience (e.g., videos), we study world models that learn from heterogeneous data. Standard world models typically rely on action-conditioned trajectories, which limits…

Machine Learning · Computer Science 2025-12-12 Marvin Alles , Xingyuan Zhang , Patrick van der Smagt , Philip Becker-Ehmck

The Effectiveness of World Models for Continual Reinforcement Learning

World models power some of the most efficient reinforcement learning algorithms. In this work, we showcase that they can be harnessed for continual learning - a situation when the agent faces changing environments. World models typically…

Machine Learning · Computer Science 2023-07-14 Samuel Kessler , Mateusz Ostaszewski , Michał Bortkiewicz , Mateusz Żarski , Maciej Wołczyk , Jack Parker-Holder , Stephen J. Roberts , Piotr Miłoś

Deep Learning Alternative to Explicit Model Predictive Control for Unknown Nonlinear Systems

We present differentiable predictive control (DPC) as a deep learning-based alternative to the explicit model predictive control (MPC) for unknown nonlinear systems. In the DPC framework, a neural state-space model is learned from…

Systems and Control · Electrical Eng. & Systems 2021-07-27 Jan Drgona , Karol Kis , Aaron Tuor , Draguna Vrabie , Martin Klauco

Dream-MPC: Gradient-Based Model Predictive Control with Latent Imagination

State-of-the-art model-based Reinforcement Learning (RL) approaches either use gradient-free, population-based methods for planning, learned policy networks, or a combination of policy networks and planning. Hybrid approaches that combine…

Machine Learning · Computer Science 2026-05-25 Jonathan Spieler , Sven Behnke

World Models as Reference Trajectories for Rapid Motor Adaptation

Deploying learned control policies in real-world environments poses a fundamental challenge. When system dynamics change unexpectedly, performance degrades until models are retrained on new data. We introduce Reflexive World Models (RWM), a…

Machine Learning · Computer Science 2025-05-22 Carlos Stein Brito , Daniel McNamee

Reconstruction or Semantics? What Makes a Latent Space Useful for Robotic World Models

World model-based policy evaluation is a practical proxy for testing real-world robot control by rolling out candidate actions in action-conditioned video diffusion models. As these models increasingly adopt latent diffusion modeling (LDM),…

Computer Vision and Pattern Recognition · Computer Science 2026-05-08 Nilaksh , Saurav Jha , Artem Zholus , Sarath Chandar

Discrete Control in Real-World Driving Environments using Deep Reinforcement Learning

Training self-driving cars is often challenging since they require a vast amount of labeled data in multiple real-world contexts, which is computationally and memory intensive. Researchers often resort to driving simulators to train the…

Artificial Intelligence · Computer Science 2022-12-01 Avinash Amballa , Advaith P. , Pradip Sasmal , Sumohana Channappayya

Coevolutionary Continuous Discrete Diffusion: Make Your Diffusion Language Model a Latent Reasoner

Diffusion language models, especially masked discrete diffusion models, have achieved great success recently. While there are some theoretical and primary empirical results showing the advantages of latent reasoning with looped transformers…

Artificial Intelligence · Computer Science 2026-05-13 Cai Zhou , Chenxiao Yang , Yi Hu , Chenyu Wang , Chubin Zhang , Muhan Zhang , Lester Mackey , Tommi Jaakkola , Stephen Bates , Dinghuai Zhang

DAWM: Diffusion Action World Models for Offline Reinforcement Learning via Action-Inferred Transitions

Diffusion-based world models have demonstrated strong capabilities in synthesizing realistic long-horizon trajectories for offline reinforcement learning (RL). However, many existing methods do not directly generate actions alongside states…

Machine Learning · Computer Science 2026-05-14 Zongyue Li , Xiao Han , Yusong Li , Niklas Strauss , Matthias Schubert

PWM: Policy Learning with Multi-Task World Models

Reinforcement Learning (RL) has made significant strides in complex tasks but struggles in multi-task settings with different embodiments. World model methods offer scalability by learning a simulation of the environment but often rely on…

Machine Learning · Computer Science 2025-02-25 Ignat Georgiev , Varun Giridhar , Nicklas Hansen , Animesh Garg

Temporal Predictive Coding For Model-Based Planning In Latent Space

High-dimensional observations are a major challenge in the application of model-based reinforcement learning (MBRL) to real-world environments. To handle high-dimensional sensory inputs, existing approaches use representation learning to…

Machine Learning · Computer Science 2021-06-15 Tung Nguyen , Rui Shu , Tuan Pham , Hung Bui , Stefano Ermon

Language-conditioned world model improves policy generalization by reading environmental descriptions

To interact effectively with humans in the real world, it is important for agents to understand language that describes the dynamics of the environment--that is, how the environment behaves--rather than just task instructions specifying…

Computation and Language · Computer Science 2025-12-01 Anh Nguyen , Stefan Lee