Related papers: Discrete Codebook World Models for Continuous Cont…

Learning Constrained Adaptive Differentiable Predictive Control Policies With Guarantees

We present differentiable predictive control (DPC), a method for learning constrained neural control policies for linear systems with probabilistic performance guarantees. We employ automatic differentiation to obtain direct policy…

Systems and Control · Electrical Eng. & Systems 2022-01-28 Jan Drgona , Aaron Tuor , Draguna Vrabie

Point Cloud Models Improve Visual Robustness in Robotic Learners

Visual control policies can encounter significant performance degradation when visual conditions like lighting or camera position differ from those seen during training -- often exhibiting sharp declines in capability even for minor…

Robotics · Computer Science 2024-04-30 Skand Peri , Iain Lee , Chanho Kim , Li Fuxin , Tucker Hermans , Stefan Lee

DyMoDreamer: World Modeling with Dynamic Modulation

A critical bottleneck in deep reinforcement learning (DRL) is sample inefficiency, as training high-performance agents often demands extensive environmental interactions. Model-based reinforcement learning (MBRL) mitigates this by building…

Machine Learning · Computer Science 2025-09-30 Boxuan Zhang , Runqing Wang , Wei Xiao , Weipu Zhang , Jian Sun , Gao Huang , Jie Chen , Gang Wang

Contextual Latent World Models for Offline Meta Reinforcement Learning

Offline meta-reinforcement learning seeks to learn policies that generalize across related tasks from fixed datasets. Context-based methods infer a task representation from transition histories, but learning effective task representations…

Machine Learning · Computer Science 2026-03-04 Mohammadreza Nakheai , Aidan Scannell , Kevin Luck , Joni Pajarinen

World Modelling Improves Language Model Agents

Tool use in stateful environments presents unique challenges for large language models (LLMs), where existing test-time compute strategies relying on repeated trials in the environment are impractical. We propose dynamics modelling (DyMo),…

Artificial Intelligence · Computer Science 2025-09-22 Shangmin Guo , Omar Darwiche Domingues , Raphaël Avalos , Aaron Courville , Florian Strub

World Models for Anomaly Detection during Model-Based Reinforcement Learning Inference

Learning-based controllers are often purposefully kept out of real-world applications due to concerns about their safety and reliability. We explore how state-of-the-art world models in Model-Based Reinforcement Learning can be utilized…

Robotics · Computer Science 2025-03-05 Fabian Domberg , Georg Schildbach

DR-MPC: Deep Residual Model Predictive Control for Real-world Social Navigation

How can a robot safely navigate around people with complex motion patterns? Deep Reinforcement Learning (DRL) in simulation holds some promise, but much prior work relies on simulators that fail to capture the nuances of real human motion.…

Robotics · Computer Science 2025-02-17 James R. Han , Hugues Thomas , Jian Zhang , Nicholas Rhinehart , Timothy D. Barfoot

Multimodal Dreaming: A Global Workspace Approach to World Model-Based Reinforcement Learning

Humans leverage rich internal models of the world to reason about the future, imagine counterfactuals, and adapt flexibly to new situations. In Reinforcement Learning (RL), world models aim to capture how the environment evolves in response…

Artificial Intelligence · Computer Science 2025-10-29 Léopold Maytié , Roland Bertin Johannet , Rufin VanRullen

Temporal Difference Learning for Model Predictive Control

Data-driven model predictive control has two key advantages over model-free methods: a potential for improved sample efficiency through model learning, and better performance as computational budget for planning increases. However, it is…

Machine Learning · Computer Science 2022-07-21 Nicklas Hansen , Xiaolong Wang , Hao Su

Distilling Reinforcement Learning Algorithms for In-Context Model-Based Planning

Recent studies have shown that Transformers can perform in-context reinforcement learning (RL) by imitating existing RL algorithms, enabling sample-efficient adaptation to unseen tasks without parameter updates. However, these models also…

Machine Learning · Computer Science 2025-02-27 Jaehyeon Son , Soochan Lee , Gunhee Kim

Latent feedback control of distributed systems in multiple scenarios through deep learning-based reduced order models

Continuous monitoring and real-time control of high-dimensional distributed systems are often crucial in applications to ensure a desired physical behavior, without degrading stability and system performances. Traditional feedback control…

Optimization and Control · Mathematics 2024-12-16 Matteo Tomasetto , Francesco Braghin , Andrea Manzoni

Accelerating Model-Based Reinforcement Learning with State-Space World Models

Reinforcement learning (RL) is a powerful approach for robot learning. However, model-free RL (MFRL) requires a large number of environment interactions to learn successful control policies. This is due to the noisy RL training updates and…

Robotics · Computer Science 2025-02-28 Maria Krinner , Elie Aljalbout , Angel Romero , Davide Scaramuzza

Model-Based Reinforcement Learning with Isolated Imaginations

World models learn the consequences of actions in vision-based interactive systems. However, in practical scenarios like autonomous driving, noncontrollable dynamics that are independent or sparsely dependent on action signals often exist,…

Machine Learning · Computer Science 2023-11-20 Minting Pan , Xiangming Zhu , Yitao Zheng , Yunbo Wang , Xiaokang Yang

Continual Learning Using World Models for Pseudo-Rehearsal

The utility of learning a dynamics/world model of the environment in reinforcement learning has been shown in a many ways. When using neural networks, however, these models suffer catastrophic forgetting when learned in a lifelong or…

Machine Learning · Computer Science 2019-06-12 Nicholas Ketz , Soheil Kolouri , Praveen Pilly

Back to Parsimonious Latents: Learning Task-Centric World Models from Visual Foundations

World models enable agents to predict future dynamics conditioned on actions, making the choice of latent representation central to planning and control. Such representations are often either learned directly from pixels with limited…

Artificial Intelligence · Computer Science 2026-05-26 Minghao Fu , Fan Feng , Nicklas Hansen , Biwei Huang

InstructMPC: A Human-LLM-in-the-Loop Framework for Context-Aware Control

Model Predictive Control (MPC) is a powerful control strategy widely utilized in domains like energy management, building control, and autonomous systems. However, its effectiveness in real-world settings is challenged by the need to…

Systems and Control · Electrical Eng. & Systems 2025-09-08 Ruixiang Wu , Jiahao Ai , Tongxin Li

Compositional Discrete Latent Code for High Fidelity, Productive Diffusion Models

We argue that diffusion models' success in modeling complex distributions is, for the most part, coming from their input conditioning. This paper investigates the representation used to condition diffusion models from the perspective that…

Computer Vision and Pattern Recognition · Computer Science 2026-01-07 Samuel Lavoie , Michael Noukhovitch , Aaron Courville

Robust Robotic Control from Pixels using Contrastive Recurrent State-Space Models

Modeling the world can benefit robot learning by providing a rich training signal for shaping an agent's latent state space. However, learning world models in unconstrained environments over high-dimensional observation spaces such as…

Machine Learning · Computer Science 2021-12-03 Nitish Srivastava , Walter Talbott , Martin Bertran Lopez , Shuangfei Zhai , Josh Susskind

Parallelized Robust Distributed Model Predictive Control in the Presence of Coupled State Constraints

In this paper, we present a robust distributed model predictive control (DMPC) scheme for dynamically decoupled nonlinear systems which are subject to state constraints, coupled state constraints and input constraints. In the proposed…

Systems and Control · Electrical Eng. & Systems 2024-10-07 Adrian Wiltz , Fei Chen , Dimos V. Dimarogonas

World Models for Autonomous Navigation of Terrestrial Robots from LIDAR Observations

Autonomous navigation of terrestrial robots using Reinforcement Learning (RL) from LIDAR observations remains challenging due to the high dimensionality of sensor data and the sample inefficiency of model-free approaches. Conventional…

Robotics · Computer Science 2025-12-04 Raul Steinmetz , Fabio Demo Rosa , Victor Augusto Kich , Jair Augusto Bottega , Ricardo Bedin Grando , Daniel Fernando Tello Gamarra