Related papers: Model-Based Reinforcement Learning with Multi-Task…

Finetuning Offline World Models in the Real World

Reinforcement Learning (RL) is notoriously data-inefficient, which makes training on a real robot difficult. While model-based RL algorithms (world models) improve data-efficiency to some extent, they still require hours or days of…

Machine Learning · Computer Science 2023-10-25 Yunhai Feng, Nicklas Hansen, Ziyan Xiong, Chandramouli Rajagopalan, Xiaolong Wang

A Workflow for Offline Model-Free Robotic Reinforcement Learning

Offline reinforcement learning (RL) enables learning control policies by utilizing only prior experience, without any online interaction. This can allow robots to acquire generalizable skills from large and diverse datasets, without any…

Machine Learning · Computer Science 2021-09-24 Aviral Kumar, Anikait Singh, Stephen Tian, Chelsea Finn, Sergey Levine

On the Feasibility of Cross-Task Transfer with Model-Based Reinforcement Learning

Reinforcement Learning (RL) algorithms can solve challenging control problems directly from image observations, but they often require millions of environment interactions to do so. Recently, model-based RL algorithms have greatly improved…

Machine Learning · Computer Science 2023-06-16 Yifan Xu, Nicklas Hansen, Zirui Wang, Yung-Chieh Chan, Hao Su, Zhuowen Tu

Policy-Driven World Model Adaptation for Robust Offline Model-based Reinforcement Learning

Offline reinforcement learning (RL) offers a powerful paradigm for data-driven control. Compared to model-free approaches, offline model-based RL (MBRL) explicitly learns a world model from a static dataset and uses it as a surrogate…

Machine Learning · Computer Science 2026-02-02 Jiayu Chen, Le Xu, Aravind Venugopal, Jeff Schneider

Making Offline RL Online: Collaborative World Models for Offline Visual Reinforcement Learning

Training offline RL models using visual inputs poses two significant challenges, i.e., the overfitting problem in representation learning and the overestimation bias for expected future rewards. Recent work has attempted to alleviate the…

Machine Learning · Computer Science 2024-10-30 Qi Wang, Junming Yang, Yunbo Wang, Xin Jin, Wenjun Zeng, Xiaokang Yang

RM-RL: Role-Model Reinforcement Learning for Precise Robot Manipulation

Precise robot manipulation is critical for fine-grained applications such as chemical and biological experiments, where even small errors (e.g., reagent spillage) can invalidate an entire task. Existing approaches often rely on…

Robotics · Computer Science 2026-02-13 Xiangyu Chen, Chuhao Zhou, Yuxi Liu, Jianfei Yang

Offline vs. Online Learning in Model-based RL: Lessons for Data Collection Strategies

Data collection is crucial for learning robust world models in model-based reinforcement learning. The most prevalent strategies are to actively collect trajectories by interacting with the environment during online training or training on…

Machine Learning · Computer Science 2025-09-09 Jiaqi Chen, Ji Shi, Cansu Sancaktar, Jonas Frey, Georg Martius

Physics-Informed Model-Based Reinforcement Learning

We apply reinforcement learning (RL) to robotics tasks. One of the drawbacks of traditional RL algorithms has been their poor sample efficiency. One approach to improve the sample efficiency is model-based RL. In our model-based RL…

Machine Learning · Computer Science 2023-05-16 Adithya Ramesh, Balaraman Ravindran

A Real-Time Model-Based Reinforcement Learning Architecture for Robot Control

Reinforcement Learning (RL) is a method for learning decision-making tasks that could enable robots to learn and adapt to their situation on-line. For an RL algorithm to be practical for robotic control tasks, it must learn in very few…

Artificial Intelligence · Computer Science 2015-03-19 Todd Hester, Michael Quinlan, Peter Stone

Multi-Objective Decision Transformers for Offline Reinforcement Learning

Offline Reinforcement Learning (RL) is structured to derive policies from static trajectory data without requiring real-time environment interactions. Recent studies have shown the feasibility of framing offline RL as a sequence modeling…

Machine Learning · Computer Science 2023-09-01 Abdelghani Ghanem, Philippe Ciblat, Mounir Ghogho

WOMBET: World Model-based Experience Transfer for Robust and Sample-efficient Reinforcement Learning

Reinforcement learning (RL) in robotics is often limited by the cost and risk of data collection, motivating experience transfer from a source task to a target task. Offline-to-online RL leverages prior data but typically assumes a given…

Machine Learning · Computer Science 2026-04-16 Mintae Kim, Koushil Sreenath

Offline Meta-Reinforcement Learning with Online Self-Supervision

Meta-reinforcement learning (RL) methods can meta-train policies that adapt to new tasks with orders of magnitude less data than standard RL, but meta-training itself is costly and time-consuming. If we can meta-train on offline data, then…

Machine Learning · Computer Science 2022-07-08 Vitchyr H. Pong, Ashvin Nair, Laura Smith, Catherine Huang, Sergey Levine

Offline Reinforcement Learning Hands-On

Offline Reinforcement Learning (RL) aims to turn large datasets into powerful decision-making engines without any online interactions with the environment. This great promise has motivated a large amount of research that hopes to replicate…

Machine Learning · Computer Science 2020-12-01 Louis Monier, Jakub Kmec, Alexandre Laterre, Thomas Pierrot, Valentin Courgeau, Olivier Sigaud, Karim Beguir

Deploying Offline Reinforcement Learning with Human Feedback

Reinforcement learning (RL) has shown promise for decision-making tasks in real-world applications. One practical framework involves training parameterized policy models from an offline dataset and subsequently deploying them in an online…

Machine Learning · Computer Science 2023-03-14 Ziniu Li, Ke Xu, Liu Liu, Lanqing Li, Deheng Ye, Peilin Zhao

Offline-to-Online Reinforcement Learning via Balanced Replay and Pessimistic Q-Ensemble

Recent advances in deep offline reinforcement learning (RL) have made it possible to train strong robotic agents from offline datasets. However, depending on the quality of the trained agents and the application being considered, it is often…

Robotics · Computer Science 2021-11-02 Seunghyun Lee, Younggyo Seo, Kimin Lee, Pieter Abbeel, Jinwoo Shin

Offline Learning of Counterfactual Predictions for Real-World Robotic Reinforcement Learning

We consider real-world reinforcement learning (RL) of robotic manipulation tasks that involve both visuomotor skills and contact-rich skills. We aim to train a policy that maps multimodal sensory observations (vision and force) to a…

Robotics · Computer Science 2022-03-01 Jun Jin, Daniel Graves, Cameron Haigh, Jun Luo, Martin Jagersand

Can Offline Reinforcement Learning Help Natural Language Understanding?

Pre-training has been a useful method for learning implicit transferable knowledge, and it shows the benefit of offering complementary features across different modalities. Recent work mainly focuses on modalities such as image and text,…

Computation and Language · Computer Science 2022-12-09 Ziqi Zhang, Yile Wang, Yue Zhang, Donglin Wang

Pretraining a Shared Q-Network for Data-Efficient Offline Reinforcement Learning

Offline reinforcement learning (RL) aims to learn a policy from a static dataset without further interactions with the environment. Collecting sufficiently large datasets for offline RL is exhausting since this data collection requires…

Artificial Intelligence · Computer Science 2025-10-22 Jongchan Park, Mingyu Park, Donghwan Lee

Quantifying Multimodality in World Models

Model-based Deep Reinforcement Learning (RL) assumes the availability of a model of an environment's underlying transition dynamics. This model can be used to predict future effects of an agent's possible actions. When no such model is…

Machine Learning · Computer Science 2021-12-15 Andreas Sedlmeier, Michael Kölle, Robert Müller, Leo Baudrexel, Claudia Linnhoff-Popien

Efficient Reinforcement Learning by Guiding Generalist World Models with Non-Curated Data

Leveraging offline data is a promising way to improve the sample efficiency of online reinforcement learning (RL). This paper expands the pool of usable data for offline-to-online RL by leveraging abundant non-curated data that is…

Machine Learning · Computer Science 2025-05-20 Yi Zhao, Aidan Scannell, Wenshuai Zhao, Yuxin Hou, Tianyu Cui, Le Chen, Dieter Büchler, Arno Solin, Juho Kannala, Joni Pajarinen