Related papers: Multi-timestep models for Model-based Reinforcemen…

A Multi-step Loss Function for Robust Learning of the Dynamics in Model-based Reinforcement Learning

In model-based reinforcement learning, most algorithms rely on simulating trajectories from one-step models of the dynamics learned on data. A critical challenge of this approach is the compounding of one-step prediction errors as the…

Machine Learning · Computer Science 2024-02-06 Abdelhakim Benechehab , Albert Thomas , Giuseppe Paolo , Maurizio Filippone , Balázs Kégl

Model Imitation for Model-Based Reinforcement Learning

Model-based reinforcement learning (MBRL) aims to learn a dynamic model to reduce the number of interactions with real-world environments. However, due to estimation error, rollouts in the learned model, especially those of long horizons,…

Machine Learning · Computer Science 2020-03-17 Yueh-Hua Wu , Ting-Han Fan , Peter J. Ramadge , Hao Su

Model-based Reinforcement Learning with Multi-step Plan Value Estimation

A promising way to improve the sample efficiency of reinforcement learning is model-based methods, in which many explorations and evaluations can happen in the learned models to save real-world samples. However, when the learned model has a…

Machine Learning · Computer Science 2022-09-14 Haoxin Lin , Yihao Sun , Jiaji Zhang , Yang Yu

A Survey on Model-based Reinforcement Learning

Reinforcement learning (RL) solves sequential decision-making problems via a trial-and-error process interacting with the environment. While RL achieves outstanding success in playing complex video games that allow huge trial-and-error,…

Machine Learning · Computer Science 2022-06-22 Fan-Ming Luo , Tian Xu , Hang Lai , Xiong-Hui Chen , Weinan Zhang , Yang Yu

Towards a Simple Approach to Multi-step Model-based Reinforcement Learning

When environmental interaction is expensive, model-based reinforcement learning offers a solution by planning ahead and avoiding costly mistakes. Model-based agents typically learn a single-step transition model. In this paper, we propose a…

Machine Learning · Computer Science 2018-11-02 Kavosh Asadi , Evan Cater , Dipendra Misra , Michael L. Littman

Multi-Agent Reinforcement Learning with Multi-Step Generative Models

We consider model-based reinforcement learning (MBRL) in 2-agent, high-fidelity continuous control problems -- an important domain for robots interacting with other agents in the same workspace. For non-trivial dynamical systems, MBRL…

Machine Learning · Computer Science 2019-11-04 Orr Krupnik , Igor Mordatch , Aviv Tamar

Accelerating Model-Based Reinforcement Learning with State-Space World Models

Reinforcement learning (RL) is a powerful approach for robot learning. However, model-free RL (MFRL) requires a large number of environment interactions to learn successful control policies. This is due to the noisy RL training updates and…

Robotics · Computer Science 2025-02-28 Maria Krinner , Elie Aljalbout , Angel Romero , Davide Scaramuzza

A Model-based Approach for Sample-efficient Multi-task Reinforcement Learning

The aim of multi-task reinforcement learning is two-fold: (1) efficiently learn by training against multiple tasks and (2) quickly adapt, using limited samples, to a variety of new tasks. In this work, the tasks correspond to reward…

Machine Learning · Computer Science 2019-11-05 Nicholas C. Landolfi , Garrett Thomas , Tengyu Ma

Combating the Compounding-Error Problem with a Multi-step Model

Model-based reinforcement learning is an appealing framework for creating agents that learn, plan, and act in sequential environments. Model-based algorithms typically involve learning a transition model that takes a state and an action and…

Machine Learning · Computer Science 2019-06-03 Kavosh Asadi , Dipendra Misra , Seungchan Kim , Michel L. Littman

Discriminator Augmented Model-Based Reinforcement Learning

By planning through a learned dynamics model, model-based reinforcement learning (MBRL) offers the prospect of good performance with little environment interaction. However, it is common in practice for the learned model to be inaccurate,…

Machine Learning · Computer Science 2021-03-31 Behzad Haghgoo , Allan Zhou , Archit Sharma , Chelsea Finn

DreamSmooth: Improving Model-based Reinforcement Learning via Reward Smoothing

Model-based reinforcement learning (MBRL) has gained much attention for its ability to learn complex behaviors in a sample-efficient way: planning actions by generating imaginary trajectories with predicted rewards. Despite its success, we…

Machine Learning · Computer Science 2024-02-20 Vint Lee , Pieter Abbeel , Youngwoon Lee

Objective Mismatch in Model-based Reinforcement Learning

Model-based reinforcement learning (MBRL) has been shown to be a powerful framework for data-efficiently learning control of continuous tasks. Recent work in MBRL has mostly focused on using more advanced function approximators and planning…

Machine Learning · Computer Science 2021-04-20 Nathan Lambert , Brandon Amos , Omry Yadan , Roberto Calandra

Should Models Be Accurate?

Model-based Reinforcement Learning (MBRL) holds promise for data-efficiency by planning with model-generated experience in addition to learning with experience from the environment. However, in complex or changing environments, models in…

Machine Learning · Computer Science 2022-05-24 Esra'a Saleh , John D. Martin , Anna Koop , Arash Pourzarabi , Michael Bowling

Reinforcement Learning based dynamic weighing of Ensemble Models for Time Series Forecasting

Ensemble models are powerful model building tools that are developed with a focus to improve the accuracy of model predictions. They find applications in time series forecasting in varied scenarios including but not limited to process…

Machine Learning · Computer Science 2020-08-21 Satheesh K. Perepu , Bala Shyamala Balaji , Hemanth Kumar Tanneru , Sudhakar Kathari , Vivek Shankar Pinnamaraju

AMOR: Adaptive Character Control through Multi-Objective Reinforcement Learning

Reinforcement learning (RL) has significantly advanced the control of physics-based and robotic characters that track kinematic reference motion. However, methods typically rely on a weighted sum of conflicting reward functions, requiring…

Robotics · Computer Science 2025-05-30 Lucas N. Alegre , Agon Serifi , Ruben Grandia , David Müller , Espen Knoop , Moritz Bächer

Continuous-Time Model-Based Reinforcement Learning

Model-based reinforcement learning (MBRL) approaches rely on discrete-time state transition models whereas physical systems and the vast majority of control tasks operate in continuous-time. To avoid time-discretization approximation of the…

Machine Learning · Computer Science 2021-06-14 Çağatay Yıldız , Markus Heinonen , Harri Lähdesmäki

Learning to Reweight Imaginary Transitions for Model-Based Reinforcement Learning

Model-based reinforcement learning (RL) is more sample efficient than model-free RL by using imaginary trajectories generated by the learned dynamics model. When the model is inaccurate or biased, imaginary trajectories may be deleterious…

Machine Learning · Computer Science 2021-04-12 Wenzhen Huang , Qiyue Yin , Junge Zhang , Kaiqi Huang

A Unified View on Solving Objective Mismatch in Model-Based Reinforcement Learning

Model-based Reinforcement Learning (MBRL) aims to make agents more sample-efficient, adaptive, and explainable by learning an explicit model of the environment. While the capabilities of MBRL agents have significantly improved in recent…

Machine Learning · Computer Science 2024-04-09 Ran Wei , Nathan Lambert , Anthony McDonald , Alfredo Garcia , Roberto Calandra

R1-Reward: Training Multimodal Reward Model Through Stable Reinforcement Learning

Multimodal Reward Models (MRMs) play a crucial role in enhancing the performance of Multimodal Large Language Models (MLLMs). While recent advancements have primarily focused on improving the model structure and training data of MRMs, there…

Computer Vision and Pattern Recognition · Computer Science 2025-05-12 Yi-Fan Zhang , Xingyu Lu , Xiao Hu , Chaoyou Fu , Bin Wen , Tianke Zhang , Changyi Liu , Kaiyu Jiang , Kaibing Chen , Kaiyu Tang , Haojie Ding , Jiankang Chen , Fan Yang , Zhang Zhang , Tingting Gao , Liang Wang

The Virtues of Laziness in Model-based RL: A Unified Objective and Algorithms

We propose a novel approach to addressing two fundamental challenges in Model-based Reinforcement Learning (MBRL): the computational expense of repeatedly finding a good policy in the learned model, and the objective mismatch between model…

Machine Learning · Computer Science 2023-03-02 Anirudh Vemula , Yuda Song , Aarti Singh , J. Andrew Bagnell , Sanjiban Choudhury