Related papers: Self-Correcting Models for Model-Based Reinforceme…

Should Models Be Accurate?

Model-based Reinforcement Learning (MBRL) holds promise for data-efficiency by planning with model-generated experience in addition to learning with experience from the environment. However, in complex or changing environments, models in…

Machine Learning · Computer Science 2022-05-24 Esra'a Saleh , John D. Martin , Anna Koop , Arash Pourzarabi , Michael Bowling

Discriminator Augmented Model-Based Reinforcement Learning

By planning through a learned dynamics model, model-based reinforcement learning (MBRL) offers the prospect of good performance with little environment interaction. However, it is common in practice for the learned model to be inaccurate,…

Machine Learning · Computer Science 2021-03-31 Behzad Haghgoo , Allan Zhou , Archit Sharma , Chelsea Finn

Policy-shaped prediction: avoiding distractions in model-based reinforcement learning

Model-based reinforcement learning (MBRL) is a promising route to sample-efficient policy optimization. However, a known vulnerability of reconstruction-based MBRL consists of scenarios in which detailed aspects of the world are highly…

Machine Learning · Computer Science 2024-12-10 Miles Hutson , Isaac Kauvar , Nick Haber

A Survey on Model-based Reinforcement Learning

Reinforcement learning (RL) solves sequential decision-making problems via a trial-and-error process interacting with the environment. While RL achieves outstanding success in playing complex video games that allow huge trial-and-error,…

Machine Learning · Computer Science 2022-06-22 Fan-Ming Luo , Tian Xu , Hang Lai , Xiong-Hui Chen , Weinan Zhang , Yang Yu

Plan To Predict: Learning an Uncertainty-Foreseeing Model for Model-Based Reinforcement Learning

In Model-based Reinforcement Learning (MBRL), model learning is critical since an inaccurate model can bias policy learning via generating misleading samples. However, learning an accurate model can be difficult since the policy is…

Machine Learning · Computer Science 2023-01-23 Zifan Wu , Chao Yu , Chen Chen , Jianye Hao , Hankz Hankui Zhuo

A Unified View on Solving Objective Mismatch in Model-Based Reinforcement Learning

Model-based Reinforcement Learning (MBRL) aims to make agents more sample-efficient, adaptive, and explainable by learning an explicit model of the environment. While the capabilities of MBRL agents have significantly improved in recent…

Machine Learning · Computer Science 2024-04-09 Ran Wei , Nathan Lambert , Anthony McDonald , Alfredo Garcia , Roberto Calandra

Learning the Reward Function for a Misspecified Model

In model-based reinforcement learning it is typical to decouple the problems of learning the dynamics model and learning the reward function. However, when the dynamics model is flawed, it may generate erroneous states that would never…

Machine Learning · Computer Science 2018-06-12 Erik Talvitie

On the Importance of Hyperparameter Optimization for Model-based Reinforcement Learning

Model-based Reinforcement Learning (MBRL) is a promising framework for learning control in a data-efficient manner. MBRL algorithms can be fairly complex due to the separate dynamics modeling and the subsequent planning algorithm, and as a…

Machine Learning · Computer Science 2021-03-01 Baohe Zhang , Raghu Rajan , Luis Pineda , Nathan Lambert , André Biedenkapp , Kurtland Chua , Frank Hutter , Roberto Calandra

Acting upon Imagination: when to trust imagined trajectories in model based reinforcement learning

Model-based reinforcement learning (MBRL) aims to learn model(s) of the environment dynamics that can predict the outcome of its actions. Forward application of the model yields so called imagined trajectories (sequences of action,…

Artificial Intelligence · Computer Science 2024-07-31 Adrian Remonda , Eduardo Veas , Granit Luzhnica

Investigating Compounding Prediction Errors in Learned Dynamics Models

Accurately predicting the consequences of agents' actions is a key prerequisite for planning in robotic control. Model-based reinforcement learning (MBRL) is one paradigm which relies on the iterative learning and prediction of state-action…

Machine Learning · Computer Science 2022-03-21 Nathan Lambert , Kristofer Pister , Roberto Calandra

Mitigating LLM Hallucination via Behaviorally Calibrated Reinforcement Learning

LLM deployment in critical domains is currently impeded by persistent hallucinations--generating plausible but factually incorrect assertions. While scaling laws drove significant improvements in general capabilities, theoretical frameworks…

Machine Learning · Computer Science 2026-01-29 Jiayun Wu , Jiashuo Liu , Zhiyuan Zeng , Tianyang Zhan , Tianle Cai , Wenhao Huang

When to Update Your Model: Constrained Model-based Reinforcement Learning

Designing and analyzing model-based RL (MBRL) algorithms with guaranteed monotonic improvement has been challenging, mainly due to the interdependence between policy optimization and model learning. Existing discrepancy bounds generally…

Machine Learning · Computer Science 2023-11-09 Tianying Ji , Yu Luo , Fuchun Sun , Mingxuan Jing , Fengxiang He , Wenbing Huang

Planning with Expectation Models for Control

In model-based reinforcement learning (MBRL), Wan et al. (2019) showed conditions under which the environment model could produce the expectation of the next feature vector rather than the full distribution, or a sample thereof, with no…

Artificial Intelligence · Computer Science 2021-04-20 Katya Kudashkina , Yi Wan , Abhishek Naik , Richard S. Sutton

Model Imitation for Model-Based Reinforcement Learning

Model-based reinforcement learning (MBRL) aims to learn a dynamic model to reduce the number of interactions with real-world environments. However, due to estimation error, rollouts in the learned model, especially those of long horizons,…

Machine Learning · Computer Science 2020-03-17 Yueh-Hua Wu , Ting-Han Fan , Peter J. Ramadge , Hao Su

Model-based Lookahead Reinforcement Learning

Model-based Reinforcement Learning (MBRL) allows data-efficient learning which is required in real world applications such as robotics. However, despite the impressive data-efficiency, MBRL does not achieve the final performance of…

Machine Learning · Computer Science 2019-08-19 Zhang-Wei Hong , Joni Pajarinen , Jan Peters

Physics-informed Dyna-Style Model-Based Deep Reinforcement Learning for Dynamic Control

Model-based reinforcement learning (MBRL) is believed to have much higher sample efficiency compared to model-free algorithms by learning a predictive model of the environment. However, the performance of MBRL highly relies on the quality…

Machine Learning · Computer Science 2022-11-16 Xin-Yang Liu , Jian-Xun Wang

DreamerPro: Reconstruction-Free Model-Based Reinforcement Learning with Prototypical Representations

Top-performing Model-Based Reinforcement Learning (MBRL) agents, such as Dreamer, learn the world model by reconstructing the image observations. Hence, they often fail to discard task-irrelevant details and struggle to handle visual…

Machine Learning · Computer Science 2021-10-28 Fei Deng , Ingook Jang , Sungjin Ahn

Can Large Reasoning Models Self-Train?

Recent successes of reinforcement learning (RL) in training large reasoning models motivate the question of whether self-training - the process where a model learns from its own judgments - can be sustained within RL. In this work, we study…

Machine Learning · Computer Science 2025-10-10 Sheikh Shafayat , Fahim Tajwar , Ruslan Salakhutdinov , Jeff Schneider , Andrea Zanette

Benchmarking Model-Based Reinforcement Learning

Model-based reinforcement learning (MBRL) is widely seen as having the potential to be significantly more sample efficient than model-free RL. However, research in model-based RL has not been very standardized. It is fairly common for…

Machine Learning · Computer Science 2019-07-04 Tingwu Wang , Xuchan Bao , Ignasi Clavera , Jerrick Hoang , Yeming Wen , Eric Langlois , Shunshi Zhang , Guodong Zhang , Pieter Abbeel , Jimmy Ba

Model predictive control-based value estimation for efficient reinforcement learning

Reinforcement learning suffers from limitations in real practices primarily due to the number of required interactions with virtual environments. It results in a challenging problem because we are implausible to obtain a local optimal…

Machine Learning · Computer Science 2024-10-28 Qizhen Wu , Kexin Liu , Lei Chen