Related papers: Revisiting Model-based Value Expansion

Diminishing Return of Value Expansion Methods in Model-Based Reinforcement Learning

Model-based reinforcement learning is one approach to increase sample efficiency. However, the accuracy of the dynamics model and the resulting compounding error over modelled trajectories are commonly regarded as key limitations. A natural…

Machine Learning · Computer Science 2023-03-08 Daniel Palenicek , Michael Lutter , Joao Carvalho , Jan Peters

Model-Based Value Estimation for Efficient Model-Free Reinforcement Learning

Recent model-free reinforcement learning algorithms have proposed incorporating learned dynamics models as a source of additional data with the intention of reducing sample complexity. Such methods hold the promise of incorporating imagined…

Machine Learning · Computer Science 2018-03-02 Vladimir Feinberg , Alvin Wan , Ion Stoica , Michael I. Jordan , Joseph E. Gonzalez , Sergey Levine

Diminishing Return of Value Expansion Methods

Model-based reinforcement learning aims to increase sample efficiency, but the accuracy of dynamics models and the resulting compounding errors are often seen as key limitations. This paper empirically investigates potential sample…

Machine Learning · Computer Science 2024-12-31 Daniel Palenicek , Michael Lutter , João Carvalho , Daniel Dennert , Faran Ahmad , Jan Peters

On the Limited Representational Power of Value Functions and its Links to Statistical (In)Efficiency

Identifying the trade-offs between model-based and model-free methods is a central question in reinforcement learning. Value-based methods offer substantial computational advantages and are sometimes just as statistically efficient as…

Machine Learning · Computer Science 2024-03-13 David Cheikhi , Daniel Russo

Model predictive control-based value estimation for efficient reinforcement learning

Reinforcement learning suffers from limitations in real practices primarily due to the number of required interactions with virtual environments. It results in a challenging problem because we are implausible to obtain a local optimal…

Machine Learning · Computer Science 2024-10-28 Qizhen Wu , Kexin Liu , Lei Chen

Combating the Compounding-Error Problem with a Multi-step Model

Model-based reinforcement learning is an appealing framework for creating agents that learn, plan, and act in sequential environments. Model-based algorithms typically involve learning a transition model that takes a state and an action and…

Machine Learning · Computer Science 2019-06-03 Kavosh Asadi , Dipendra Misra , Seungchan Kim , Michel L. Littman

Model-Advantage and Value-Aware Models for Model-Based Reinforcement Learning: Bridging the Gap in Theory and Practice

This work shows that value-aware model learning, known for its numerous theoretical benefits, is also practically viable for solving challenging continuous control tasks in prevalent model-based reinforcement learning algorithms. First, we…

Machine Learning · Computer Science 2022-01-31 Nirbhay Modhe , Harish Kamath , Dhruv Batra , Ashwin Kalyan

Deep Generative Models for Decision-Making and Control

Deep model-based reinforcement learning methods offer a conceptually simple approach to the decision-making and control problem: use learning for the purpose of estimating an approximate dynamics model, and offload the rest of the work to…

Machine Learning · Computer Science 2023-07-13 Michael Janner

Sample-Efficient Reinforcement Learning with Stochastic Ensemble Value Expansion

Integrating model-free and model-based approaches in reinforcement learning has the potential to achieve the high performance of model-free algorithms with low sample complexity. However, this is difficult because an imperfect dynamics…

Machine Learning · Computer Science 2019-06-10 Jacob Buckman , Danijar Hafner , George Tucker , Eugene Brevdo , Honglak Lee

Deep Incremental Model Informed Reinforcement Learning for Continuous Robotic Control

Model-based reinforcement learning attempts to use an available or learned model to improve the data efficiency of reinforcement learning. This work proposes a one-step lookback approach that jointly learns the deep incremental model and…

Robotics · Computer Science 2025-02-28 Cong Li

Dynamic Horizon Value Estimation for Model-based Reinforcement Learning

Existing model-based value expansion methods typically leverage a world model for value estimation with a fixed rollout horizon to assist policy learning. However, the fixed rollout with an inaccurate model has a potential to harm the…

Machine Learning · Computer Science 2020-09-22 Junjie Wang , Qichao Zhang , Dongbin Zhao , Mengchen Zhao , Jianye Hao

Reinforcement Learning by Value Gradients

The concept of the value-gradient is introduced and developed in the context of reinforcement learning. It is shown that by learning the value-gradients exploration or stochastic behaviour is no longer needed to find locally optimal…

Neural and Evolutionary Computing · Computer Science 2008-03-26 Michael Fairbank

Model-based Reinforcement Learning with Multi-step Plan Value Estimation

A promising way to improve the sample efficiency of reinforcement learning is model-based methods, in which many explorations and evaluations can happen in the learned models to save real-world samples. However, when the learned model has a…

Machine Learning · Computer Science 2022-09-14 Haoxin Lin , Yihao Sun , Jiaji Zhang , Yang Yu

Model-Based Reinforcement Learning for Control under Time-Varying Dynamics

Learning-based control methods typically assume stationary system dynamics, an assumption often violated in real-world systems due to drift, wear, or changing operating conditions. We study reinforcement learning for control under…

Machine Learning · Computer Science 2026-04-03 Klemens Iten , Bruce Lee , Chenhao Li , Lenart Treven , Andreas Krause , Bhavya Sukhija

Value-Based Reinforcement Learning for Continuous Control Robotic Manipulation in Multi-Task Sparse Reward Settings

Learning continuous control in high-dimensional sparse reward settings, such as robotic manipulation, is a challenging problem due to the number of samples often required to obtain accurate optimal value and policy estimates. While many…

Robotics · Computer Science 2021-07-29 Sreehari Rammohan , Shangqun Yu , Bowen He , Eric Hsiung , Eric Rosen , Stefanie Tellex , George Konidaris

Modeling Survival in model-based Reinforcement Learning

Although recent model-free reinforcement learning algorithms have been shown to be capable of mastering complicated decision-making tasks, the sample complexity of these methods has remained a hurdle to utilizing them in many real-world…

Machine Learning · Computer Science 2020-04-21 Saeed Moazami , Peggy Doerschuk

High-Accuracy Model-Based Reinforcement Learning, a Survey

Deep reinforcement learning has shown remarkable success in the past few years. Highly complex sequential decision making problems from game playing and robotics have been solved with deep model-free methods. Unfortunately, the sample…

Machine Learning · Computer Science 2021-07-20 Aske Plaat , Walter Kosters , Mike Preuss

Model-based deep reinforcement learning for accelerated learning from flow simulations

In recent years, deep reinforcement learning has emerged as a technique to solve closed-loop flow control problems. Employing simulation-based environments in reinforcement learning enables a priori end-to-end optimization of the control…

Fluid Dynamics · Physics 2024-04-11 Andre Weiner , Janis Geise

Is there Value in Reinforcement Learning?

Action-values play a central role in popular Reinforcement Learing (RL) models of behavior. Yet, the idea that action-values are explicitly represented has been extensively debated. Critics had therefore repeatedly suggested that…

Artificial Intelligence · Computer Science 2025-05-09 Lior Fox , Yonatan Loewenstein

On the model-based stochastic value gradient for continuous reinforcement learning

For over a decade, model-based reinforcement learning has been seen as a way to leverage control-based domain knowledge to improve the sample-efficiency of reinforcement learning agents. While model-based agents are conceptually appealing,…

Machine Learning · Computer Science 2021-05-28 Brandon Amos , Samuel Stanton , Denis Yarats , Andrew Gordon Wilson