Related papers: Value Gradient weighted Model-Based Reinforcement …

Discriminator Augmented Model-Based Reinforcement Learning

By planning through a learned dynamics model, model-based reinforcement learning (MBRL) offers the prospect of good performance with little environment interaction. However, it is common in practice for the learned model to be inaccurate,…

Machine Learning · Computer Science 2021-03-31 Behzad Haghgoo, Allan Zhou, Archit Sharma, Chelsea Finn

Calibrated Value-Aware Model Learning with Probabilistic Environment Models

The idea of value-aware model learning, that models should produce accurate value estimates, has gained prominence in model-based reinforcement learning. The MuZero loss, which penalizes a model's value function prediction compared to the…

Machine Learning · Computer Science 2025-06-10 Claas Voelcker, Anastasiia Pedan, Arash Ahmadian, Romina Abachi, Igor Gilitschenski, Amir-massoud Farahmand

Model-based Lookahead Reinforcement Learning

Model-based Reinforcement Learning (MBRL) allows data-efficient learning which is required in real world applications such as robotics. However, despite the impressive data-efficiency, MBRL does not achieve the final performance of…

Machine Learning · Computer Science 2019-08-19 Zhang-Wei Hong, Joni Pajarinen, Jan Peters

Policy-Aware Model Learning for Policy Gradient Methods

This paper considers the problem of learning a model in model-based reinforcement learning (MBRL). We examine how the planning module of an MBRL algorithm uses the model, and propose that the model learning module should incorporate the way…

Artificial Intelligence · Computer Science 2021-01-05 Romina Abachi, Mohammad Ghavamzadeh, Amir-massoud Farahmand

Model-based Reinforcement Learning with Multi-step Plan Value Estimation

A promising way to improve the sample efficiency of reinforcement learning is model-based methods, in which many explorations and evaluations can happen in the learned models to save real-world samples. However, when the learned model has a…

Machine Learning · Computer Science 2022-09-14 Haoxin Lin, Yihao Sun, Jiaji Zhang, Yang Yu

A Survey on Model-based Reinforcement Learning

Reinforcement learning (RL) solves sequential decision-making problems via a trial-and-error process interacting with the environment. While RL achieves outstanding success in playing complex video games that allow huge trial-and-error,…

Machine Learning · Computer Science 2022-06-22 Fan-Ming Luo, Tian Xu, Hang Lai, Xiong-Hui Chen, Weinan Zhang, Yang Yu

Variance Reduced Policy Gradient Method for Multi-Objective Reinforcement Learning

Multi-Objective Reinforcement Learning (MORL) is a generalization of traditional Reinforcement Learning (RL) that aims to optimize multiple, often conflicting objectives simultaneously rather than focusing on a single reward. This approach…

Machine Learning · Computer Science 2025-08-15 Davide Guidobene, Lorenzo Benedetti, Diego Arapovic

Policy-shaped prediction: avoiding distractions in model-based reinforcement learning

Model-based reinforcement learning (MBRL) is a promising route to sample-efficient policy optimization. However, a known vulnerability of reconstruction-based MBRL consists of scenarios in which detailed aspects of the world are highly…

Machine Learning · Computer Science 2024-12-10 Miles Hutson, Isaac Kauvar, Nick Haber

Equivalence Between Wasserstein and Value-Aware Loss for Model-based Reinforcement Learning

Learning a generative model is a key component of model-based reinforcement learning. Though learning a good model in the tabular setting is a simple task, learning a useful model in the approximate setting is challenging. In this context,…

Machine Learning · Computer Science 2018-07-10 Kavosh Asadi, Evan Cater, Dipendra Misra, Michael L. Littman

TOM: Learning Policy-Aware Models for Model-Based Reinforcement Learning via Transition Occupancy Matching

Standard model-based reinforcement learning (MBRL) approaches fit a transition model of the environment to all past experience, but this wastes model capacity on data that is irrelevant for policy improvement. We instead propose a new…

Machine Learning · Computer Science 2023-05-23 Yecheng Jason Ma, Kausik Sivakumar, Jason Yan, Osbert Bastani, Dinesh Jayaraman

Variational Empowerment as Representation Learning for Goal-Based Reinforcement Learning

Learning to reach goal states and learning diverse skills through mutual information (MI) maximization have been proposed as principled frameworks for self-supervised reinforcement learning, allowing agents to acquire broadly applicable…

Machine Learning · Computer Science 2021-06-04 Jongwook Choi, Archit Sharma, Honglak Lee, Sergey Levine, Shixiang Shane Gu

Robust Model Based Reinforcement Learning Using $\mathcal{L}_1$ Adaptive Control

We introduce $\mathcal{L}_1$-MBRL, a control-theoretic augmentation scheme for Model-Based Reinforcement Learning (MBRL) algorithms. Unlike model-free approaches, MBRL algorithms learn a model of the transition function using data and use…

Systems and Control · Electrical Eng. & Systems 2024-03-25 Minjun Sung, Sambhu H. Karumanchi, Aditya Gahlawat, Naira Hovakimyan

Stock Trading Optimization through Model-based Reinforcement Learning with Resistance Support Relative Strength

Reinforcement learning (RL) is gaining attention by more and more researchers in quantitative finance as the agent-environment interaction framework is aligned with decision making process in many business problems. Most of the current…

Mathematical Finance · Quantitative Finance 2022-05-31 Huifang Huang, Ting Gao, Yi Gui, Jin Guo, Peng Zhang

Multi-timestep models for Model-based Reinforcement Learning

In model-based reinforcement learning (MBRL), most algorithms rely on simulating trajectories from one-step dynamics models learned on data. A critical challenge of this approach is the compounding of one-step prediction errors as length of…

Machine Learning · Computer Science 2023-10-12 Abdelhakim Benechehab, Giuseppe Paolo, Albert Thomas, Maurizio Filippone, Balázs Kégl

Value Memory Graph: A Graph-Structured World Model for Offline Reinforcement Learning

Reinforcement Learning (RL) methods are typically applied directly in environments to learn policies. In some complex environments with continuous state-action spaces, sparse rewards, and/or long temporal horizons, learning a good policy in…

Machine Learning · Computer Science 2023-05-03 Deyao Zhu, Li Erran Li, Mohamed Elhoseiny

A Unified View on Solving Objective Mismatch in Model-Based Reinforcement Learning

Model-based Reinforcement Learning (MBRL) aims to make agents more sample-efficient, adaptive, and explainable by learning an explicit model of the environment. While the capabilities of MBRL agents have significantly improved in recent…

Machine Learning · Computer Science 2024-04-09 Ran Wei, Nathan Lambert, Anthony McDonald, Alfredo Garcia, Roberto Calandra

Policy-Driven World Model Adaptation for Robust Offline Model-based Reinforcement Learning

Offline reinforcement learning (RL) offers a powerful paradigm for data-driven control. Compared to model-free approaches, offline model-based RL (MBRL) explicitly learns a world model from a static dataset and uses it as a surrogate…

Machine Learning · Computer Science 2026-02-02 Jiayu Chen, Le Xu, Aravind Venugopal, Jeff Schneider

Vision-Language Models are Zero-Shot Reward Models for Reinforcement Learning

Reinforcement learning (RL) requires either manually specifying a reward function, which is often infeasible, or learning a reward model from a large amount of human feedback, which is often very expensive. We study a more sample-efficient…

Machine Learning · Computer Science 2024-03-15 Juan Rocamonde, Victoriano Montesinos, Elvis Nava, Ethan Perez, David Lindner

Variational Curriculum Reinforcement Learning for Unsupervised Discovery of Skills

Mutual information-based reinforcement learning (RL) has been proposed as a promising framework for retrieving complex skills autonomously without a task-oriented reward function through mutual information (MI) maximization or variational…

Machine Learning · Computer Science 2023-10-31 Seongun Kim, Kyowoon Lee, Jaesik Choi

Models, Pixels, and Rewards: Evaluating Design Trade-offs in Visual Model-Based Reinforcement Learning

Model-based reinforcement learning (MBRL) methods have shown strong sample efficiency and performance across a variety of tasks, including when faced with high-dimensional visual observations. These methods learn to predict the environment…

Machine Learning · Computer Science 2020-12-09 Mohammad Babaeizadeh, Mohammad Taghi Saffar, Danijar Hafner, Harini Kannan, Chelsea Finn, Sergey Levine, Dumitru Erhan