Related papers: Value-driven Hindsight Modelling

Knowing the Past to Predict the Future: Reinforcement Virtual Learning

Reinforcement Learning (RL)-based control system has received considerable attention in recent decades. However, in many real-world problems, such as Batch Process Control, the environment is uncertain, which requires expensive interaction…

Machine Learning · Computer Science 2022-11-03 Peng Zhang , Yawen Huang , Bingzhang Hu , Shizheng Wang , Haoran Duan , Noura Al Moubayed , Yefeng Zheng , Yang Long

Perceptual Values from Observation

Imitation by observation is an approach for learning from expert demonstrations that lack action information, such as videos. Recent approaches to this problem can be placed into two broad categories: training dynamics models that aim to…

Machine Learning · Computer Science 2019-05-21 Ashley D. Edwards , Charles L. Isbell

Meta-Gradient Reinforcement Learning

The goal of reinforcement learning algorithms is to estimate and/or optimise the value function. However, unlike supervised learning, no teacher or oracle is available to provide the true value function. Instead, the majority of…

Machine Learning · Computer Science 2018-05-25 Zhongwen Xu , Hado van Hasselt , David Silver

Imagined Value Gradients: Model-Based Policy Optimization with Transferable Latent Dynamics Models

Humans are masters at quickly learning many complex tasks, relying on an approximate understanding of the dynamics of their environments. In much the same way, we would like our learning agents to quickly adapt to new tasks. In this paper,…

Robotics · Computer Science 2019-10-10 Arunkumar Byravan , Jost Tobias Springenberg , Abbas Abdolmaleki , Roland Hafner , Michael Neunert , Thomas Lampe , Noah Siegel , Nicolas Heess , Martin Riedmiller

The Value-Improvement Path: Towards Better Representations for Reinforcement Learning

In value-based reinforcement learning (RL), unlike in supervised learning, the agent faces not a single, stationary, approximation problem, but a sequence of value prediction problems. Each time the policy improves, the nature of the…

Machine Learning · Computer Science 2021-01-05 Will Dabney , André Barreto , Mark Rowland , Robert Dadashi , John Quan , Marc G. Bellemare , David Silver

RLInspect: An Interactive Visual Approach to Assess Reinforcement Learning Algorithm

Reinforcement Learning (RL) is a rapidly growing area of machine learning that finds its application in a broad range of domains, from finance and healthcare to robotics and gaming. Compared to other machine learning techniques, RL agents…

Artificial Intelligence · Computer Science 2024-11-14 Geetansh Kalra , Divye Singh , Justin Jose

Disentangling Dynamics and Returns: Value Function Decomposition with Future Prediction

Value functions are crucial for model-free Reinforcement Learning (RL) to obtain a policy implicitly or guide the policy updates. Value estimation heavily depends on the stochasticity of environmental dynamics and the quality of reward…

Machine Learning · Computer Science 2019-05-28 Hongyao Tang , Jianye Hao , Guangyong Chen , Pengfei Chen , Zhaopeng Meng , Yaodong Yang , Li Wang

Foresee then Evaluate: Decomposing Value Estimation with Latent Future Prediction

Value function is the central notion of Reinforcement Learning (RL). Value estimation, especially with function approximation, can be challenging since it involves the stochasticity of environmental dynamics and reward signals that can be…

Machine Learning · Computer Science 2021-03-04 Hongyao Tang , Jianye Hao , Guangyong Chen , Pengfei Chen , Chen Chen , Yaodong Yang , Luo Zhang , Wulong Liu , Zhaopeng Meng

Model predictive control-based value estimation for efficient reinforcement learning

Reinforcement learning suffers from limitations in real practices primarily due to the number of required interactions with virtual environments. It results in a challenging problem because we are implausible to obtain a local optimal…

Machine Learning · Computer Science 2024-10-28 Qizhen Wu , Kexin Liu , Lei Chen

Learning Representations in Model-Free Hierarchical Reinforcement Learning

Common approaches to Reinforcement Learning (RL) are seriously challenged by large-scale applications involving huge state spaces and sparse delayed reward feedback. Hierarchical Reinforcement Learning (HRL) methods attempt to address this…

Artificial Intelligence · Computer Science 2019-04-15 Jacob Rafati , David C. Noelle

Hierarchical Reinforcement Learning with Hindsight

Reinforcement Learning (RL) algorithms can suffer from poor sample efficiency when rewards are delayed and sparse. We introduce a solution that enables agents to learn temporally extended actions at multiple levels of abstraction in a…

Machine Learning · Computer Science 2019-03-11 Andrew Levy , Robert Platt , Kate Saenko

Reinforcement Learning with Lookahead Information

We study reinforcement learning (RL) problems in which agents observe the reward or transition realizations at their current state before deciding which action to take. Such observations are available in many applications, including…

Machine Learning · Computer Science 2024-10-22 Nadav Merlis

Model-Based Value Estimation for Efficient Model-Free Reinforcement Learning

Recent model-free reinforcement learning algorithms have proposed incorporating learned dynamics models as a source of additional data with the intention of reducing sample complexity. Such methods hold the promise of incorporating imagined…

Machine Learning · Computer Science 2018-03-02 Vladimir Feinberg , Alvin Wan , Ion Stoica , Michael I. Jordan , Joseph E. Gonzalez , Sergey Levine

Model-Based Reinforcement Learning with Multi-Task Offline Pretraining

Pretraining reinforcement learning (RL) models on offline datasets is a promising way to improve their training efficiency in online tasks, but challenging due to the inherent mismatch in dynamics and behaviors across various tasks. We…

Machine Learning · Computer Science 2024-06-06 Minting Pan , Yitao Zheng , Yunbo Wang , Xiaokang Yang

On the Limited Representational Power of Value Functions and its Links to Statistical (In)Efficiency

Identifying the trade-offs between model-based and model-free methods is a central question in reinforcement learning. Value-based methods offer substantial computational advantages and are sometimes just as statistically efficient as…

Machine Learning · Computer Science 2024-03-13 David Cheikhi , Daniel Russo

Meta-Inverse Reinforcement Learning with Probabilistic Context Variables

Providing a suitable reward function to reinforcement learning can be difficult in many real world applications. While inverse reinforcement learning (IRL) holds promise for automatically learning reward functions from demonstrations,…

Machine Learning · Computer Science 2019-10-29 Lantao Yu , Tianhe Yu , Chelsea Finn , Stefano Ermon

Value Driven Representation for Human-in-the-Loop Reinforcement Learning

Interactive adaptive systems powered by Reinforcement Learning (RL) have many potential applications, such as intelligent tutoring systems. In such systems there is typically an external human system designer that is creating, monitoring…

Artificial Intelligence · Computer Science 2020-04-06 Ramtin Keramati , Emma Brunskill

Reward Models in Deep Reinforcement Learning: A Survey

In reinforcement learning (RL), agents continually interact with the environment and use the feedback to refine their behavior. To guide policy optimization, reward models are introduced as proxies of the desired objectives, such that when…

Machine Learning · Computer Science 2025-06-19 Rui Yu , Shenghua Wan , Yucen Wang , Chen-Xiao Gao , Le Gan , Zongzhang Zhang , De-Chuan Zhan

Model-Based Policy Gradients with Parameter-Based Exploration by Least-Squares Conditional Density Estimation

The goal of reinforcement learning (RL) is to let an agent learn an optimal control policy in an unknown environment so that future expected rewards are maximized. The model-free RL approach directly learns the policy based on data samples.…

Machine Learning · Statistics 2013-07-22 Syogo Mori , Voot Tangkaratt , Tingting Zhao , Jun Morimoto , Masashi Sugiyama

Learning to Reweight Imaginary Transitions for Model-Based Reinforcement Learning

Model-based reinforcement learning (RL) is more sample efficient than model-free RL by using imaginary trajectories generated by the learned dynamics model. When the model is inaccurate or biased, imaginary trajectories may be deleterious…

Machine Learning · Computer Science 2021-04-12 Wenzhen Huang , Qiyue Yin , Junge Zhang , Kaiqi Huang