Related papers: Hierarchical Universal Value Function Approximator…

Universal Successor Features Approximators

The ability of a reinforcement learning (RL) agent to learn about many reward functions at the same time has many potential benefits, such as the decomposition of complex tasks into simpler ones, the exchange of information between tasks,…

Machine Learning · Computer Science 2018-12-20 Diana Borsa, André Barreto, John Quan, Daniel Mankowitz, Rémi Munos, Hado van Hasselt, David Silver, Tom Schaul

Mapping State Space using Landmarks for Universal Goal Reaching

An agent that has well understood the environment should be able to apply its skills for any given goals, leading to the fundamental problem of learning the Universal Value Function Approximator (UVFA). A UVFA learns to predict the…

Machine Learning · Computer Science 2019-08-16 Zhiao Huang, Fangchen Liu, Hao Su

What About Inputing Policy in Value Function: Policy Representation and Policy-extended Value Function Approximator

We study Policy-extended Value Function Approximator (PeVFA) in Reinforcement Learning (RL), which extends conventional value function approximator (VFA) to take as input not only the state (and action) but also an explicit policy…

Machine Learning · Computer Science 2021-12-16 Hongyao Tang, Zhaopeng Meng, Jianye Hao, Chen Chen, Daniel Graves, Dong Li, Changmin Yu, Hangyu Mao, Wulong Liu, Yaodong Yang, Wenyuan Tao, Li Wang

A Geometric Perspective on Optimal Representations for Reinforcement Learning

We propose a new perspective on representation learning in reinforcement learning based on geometric properties of the space of value functions. We leverage this perspective to provide formal evidence regarding the usefulness of value…

Machine Learning · Computer Science 2019-06-27 Marc G. Bellemare, Will Dabney, Robert Dadashi, Adrien Ali Taiga, Pablo Samuel Castro, Nicolas Le Roux, Dale Schuurmans, Tor Lattimore, Clare Lyle

Universal Successor Representations for Transfer Reinforcement Learning

The objective of transfer reinforcement learning is to generalize from a set of previous tasks to unseen new tasks. In this work, we focus on the transfer scenario where the dynamics among tasks are the same, but their goals differ.…

Artificial Intelligence · Computer Science 2018-04-12 Chen Ma, Junfeng Wen, Yoshua Bengio

Hierarchical Reinforcement Learning with Hindsight

Reinforcement Learning (RL) algorithms can suffer from poor sample efficiency when rewards are delayed and sparse. We introduce a solution that enables agents to learn temporally extended actions at multiple levels of abstraction in a…

Machine Learning · Computer Science 2019-03-11 Andrew Levy, Robert Platt, Kate Saenko

World Value Functions: Knowledge Representation for Learning and Planning

We propose world value functions (WVFs), a type of goal-oriented general value function that represents how to solve not just a given task, but any other goal-reaching task in an agent's environment. This is achieved by equipping an agent…

Artificial Intelligence · Computer Science 2022-06-27 Geraud Nangue Tasse, Benjamin Rosman, Steven James

Matching options to tasks using Option-Indexed Hierarchical Reinforcement Learning

The options framework in Hierarchical Reinforcement Learning breaks down overall goals into a combination of options or simpler tasks and associated policies, allowing for abstraction in the action space. Ideally, these options can be…

Machine Learning · Computer Science 2022-06-14 Kushal Chauhan, Soumya Chatterjee, Akash Reddy, Balaraman Ravindran, Pradeep Shenoy

Affordance as general value function: A computational model

General value functions (GVFs) in the reinforcement learning (RL) literature are long-term predictive summaries of the outcomes of agents following specific policies in the environment. Affordances as perceived action possibilities with…

Artificial Intelligence · Computer Science 2021-05-11 Daniel Graves, Johannes Günther, Jun Luo

Universal Value Density Estimation for Imitation Learning and Goal-Conditioned Reinforcement Learning

This work considers two distinct settings: imitation learning and goal-conditioned reinforcement learning. In either case, effective solutions require the agent to reliably reach a specified state (a goal), or set of states (a…

Machine Learning · Computer Science 2020-02-18 Yannick Schroecker, Charles Isbell

Hierarchical Reinforcement Learning with Abductive Planning

One of the key challenges in applying reinforcement learning to real-life problems is that the amount of trial-and-error required to learn a good policy increases drastically as the task becomes complex. One potential solution to this…

Machine Learning · Computer Science 2018-06-29 Kazeto Yamamoto, Takashi Onishi, Yoshimasa Tsuruoka

On Convergence Rate of Adaptive Multiscale Value Function Approximation For Reinforcement Learning

In this paper, we propose a generic framework for devising an adaptive approximation scheme for value function approximation in reinforcement learning, which introduces multiscale approximation. The two basic ingredients are multiresolution…

Machine Learning · Computer Science 2019-08-26 Tao Li, Quanyan Zhu

Unsupervised Basis Function Adaptation for Reinforcement Learning

When using reinforcement learning (RL) algorithms it is common, given a large state space, to introduce some form of approximation architecture for the value function (VF). The exact form of this architecture can have a significant effect…

Machine Learning · Computer Science 2019-02-19 Edward Barker, Charl Ras

Abstract Value Iteration for Hierarchical Reinforcement Learning

We propose a novel hierarchical reinforcement learning framework for control with continuous state and action spaces. In our framework, the user specifies subgoal regions which are subsets of states; then, we (i) learn options that serve as…

Machine Learning · Computer Science 2021-02-26 Kishor Jothimurugan, Osbert Bastani, Rajeev Alur

Reinforcement Learning with General Value Function Approximation: Provably Efficient Approach via Bounded Eluder Dimension

Value function approximation has demonstrated phenomenal empirical success in reinforcement learning (RL). Nevertheless, despite some recent progress on developing theory for RL with linear function approximation, the understanding…

Machine Learning · Computer Science 2020-06-22 Ruosong Wang, Ruslan Salakhutdinov, Lin F. Yang

Recurrent Value Functions

Despite recent successes in Reinforcement Learning, value-based methods often suffer from high variance hindering performance. In this paper, we illustrate this in a continuous control setting where state of the art methods perform poorly…

Machine Learning · Computer Science 2019-05-24 Pierre Thodoroff, Nishanth Anand, Lucas Caccia, Doina Precup, Joelle Pineau

Representation Learning on Graphs: A Reinforcement Learning Application

In this work, we study value function approximation in reinforcement learning (RL) problems with high dimensional state or action spaces via a generalized version of representation policy iteration (RPI). We consider the limitations of…

Machine Learning · Computer Science 2019-01-18 Sephora Madjiheurem, Laura Toni

Reinforcement Learning with Anticipation: A Hierarchical Approach for Long-Horizon Tasks

Solving long-horizon goal-conditioned tasks remains a significant challenge in reinforcement learning (RL). Hierarchical reinforcement learning (HRL) addresses this by decomposing tasks into more manageable sub-tasks, but the automatic…

Machine Learning · Computer Science 2025-09-09 Yang Yu

Universal Policies to Learn Them All

We explore a collaborative and cooperative multi-agent reinforcement learning setting where a team of reinforcement learning agents attempt to solve a single cooperative task in a multi-scenario setting. We propose a novel multi-agent…

Multiagent Systems · Computer Science 2019-08-27 Hassam Ullah Sheikh, Ladislau Bölöni

World Value Functions: Knowledge Representation for Multitask Reinforcement Learning

An open problem in artificial intelligence is how to learn and represent knowledge that is sufficient for a general agent that needs to solve multiple tasks in a given world. In this work we propose world value functions (WVFs), which are a…

Machine Learning · Computer Science 2022-05-19 Geraud Nangue Tasse, Steven James, Benjamin Rosman