Related papers: World Value Functions: Knowledge Representation fo…

World Value Functions: Knowledge Representation for Learning and Planning

We propose world value functions (WVFs), a type of goal-oriented general value function that represents how to solve not just a given task, but any other goal-reaching task in an agent's environment. This is achieved by equipping an agent…

Artificial Intelligence · Computer Science 2022-06-27 Geraud Nangue Tasse , Benjamin Rosman , Steven James

Visual Reinforcement Learning with Imagined Goals

For an autonomous agent to fulfill a wide range of user-specified goals at test time, it must be able to learn broadly applicable and general-purpose skill repertoires. Furthermore, to provide the requisite level of generality, these skills…

Machine Learning · Computer Science 2018-12-05 Ashvin Nair , Vitchyr Pong , Murtaza Dalal , Shikhar Bahl , Steven Lin , Sergey Levine

Discovery of Useful Questions as Auxiliary Tasks

Arguably, intelligent agents ought to be able to discover their own questions so that in learning answers for them they learn unanticipated useful knowledge and skills; this departs from the focus in much of machine learning on agents…

Artificial Intelligence · Computer Science 2019-09-11 Vivek Veeriah , Matteo Hessel , Zhongwen Xu , Richard Lewis , Janarthanan Rajendran , Junhyuk Oh , Hado van Hasselt , David Silver , Satinder Singh

Introspective Agents: Confidence Measures for General Value Functions

Agents of general intelligence deployed in real-world scenarios must adapt to ever-changing environmental conditions. While such adaptive agents may leverage engineered knowledge, they will require the capacity to construct and evaluate…

Artificial Intelligence · Computer Science 2016-06-20 Craig Sherstan , Adam White , Marlos C. Machado , Patrick M. Pilarski

What's a Good Prediction? Challenges in evaluating an agent's knowledge

Constructing general knowledge by learning task-independent models of the world can help agents solve challenging problems. However, both constructing and evaluating such models remains an open challenge. The most common approaches to…

Artificial Intelligence · Computer Science 2021-04-15 Alex Kearney , Anna Koop , Patrick M. Pilarski

Representing Knowledge as Predictions (and State as Knowledge)

This paper shows how a single mechanism allows knowledge to be constructed layer by layer directly from an agent's raw sensorimotor stream. This mechanism, the General Value Function (GVF) or "forecast," captures high-level, abstract…

Artificial Intelligence · Computer Science 2021-12-14 Mark Ring

What Can Learned Intrinsic Rewards Capture?

The objective of a reinforcement learning agent is to behave so as to maximise the sum of a suitable scalar function of state: the reward. These rewards are typically given and immutable. In this paper, we instead consider the proposition…

Artificial Intelligence · Computer Science 2020-08-25 Zeyu Zheng , Junhyuk Oh , Matteo Hessel , Zhongwen Xu , Manuel Kroiss , Hado van Hasselt , David Silver , Satinder Singh

Value function estimation using conditional diffusion models for control

A fairly reliable trend in deep reinforcement learning is that the performance scales with the number of parameters, provided a complimentary scaling in amount of training data. As the appetite for large models increases, it is imperative…

Machine Learning · Computer Science 2023-06-14 Bogdan Mazoure , Walter Talbott , Miguel Angel Bautista , Devon Hjelm , Alexander Toshev , Josh Susskind

Multi Task Inverse Reinforcement Learning for Common Sense Reward

One of the challenges in applying reinforcement learning in a complex real-world environment lies in providing the agent with a sufficiently detailed reward function. Any misalignment between the reward and the desired behavior can result…

Machine Learning · Computer Science 2025-10-24 Neta Glazer , Aviv Navon , Aviv Shamsian , Ethan Fetaya

Training Value-Aligned Reinforcement Learning Agents Using a Normative Prior

As more machine learning agents interact with humans, it is increasingly a prospect that an agent trained to perform a task optimally, using only a measure of task performance as feedback, can violate societal norms for acceptable behavior…

Machine Learning · Computer Science 2021-04-20 Md Sultan Al Nahian , Spencer Frazier , Brent Harrison , Mark Riedl

What Should I Know? Using Meta-gradient Descent for Predictive Feature Discovery in a Single Stream of Experience

In computational reinforcement learning, a growing body of work seeks to construct an agent's perception of the world through predictions of future sensations; predictions about environment observations are used as additional input features…

Machine Learning · Computer Science 2022-06-15 Alexandra Kearney , Anna Koop , Johannes Günther , Patrick M. Pilarski

A Geometric Perspective on Optimal Representations for Reinforcement Learning

We propose a new perspective on representation learning in reinforcement learning based on geometric properties of the space of value functions. We leverage this perspective to provide formal evidence regarding the usefulness of value…

Machine Learning · Computer Science 2019-06-27 Marc G. Bellemare , Will Dabney , Robert Dadashi , Adrien Ali Taiga , Pablo Samuel Castro , Nicolas Le Roux , Dale Schuurmans , Tor Lattimore , Clare Lyle

Affordance as general value function: A computational model

General value functions (GVFs) in the reinforcement learning (RL) literature are long-term predictive summaries of the outcomes of agents following specific policies in the environment. Affordances as perceived action possibilities with…

Artificial Intelligence · Computer Science 2021-05-11 Daniel Graves , Johannes Günther , Jun Luo

Learning Human Rewards by Inferring Their Latent Intelligence Levels in Multi-Agent Games: A Theory-of-Mind Approach with Application to Driving Data

Reward function, as an incentive representation that recognizes humans' agency and rationalizes humans' actions, is particularly appealing for modeling human behavior in human-robot interaction. Inverse Reinforcement Learning is an…

Artificial Intelligence · Computer Science 2021-03-09 Ran Tian , Masayoshi Tomizuka , Liting Sun

Perceptual Reward Functions

Reinforcement learning problems are often described through rewards that indicate if an agent has completed some task. This specification can yield desirable behavior, however many problems are difficult to specify in this manner, as one…

Artificial Intelligence · Computer Science 2016-08-15 Ashley Edwards , Charles Isbell , Atsuo Takanishi

Learning State Representations from Random Deep Action-conditional Predictions

Our main contribution in this work is an empirical finding that random General Value Functions (GVFs), i.e., deep action-conditional predictions -- random both in what feature of observations they predict as well as in the sequence of…

Machine Learning · Computer Science 2021-11-09 Zeyu Zheng , Vivek Veeriah , Risto Vuorio , Richard Lewis , Satinder Singh

Recurrent Value Functions

Despite recent successes in Reinforcement Learning, value-based methods often suffer from high variance hindering performance. In this paper, we illustrate this in a continuous control setting where state of the art methods perform poorly…

Machine Learning · Computer Science 2019-05-24 Pierre Thodoroff , Nishanth Anand , Lucas Caccia , Doina Precup , Joelle Pineau

Automatic Goal Generation for Reinforcement Learning Agents

Reinforcement learning is a powerful technique to train an agent to perform a task. However, an agent that is trained using reinforcement learning is only capable of achieving the single task that is specified via its reward function. Such…

Machine Learning · Computer Science 2018-07-24 Carlos Florensa , David Held , Xinyang Geng , Pieter Abbeel

Lifelong Inverse Reinforcement Learning

Methods for learning from demonstration (LfD) have shown success in acquiring behavior policies by imitating a user. However, even for a single task, LfD may require numerous demonstrations. For versatile agents that must learn many tasks…

Machine Learning · Computer Science 2022-07-04 Jorge A. Mendez , Shashank Shivkumar , Eric Eaton

Discovering Object-Centric Generalized Value Functions From Pixels

Deep Reinforcement Learning has shown significant progress in extracting useful representations from high-dimensional inputs albeit using hand-crafted auxiliary tasks and pseudo rewards. Automatically learning such representations in an…

Machine Learning · Computer Science 2023-06-28 Somjit Nath , Gopeshh Raaj Subbaraj , Khimya Khetarpal , Samira Ebrahimi Kahou