Related papers: Learning Retrospective Knowledge with Reverse Rein…

Learning predictive representations in autonomous driving to improve deep reinforcement learning

Reinforcement learning using a novel predictive representation is applied to autonomous driving to accomplish the task of driving between lane markings where substantial benefits in performance and generalization are observed on unseen test…

Machine Learning · Computer Science 2020-06-29 Daniel Graves , Nhat M. Nguyen , Kimia Hassanzadeh , Jun Jin

Offline Learning of Counterfactual Predictions for Real-World Robotic Reinforcement Learning

We consider real-world reinforcement learning (RL) of robotic manipulation tasks that involve both visuomotor skills and contact-rich skills. We aim to train a policy that maps multimodal sensory observations (vision and force) to a…

Robotics · Computer Science 2022-03-01 Jun Jin , Daniel Graves , Cameron Haigh , Jun Luo , Martin Jagersand

Using General Value Functions to Learn Domain-Backed Inventory Management Policies

We consider the inventory management problem, where the goal is to balance conflicting objectives such as availability and wastage of a large range of products in a store. We propose a reinforcement learning (RL) approach that utilises…

Machine Learning · Computer Science 2023-11-07 Durgesh Kalwar , Omkar Shelke , Harshad Khadilkar

Generalized Hindsight for Reinforcement Learning

One of the key reasons for the high sample complexity in reinforcement learning (RL) is the inability to transfer knowledge from one task to another. In standard multi-task RL settings, low-reward data collected while trying to solve one…

Machine Learning · Computer Science 2020-02-27 Alexander C. Li , Lerrel Pinto , Pieter Abbeel

Affordance as general value function: A computational model

General value functions (GVFs) in the reinforcement learning (RL) literature are long-term predictive summaries of the outcomes of agents following specific policies in the environment. Affordances as perceived action possibilities with…

Artificial Intelligence · Computer Science 2021-05-11 Daniel Graves , Johannes Günther , Jun Luo

Discovery of Useful Questions as Auxiliary Tasks

Arguably, intelligent agents ought to be able to discover their own questions so that in learning answers for them they learn unanticipated useful knowledge and skills; this departs from the focus in much of machine learning on agents…

Artificial Intelligence · Computer Science 2019-09-11 Vivek Veeriah , Matteo Hessel , Zhongwen Xu , Richard Lewis , Janarthanan Rajendran , Junhyuk Oh , Hado van Hasselt , David Silver , Satinder Singh

Discovering Object-Centric Generalized Value Functions From Pixels

Deep Reinforcement Learning has shown significant progress in extracting useful representations from high-dimensional inputs albeit using hand-crafted auxiliary tasks and pseudo rewards. Automatically learning such representations in an…

Machine Learning · Computer Science 2023-06-28 Somjit Nath , Gopeshh Raaj Subbaraj , Khimya Khetarpal , Samira Ebrahimi Kahou

Versatile Inverse Reinforcement Learning via Cumulative Rewards

Inverse Reinforcement Learning infers a reward function from expert demonstrations, aiming to encode the behavior and intentions of the expert. Current approaches usually do this with generative and uni-modal models, meaning that they…

Machine Learning · Computer Science 2021-11-16 Niklas Freymuth , Philipp Becker , Gerhard Neumann

Learning State Representations from Random Deep Action-conditional Predictions

Our main contribution in this work is an empirical finding that random General Value Functions (GVFs), i.e., deep action-conditional predictions -- random both in what feature of observations they predict as well as in the sequence of…

Machine Learning · Computer Science 2021-11-09 Zeyu Zheng , Vivek Veeriah , Risto Vuorio , Richard Lewis , Satinder Singh

Reinforcement Learning with Knowledge Representation and Reasoning: A Brief Survey

Reinforcement Learning (RL) has achieved tremendous development in recent years, but still faces significant obstacles in addressing complex real-life problems due to the issues of poor system generalization, low sample efficiency as well…

Artificial Intelligence · Computer Science 2025-02-25 Chao Yu , Shicheng Ye , Hankz Hankui Zhuo

Meta-Inverse Reinforcement Learning with Probabilistic Context Variables

Providing a suitable reward function to reinforcement learning can be difficult in many real world applications. While inverse reinforcement learning (IRL) holds promise for automatically learning reward functions from demonstrations,…

Machine Learning · Computer Science 2019-10-29 Lantao Yu , Tianhe Yu , Chelsea Finn , Stefano Ermon

Learning a Prior over Intent via Meta-Inverse Reinforcement Learning

A significant challenge for the practical application of reinforcement learning in the real world is the need to specify an oracle reward function that correctly defines a task. Inverse reinforcement learning (IRL) seeks to avoid this…

Machine Learning · Computer Science 2019-10-16 Kelvin Xu , Ellis Ratner , Anca Dragan , Sergey Levine , Chelsea Finn

Reinforcement Learning to Solve NP-hard Problems: an Application to the CVRP

In this paper, we evaluate the use of Reinforcement Learning (RL) to solve a classic combinatorial optimization problem: the Capacitated Vehicle Routing Problem (CVRP). We formalize this problem in the RL framework and compare two of the…

Artificial Intelligence · Computer Science 2022-01-17 Leo Ardon

Outcome-based Reinforcement Learning to Predict the Future

Reinforcement Learning with Verifiable Rewards (RLVR) has been an effective approach for improving Large Language Models' reasoning in domains such as coding and mathematics. Here, we apply RLVR methods towards forecasting future real-world…

Machine Learning · Computer Science 2025-12-02 Benjamin Turtel , Danny Franklin , Kris Skotheim , Luke Hewitt , Philipp Schoenegger

Reinforcement Learning via Recurrent Convolutional Neural Networks

Deep Reinforcement Learning has enabled the learning of policies for complex tasks in partially observable environments, without explicitly learning the underlying model of the tasks. While such model-free methods achieve considerable…

Machine Learning · Computer Science 2017-01-11 Tanmay Shankar , Santosha K. Dwivedy , Prithwijit Guha

Inverse Reinforcement Learning via Deep Gaussian Process

We propose a new approach to inverse reinforcement learning (IRL) based on the deep Gaussian process (deep GP) model, which is capable of learning complicated reward structures with few demonstrations. Our model stacks multiple latent GP…

Machine Learning · Computer Science 2017-05-08 Ming Jin , Andreas Damianou , Pieter Abbeel , Costas Spanos

Accelerating Learning in Constructive Predictive Frameworks with the Successor Representation

Here we propose using the successor representation (SR) to accelerate learning in a constructive knowledge system based on general value functions (GVFs). In real-world settings like robotics for unstructured and dynamic environments, it is…

Machine Learning · Computer Science 2018-03-28 Craig Sherstan , Marlos C. Machado , Patrick M. Pilarski

Reverse Forward Curriculum Learning for Extreme Sample and Demonstration Efficiency in Reinforcement Learning

Reinforcement learning (RL) presents a promising framework to learn policies through environment interaction, but often requires an infeasible amount of interaction data to solve complex tasks from sparse rewards. One direction includes…

Machine Learning · Computer Science 2024-05-07 Stone Tao , Arth Shukla , Tse-kai Chan , Hao Su

Machine Teaching for Inverse Reinforcement Learning: Algorithms and Applications

Inverse reinforcement learning (IRL) infers a reward function from demonstrations, allowing for policy improvement and generalization. However, despite much recent interest in IRL, little work has been done to understand the minimum set of…

Machine Learning · Computer Science 2019-08-19 Daniel S. Brown , Scott Niekum

Generative Adversarial Reward Learning for Generalized Behavior Tendency Inference

Recent advances in reinforcement learning have inspired increasing interest in learning user modeling adaptively through dynamic interactions, e.g., in reinforcement learning based recommender systems. Reward function is crucial for most of…

Machine Learning · Computer Science 2021-05-06 Xiaocong Chen , Lina Yao , Xianzhi Wang , Aixin Sun , Wenjie Zhang , Quan Z. Sheng