Related papers: Predecessor Features

Expected Eligibility Traces

The question of how to determine which states and actions are responsible for a certain outcome is known as the credit assignment problem and remains a central research question in reinforcement learning and artificial intelligence.…

Machine Learning · Computer Science 2021-02-09 Hado van Hasselt, Sephora Madjiheurem, Matteo Hessel, David Silver, André Barreto, Diana Borsa

Hindsight Credit Assignment

We consider the problem of efficient credit assignment in reinforcement learning. In order to efficiently and meaningfully utilize new data, we propose to explicitly assign credit to past decisions based on the likelihood of them having led…

Machine Learning · Computer Science 2019-12-06 Anna Harutyunyan, Will Dabney, Thomas Mesnard, Mohammad Azar, Bilal Piot, Nicolas Heess, Hado van Hasselt, Greg Wayne, Satinder Singh, Doina Precup, Remi Munos

One-shot learning and behavioral eligibility traces in sequential decision making

In many daily tasks we make multiple decisions before reaching a goal. In order to learn such sequences of decisions, a mechanism to link earlier actions to later reward is necessary. Reinforcement learning theory suggests two classes of…

Neurons and Cognition · Quantitative Biology 2019-11-13 Marco Lehmann, He Xu, Vasiliki Liakoni, Michael Herzog, Wulfram Gerstner, Kerstin Preuschoff

Towards Causal Credit Assignment

Adequately assigning credit to actions for future outcomes based on their contributions is a long-standing open challenge in Reinforcement Learning. The assumptions of the most commonly used credit assignment method are disadvantageous in…

Machine Learning · Computer Science 2023-05-18 Mátyás Schubert

Hindsight-DICE: Stable Credit Assignment for Deep Reinforcement Learning

Oftentimes, environments for sequential decision-making problems can be quite sparse in the provision of evaluative feedback to guide reinforcement-learning agents. In the extreme case, long trajectories of behavior are merely punctuated…

Machine Learning · Computer Science 2023-08-22 Akash Velu, Skanda Vaidyanath, Dilip Arumugam

Forethought and Hindsight in Credit Assignment

We address the problem of credit assignment in reinforcement learning and explore fundamental questions regarding the way in which an agent can best use additional computation to propagate new information, by planning with internal models…

Machine Learning · Computer Science 2020-10-27 Veronica Chelu, Doina Precup, Hado van Hasselt

Recall Traces: Backtracking Models for Efficient Reinforcement Learning

In many environments only a tiny subset of all states yield high reward. In these cases, few of the interactions with the environment provide a relevant learning signal. Hence, we may want to preferentially train on those high-reward states…

Machine Learning · Computer Science 2019-01-30 Anirudh Goyal, Philemon Brakel, William Fedus, Soumye Singhal, Timothy Lillicrap, Sergey Levine, Hugo Larochelle, Yoshua Bengio

Advantages and Limitations of using Successor Features for Transfer in Reinforcement Learning

One question central to Reinforcement Learning is how to learn a feature representation that supports algorithm scaling and re-use of learned information from different tasks. Successor Features approach this problem by learning a feature…

Artificial Intelligence · Computer Science 2017-08-02 Lucas Lehnert, Stefanie Tellex, Michael L. Littman

Counterfactual Credit Assignment in Model-Free Reinforcement Learning

Credit assignment in reinforcement learning is the problem of measuring an action's influence on future rewards. In particular, this requires separating skill from luck, i.e. disentangling the effect of an action on rewards from that of…

Machine Learning · Computer Science 2021-12-15 Thomas Mesnard, Théophane Weber, Fabio Viola, Shantanu Thakoor, Alaa Saade, Anna Harutyunyan, Will Dabney, Tom Stepleton, Nicolas Heess, Arthur Guez, Éric Moulines, Marcus Hutter, Lars Buesing, Rémi Munos

Backward Curriculum Reinforcement Learning

Current reinforcement learning algorithms train an agent using forward-generated trajectories, which provide little guidance so that the agent can explore as much as possible. While realizing the value of reinforcement learning results from…

Artificial Intelligence · Computer Science 2023-09-06 KyungMin Ko

Demystifying the Recency Heuristic in Temporal-Difference Learning

The recency heuristic in reinforcement learning is the assumption that stimuli that occurred closer in time to an acquired reward should be more heavily reinforced. The recency heuristic is one of the key assumptions made by TD($\lambda$),…

Machine Learning · Computer Science 2024-08-27 Brett Daley, Marlos C. Machado, Martha White

Continual Auxiliary Task Learning

Learning auxiliary tasks, such as multiple predictions about the world, can provide many benefits to reinforcement learning systems. A variety of off-policy learning algorithms have been developed to learn such predictions, but as yet there…

Machine Learning · Computer Science 2022-02-24 Matthew McLeod, Chunlok Lo, Matthew Schlegel, Andrew Jacobsen, Raksha Kumaraswamy, Martha White, Adam White

Reinforcement Learning in System Identification

System identification, also known as learning forward models, transfer functions, system dynamics, etc., has a long tradition both in science and engineering in different fields. Particularly, it is a recurring theme in Reinforcement…

Machine Learning · Computer Science 2022-12-15 Jose Antonio Martin H., Oscar Fernandez Vicente, Sergio Perez, Anas Belfadil, Cristina Ibanez-Llano, Freddy Jose Perozo Rondon, Jose Javier Valle, Javier Arechalde Pelaz

Accelerating Reinforcement Learning by Composing Solutions of Automatically Identified Subtasks

This paper discusses a system that accelerates reinforcement learning by using transfer from related tasks. Without such transfer, even if two tasks are very similar at some abstract level, an extensive re-learning effort is required. The…

Artificial Intelligence · Computer Science 2011-06-10 C. Drummond

Optimistic Linear Support and Successor Features as a Basis for Optimal Policy Transfer

In many real-world applications, reinforcement learning (RL) agents might have to solve multiple tasks, each one typically modeled via a reward function. If reward functions are expressed linearly, and the agent has previously learned a set…

Machine Learning · Computer Science 2022-06-24 Lucas N. Alegre, Ana L. C. Bazzan, Bruno C. da Silva

Transfer with Model Features in Reinforcement Learning

A key question in Reinforcement Learning is which representation an agent can learn to efficiently reuse knowledge between different tasks. Recently the Successor Representation was shown to have empirical benefits for transferring…

Machine Learning · Computer Science 2018-07-06 Lucas Lehnert, Michael L. Littman

Sparse Attentive Backtracking: Temporal Credit Assignment Through Reminding

Learning long-term dependencies in extended temporal sequences requires credit assignment to events far back in the past. The most common method for training recurrent neural networks, back-propagation through time (BPTT), requires credit…

Machine Learning · Computer Science 2018-09-12 Nan Rosemary Ke, Anirudh Goyal, Olexa Bilaniuk, Jonathan Binas, Michael C. Mozer, Chris Pal, Yoshua Bengio

Towards Practical Credit Assignment for Deep Reinforcement Learning

Credit assignment is a fundamental problem in reinforcement learning, the problem of measuring an action's influence on future rewards. Explicit credit assignment methods have the potential to boost the performance of RL algorithms on many…

Machine Learning · Computer Science 2022-02-15 Vyacheslav Alipov, Riley Simmons-Edler, Nikita Putintsev, Pavel Kalinin, Dmitry Vetrov

Hindsight PRIORs for Reward Learning from Human Preferences

Preference based Reinforcement Learning (PbRL) removes the need to hand specify a reward function by learning a reward from preference feedback over policy behaviors. Current approaches to PbRL do not address the credit assignment problem…

Machine Learning · Computer Science 2024-04-16 Mudit Verma, Katherine Metcalf

Successor Features Combine Elements of Model-Free and Model-based Reinforcement Learning

A key question in reinforcement learning is how an intelligent agent can generalize knowledge across different inputs. By generalizing across different inputs, information learned for one input can be immediately reused for improving…

Machine Learning · Computer Science 2020-10-06 Lucas Lehnert, Michael L. Littman