Related papers: Expected Eligibility Traces

Predecessor Features

Any reinforcement learning system must be able to identify which past events contributed to observed outcomes, a problem known as credit assignment. A common solution to this problem is to use an eligibility trace to assign credit to…

Machine Learning · Computer Science 2022-07-26 Duncan Bailey , Marcelo G. Mattar

One-shot learning and behavioral eligibility traces in sequential decision making

In many daily tasks we make multiple decisions before reaching a goal. In order to learn such sequences of decisions, a mechanism to link earlier actions to later reward is necessary. Reinforcement learning theory suggests two classes of…

Neurons and Cognition · Quantitative Biology 2019-11-13 Marco Lehmann , He Xu , Vasiliki Liakoni , Michael Herzog , Wulfram Gerstner , Kerstin Preuschoff

META-Learning State-based Eligibility Traces for More Sample-Efficient Policy Evaluation

Temporal-Difference (TD) learning is a standard and very successful reinforcement learning approach, at the core of both algorithms that learn the value of a given policy, as well as algorithms which learn how to improve policies.…

Machine Learning · Computer Science 2020-05-19 Mingde Zhao , Sitao Luan , Ian Porada , Xiao-Wen Chang , Doina Precup

META-Learning Eligibility Traces for More Sample Efficient Temporal Difference Learning

Temporal-Difference (TD) learning is a standard and very successful reinforcement learning approach, at the core of both algorithms that learn the value of a given policy, as well as algorithms which learn how to improve policies.…

Machine Learning · Computer Science 2020-06-17 Mingde Zhao

Learning From the Past with Cascading Eligibility Traces

Animals often receive information about errors and rewards after a significant delay. For example, there is typically a delay of tens to hundreds of milliseconds between motor actions and visual feedback. The standard approach to handling…

Neurons and Cognition · Quantitative Biology 2025-06-18 Tokiniaina Raharison Ralambomihanta , Ivan Anokhin , Roman Pogodin , Samira Ebrahimi Kahou , Jonathan Cornford , Blake Aaron Richards

From Past to Future: Rethinking Eligibility Traces

In this paper, we introduce a fresh perspective on the challenges of credit assignment and policy evaluation. First, we delve into the nuances of eligibility traces and explore instances where their updates may result in unexpected credit…

Machine Learning · Computer Science 2023-12-21 Dhawal Gupta , Scott M. Jordan , Shreyas Chaudhari , Bo Liu , Philip S. Thomas , Bruno Castro da Silva

Theta sequences as eligibility traces: a biological solution to credit assignment

Credit assignment problems, for example policy evaluation in RL, often require bootstrapping prediction errors through preceding states \textit{or} maintaining temporally extended memory traces; solutions which are unfavourable or…

Neurons and Cognition · Quantitative Biology 2023-05-16 Tom M George

Recall Traces: Backtracking Models for Efficient Reinforcement Learning

In many environments only a tiny subset of all states yield high reward. In these cases, few of the interactions with the environment provide a relevant learning signal. Hence, we may want to preferentially train on those high-reward states…

Machine Learning · Computer Science 2019-01-30 Anirudh Goyal , Philemon Brakel , William Fedus , Soumye Singhal , Timothy Lillicrap , Sergey Levine , Hugo Larochelle , Yoshua Bengio

Investigating Recurrence and Eligibility Traces in Deep Q-Networks

Eligibility traces in reinforcement learning are used as a bias-variance trade-off and can often speed up training time by propagating knowledge back over time-steps in a single update. We investigate the use of eligibility traces in…

Artificial Intelligence · Computer Science 2017-04-20 Jean Harb , Doina Precup

Towards Causal Credit Assignment

Adequately assigning credit to actions for future outcomes based on their contributions is a long-standing open challenge in Reinforcement Learning. The assumptions of the most commonly used credit assignment method are disadvantageous in…

Machine Learning · Computer Science 2023-05-18 Mátyás Schubert

Hindsight Credit Assignment

We consider the problem of efficient credit assignment in reinforcement learning. In order to efficiently and meaningfully utilize new data, we propose to explicitly assign credit to past decisions based on the likelihood of them having led…

Machine Learning · Computer Science 2019-12-06 Anna Harutyunyan , Will Dabney , Thomas Mesnard , Mohammad Azar , Bilal Piot , Nicolas Heess , Hado van Hasselt , Greg Wayne , Satinder Singh , Doina Precup , Remi Munos

Selective Credit Assignment

Efficient credit assignment is essential for reinforcement learning algorithms in both prediction and control settings. We describe a unified view on temporal-difference algorithms for selective credit assignment. These selective algorithms…

Machine Learning · Computer Science 2022-02-22 Veronica Chelu , Diana Borsa , Doina Precup , Hado van Hasselt

Sequence Compression Speeds Up Credit Assignment in Reinforcement Learning

Temporal credit assignment in reinforcement learning is challenging due to delayed and stochastic outcomes. Monte Carlo targets can bridge long delays between action and consequence but lead to high-variance targets due to stochasticity.…

Machine Learning · Computer Science 2024-06-05 Aditya A. Ramesh , Kenny Young , Louis Kirsch , Jürgen Schmidhuber

Continual Auxiliary Task Learning

Learning auxiliary tasks, such as multiple predictions about the world, can provide many benefits to reinforcement learning systems. A variety of off-policy learning algorithms have been developed to learn such predictions, but as yet there…

Machine Learning · Computer Science 2022-02-24 Matthew McLeod , Chunlok Lo , Matthew Schlegel , Andrew Jacobsen , Raksha Kumaraswamy , Martha White , Adam White

Hindsight-DICE: Stable Credit Assignment for Deep Reinforcement Learning

Oftentimes, environments for sequential decision-making problems can be quite sparse in the provision of evaluative feedback to guide reinforcement-learning agents. In the extreme case, long trajectories of behavior are merely punctuated…

Machine Learning · Computer Science 2023-08-22 Akash Velu , Skanda Vaidyanath , Dilip Arumugam

Delayed Rewards Calibration via Reward Empirical Sufficiency

Appropriate credit assignment for delay rewards is a fundamental challenge for reinforcement learning. To tackle this problem, we introduce a delay reward calibration paradigm inspired from a classification perspective. We hypothesize that…

Machine Learning · Computer Science 2021-08-26 Yixuan Liu , Hu Wang , Xiaowei Wang , Xiaoyue Sun , Liuyue Jiang , Minhui Xue

Who Gets Credit or Blame? Attributing Accountability in Modern AI Systems

Modern AI systems are typically developed through multiple stages-pretraining, fine-tuning rounds, and subsequent adaptation or alignment, where each stage builds on the previous ones and updates the model in distinct ways. This raises a…

Machine Learning · Computer Science 2026-02-10 Shichang Zhang , Hongzhe Du , Jiaqi W. Ma , Himabindu Lakkaraju

Source Traces for Temporal Difference Learning

This paper motivates and develops source traces for temporal difference (TD) learning in the tabular setting. Source traces are like eligibility traces, but model potential histories rather than immediate ones. This allows TD errors to be…

Machine Learning · Computer Science 2019-02-11 Silviu Pitis

Transfer Learning (TL) offers the potential to accelerate learning by transferring knowledge across tasks. However, it faces critical challenges such as negative transfer, domain adaptation and inefficiency in selecting solid source…

Machine Learning · Computer Science 2025-07-29 Alessandro Capurso , Elia Piccoli , Davide Bacciu

Learning Action-Transferable Policy with Action Embedding

Transfer learning (TL) is a promising way to improve the sample efficiency of reinforcement learning. However, how to efficiently transfer knowledge across tasks with different state-action spaces is investigated at an early stage. Most…

Machine Learning · Computer Science 2021-05-11 Yu Chen , Yingfeng Chen , Zhipeng Hu , Tianpei Yang , Changjie Fan , Yang Yu , Jianye Hao