English
Related papers

Related papers: Expected Eligibility Traces

200 papers

Any reinforcement learning system must be able to identify which past events contributed to observed outcomes, a problem known as credit assignment. A common solution to this problem is to use an eligibility trace to assign credit to…

Machine Learning · Computer Science 2022-07-26 Duncan Bailey , Marcelo G. Mattar

In many daily tasks we make multiple decisions before reaching a goal. In order to learn such sequences of decisions, a mechanism to link earlier actions to later reward is necessary. Reinforcement learning theory suggests two classes of…

Neurons and Cognition · Quantitative Biology 2019-11-13 Marco Lehmann , He Xu , Vasiliki Liakoni , Michael Herzog , Wulfram Gerstner , Kerstin Preuschoff

Temporal-Difference (TD) learning is a standard and very successful reinforcement learning approach, at the core of both algorithms that learn the value of a given policy, as well as algorithms which learn how to improve policies.…

Machine Learning · Computer Science 2020-05-19 Mingde Zhao , Sitao Luan , Ian Porada , Xiao-Wen Chang , Doina Precup

Temporal-Difference (TD) learning is a standard and very successful reinforcement learning approach, at the core of both algorithms that learn the value of a given policy, as well as algorithms which learn how to improve policies.…

Machine Learning · Computer Science 2020-06-17 Mingde Zhao

Animals often receive information about errors and rewards after a significant delay. For example, there is typically a delay of tens to hundreds of milliseconds between motor actions and visual feedback. The standard approach to handling…

In this paper, we introduce a fresh perspective on the challenges of credit assignment and policy evaluation. First, we delve into the nuances of eligibility traces and explore instances where their updates may result in unexpected credit…

Machine Learning · Computer Science 2023-12-21 Dhawal Gupta , Scott M. Jordan , Shreyas Chaudhari , Bo Liu , Philip S. Thomas , Bruno Castro da Silva

Credit assignment problems, for example policy evaluation in RL, often require bootstrapping prediction errors through preceding states \textit{or} maintaining temporally extended memory traces; solutions which are unfavourable or…

Neurons and Cognition · Quantitative Biology 2023-05-16 Tom M George

In many environments only a tiny subset of all states yield high reward. In these cases, few of the interactions with the environment provide a relevant learning signal. Hence, we may want to preferentially train on those high-reward states…

Eligibility traces in reinforcement learning are used as a bias-variance trade-off and can often speed up training time by propagating knowledge back over time-steps in a single update. We investigate the use of eligibility traces in…

Artificial Intelligence · Computer Science 2017-04-20 Jean Harb , Doina Precup

Adequately assigning credit to actions for future outcomes based on their contributions is a long-standing open challenge in Reinforcement Learning. The assumptions of the most commonly used credit assignment method are disadvantageous in…

Machine Learning · Computer Science 2023-05-18 Mátyás Schubert

We consider the problem of efficient credit assignment in reinforcement learning. In order to efficiently and meaningfully utilize new data, we propose to explicitly assign credit to past decisions based on the likelihood of them having led…

Efficient credit assignment is essential for reinforcement learning algorithms in both prediction and control settings. We describe a unified view on temporal-difference algorithms for selective credit assignment. These selective algorithms…

Machine Learning · Computer Science 2022-02-22 Veronica Chelu , Diana Borsa , Doina Precup , Hado van Hasselt

Temporal credit assignment in reinforcement learning is challenging due to delayed and stochastic outcomes. Monte Carlo targets can bridge long delays between action and consequence but lead to high-variance targets due to stochasticity.…

Machine Learning · Computer Science 2024-06-05 Aditya A. Ramesh , Kenny Young , Louis Kirsch , Jürgen Schmidhuber

Learning auxiliary tasks, such as multiple predictions about the world, can provide many benefits to reinforcement learning systems. A variety of off-policy learning algorithms have been developed to learn such predictions, but as yet there…

Machine Learning · Computer Science 2022-02-24 Matthew McLeod , Chunlok Lo , Matthew Schlegel , Andrew Jacobsen , Raksha Kumaraswamy , Martha White , Adam White

Oftentimes, environments for sequential decision-making problems can be quite sparse in the provision of evaluative feedback to guide reinforcement-learning agents. In the extreme case, long trajectories of behavior are merely punctuated…

Machine Learning · Computer Science 2023-08-22 Akash Velu , Skanda Vaidyanath , Dilip Arumugam

Appropriate credit assignment for delay rewards is a fundamental challenge for reinforcement learning. To tackle this problem, we introduce a delay reward calibration paradigm inspired from a classification perspective. We hypothesize that…

Machine Learning · Computer Science 2021-08-26 Yixuan Liu , Hu Wang , Xiaowei Wang , Xiaoyue Sun , Liuyue Jiang , Minhui Xue

Modern AI systems are typically developed through multiple stages-pretraining, fine-tuning rounds, and subsequent adaptation or alignment, where each stage builds on the previous ones and updates the model in distinct ways. This raises a…

Machine Learning · Computer Science 2026-02-10 Shichang Zhang , Hongzhe Du , Jiaqi W. Ma , Himabindu Lakkaraju

This paper motivates and develops source traces for temporal difference (TD) learning in the tabular setting. Source traces are like eligibility traces, but model potential histories rather than immediate ones. This allows TD errors to be…

Machine Learning · Computer Science 2019-02-11 Silviu Pitis

Transfer Learning (TL) offers the potential to accelerate learning by transferring knowledge across tasks. However, it faces critical challenges such as negative transfer, domain adaptation and inefficiency in selecting solid source…

Machine Learning · Computer Science 2025-07-29 Alessandro Capurso , Elia Piccoli , Davide Bacciu

Transfer learning (TL) is a promising way to improve the sample efficiency of reinforcement learning. However, how to efficiently transfer knowledge across tasks with different state-action spaces is investigated at an early stage. Most…

Machine Learning · Computer Science 2021-05-11 Yu Chen , Yingfeng Chen , Zhipeng Hu , Tianpei Yang , Changjie Fan , Yang Yu , Jianye Hao
‹ Prev 1 2 3 10 Next ›