Related papers: Distributional reinforcement learning with linear …

An Analysis of Categorical Distributional Reinforcement Learning

Distributional approaches to value-based reinforcement learning model the entire distribution of returns, rather than just their expected values, and have recently been shown to yield state-of-the-art empirical performance. This was…

Machine Learning · Statistics 2018-02-23 Mark Rowland , Marc G. Bellemare , Will Dabney , Rémi Munos , Yee Whye Teh

Distributional Reinforcement Learning via the Cram\'er Distance

This paper explores the application of the Soft Actor-Critic (SAC) algorithm within a Distributional Reinforcement Learning setting and introduces an implementation of such algorithm named Cram\'er-based Distributional Soft Actor-Critic…

Machine Learning · Computer Science 2026-05-12 Vanya Aziz , Ivo Nowak , E. M. T Hendrix

Distributional Reinforcement Learning with Quantile Regression

In reinforcement learning an agent interacts with the environment by taking actions and observing the next state and reward. When sampled probabilistically, these state transitions, rewards, and actions can all induce randomness in the…

Artificial Intelligence · Computer Science 2017-10-30 Will Dabney , Mark Rowland , Marc G. Bellemare , Rémi Munos

A Comparative Analysis of Expected and Distributional Reinforcement Learning

Since their introduction a year ago, distributional approaches to reinforcement learning (distributional RL) have produced strong results relative to the standard approach which models expected values (expected RL). However, aside from…

Machine Learning · Computer Science 2019-02-22 Clare Lyle , Pablo Samuel Castro , Marc G. Bellemare

A Distributional Analysis of Sampling-Based Reinforcement Learning Algorithms

We present a distributional approach to theoretical analyses of reinforcement learning algorithms for constant step-sizes. We demonstrate its effectiveness by presenting simple and unified proofs of convergence for a variety of…

Machine Learning · Computer Science 2020-03-30 Philip Amortila , Doina Precup , Prakash Panangaden , Marc G. Bellemare

Nonlinear Distributional Gradient Temporal-Difference Learning

We devise a distributional variant of gradient temporal-difference (TD) learning. Distributional reinforcement learning has been demonstrated to outperform the regular one in the recent study \citep{bellemare2017distributional}. In the…

Machine Learning · Computer Science 2019-04-04 Chao Qu , Shie Mannor , Huan Xu

Conjugated Discrete Distributions for Distributional Reinforcement Learning

In this work we continue to build upon recent advances in reinforcement learning for finite Markov processes. A common approach among previous existing algorithms, both single-actor and distributed, is to either clip rewards or to apply a…

Machine Learning · Computer Science 2021-12-15 Björn Lindenberg , Jonas Nordqvist , Karl-Olof Lindahl

A Cram\'er Distance perspective on Quantile Regression based Distributional Reinforcement Learning

Distributional reinforcement learning (DRL) extends the value-based approach by approximating the full distribution over future returns instead of the mean only, providing a richer signal that leads to improved performances. Quantile…

Machine Learning · Statistics 2022-02-23 Alix Lhéritier , Nicolas Bondoux

Reinforcement Learning with Function Approximation: From Linear to Nonlinear

Function approximation has been an indispensable component in modern reinforcement learning algorithms designed to tackle problems with large state spaces in high dimensions. This paper reviews recent results on error analysis for these…

Machine Learning · Computer Science 2024-02-27 Jihao Long , Jiequn Han

Normality-Guided Distributional Reinforcement Learning for Continuous Control

Learning a predictive model of the mean return, or value function, plays a critical role in many reinforcement learning algorithms. Distributional reinforcement learning (DRL) has been shown to improve performance by modeling the value…

Machine Learning · Computer Science 2025-07-08 Ju-Seung Byun , Andrew Perrault

Provably Efficient $Q$-learning with Function Approximation via Distribution Shift Error Checking Oracle

$Q$-learning with function approximation is one of the most popular methods in reinforcement learning. Though the idea of using function approximation was proposed at least 60 years ago, even in the simplest setup, i.e, approximating…

Machine Learning · Computer Science 2019-11-05 Simon S. Du , Yuping Luo , Ruosong Wang , Hanrui Zhang

A Distributional Perspective on Reinforcement Learning

In this paper we argue for the fundamental importance of the value distribution: the distribution of the random return received by a reinforcement learning agent. This is in contrast to the common approach to reinforcement learning which…

Machine Learning · Computer Science 2017-07-24 Marc G. Bellemare , Will Dabney , Rémi Munos

Distributionally Robust Offline Reinforcement Learning with Linear Function Approximation

Among the reasons hindering reinforcement learning (RL) applications to real-world problems, two factors are critical: limited data and the mismatch between the testing environment (real environment in which the policy is deployed) and the…

Machine Learning · Computer Science 2023-01-30 Xiaoteng Ma , Zhipeng Liang , Jose Blanchet , Mingwen Liu , Li Xia , Jiheng Zhang , Qianchuan Zhao , Zhengyuan Zhou

Pitfall of Optimism: Distributional Reinforcement Learning by Randomizing Risk Criterion

Distributional reinforcement learning algorithms have attempted to utilize estimated uncertainty for exploration, such as optimism in the face of uncertainty. However, using the estimated variance for optimistic exploration may cause biased…

Machine Learning · Computer Science 2023-12-06 Taehyun Cho , Seungyub Han , Heesoo Lee , Kyungjae Lee , Jungwoo Lee

Policy Evaluation in Distributional LQR (Extended Version)

Distributional reinforcement learning (DRL) enhances the understanding of the effects of the randomness in the environment by letting agents learn the distribution of a random return, rather than its expected value as in standard…

Optimization and Control · Mathematics 2024-03-26 Zifan Wang , Yulong Gao , Siyi Wang , Michael M. Zavlanos , Alessandro Abate , Karl H. Johansson

Policy Evaluation in Distributional LQR

Distributional reinforcement learning (DRL) enhances the understanding of the effects of the randomness in the environment by letting agents learn the distribution of a random return, rather than its expected value as in standard RL. At the…

Optimization and Control · Mathematics 2023-03-27 Zifan Wang , Yulong Gao , Siyi Wang , Michael M. Zavlanos , Alessandro Abate , Karl H. Johansson

A Differential Perspective on Distributional Reinforcement Learning

To date, distributional reinforcement learning (distributional RL) methods have exclusively focused on the discounted setting, where an agent aims to optimize a discounted sum of rewards over time. In this work, we extend distributional RL…

Machine Learning · Computer Science 2026-01-14 Juan Sebastian Rojas , Chi-Guhn Lee

ES-C51: Expected Sarsa Based C51 Distributional Reinforcement Learning Algorithm

In most value-based reinforcement learning (RL) algorithms, the agent estimates only the expected reward for each action and selects the action with the highest reward. In contrast, Distributional Reinforcement Learning (DRL) estimates the…

Machine Learning · Computer Science 2025-10-20 Rijul Tandon , Peter Vamplew , Cameron Foale

On Policy Evaluation Algorithms in Distributional Reinforcement Learning

We introduce a novel class of algorithms to efficiently approximate the unknown return distributions in policy evaluation problems from distributional reinforcement learning (DRL). The proposed distributional dynamic programming algorithms…

Machine Learning · Statistics 2024-07-22 Julian Gerstenberg , Ralph Neininger , Denis Spiegel

Variance Control for Distributional Reinforcement Learning

Although distributional reinforcement learning (DRL) has been widely examined in the past few years, very few studies investigate the validity of the obtained Q-function estimator in the distributional setting. To fully understand how the…

Machine Learning · Computer Science 2023-08-01 Qi Kuang , Zhoufan Zhu , Liwen Zhang , Fan Zhou