Related papers: Risk-Averse Learning by Temporal Difference Method…

Optimal Transport-Guided Safety in Temporal Difference Reinforcement Learning

The primary goal of reinforcement learning is to develop decision-making policies that prioritize optimal performance, frequently without considering safety. In contrast, safe reinforcement learning seeks to reduce or avoid unsafe behavior.…

Machine Learning · Computer Science 2025-06-17 Zahra Shahrooei , Ali Baheri

Risk-Averse Reinforcement Learning via Dynamic Time-Consistent Risk Measures

Traditional reinforcement learning (RL) aims to maximize the expected total reward, while the risk of uncertain outcomes needs to be controlled to ensure reliable performance in a risk-averse setting. In this paper, we consider the problem…

Machine Learning · Computer Science 2023-01-18 Xian Yu , Siqian Shen

Robust Reinforcement Learning with Distributional Risk-averse formulation

Robust Reinforcement Learning tries to make predictions more robust to changes in the dynamics or rewards of the system. This problem is particularly important when the dynamics and rewards of the environment are estimated from the data. In…

Machine Learning · Computer Science 2022-06-15 Pierre Clavier , Stéphanie Allassonière , Erwan Le Pennec

Reinforcement Learning with Dynamic Convex Risk Measures

We develop an approach for solving time-consistent risk-sensitive stochastic optimization problems using model-free reinforcement learning (RL). Specifically, we assume agents assess the risk of a sequence of random variables using dynamic…

Machine Learning · Computer Science 2022-12-01 Anthony Coache , Sebastian Jaimungal

Risk Averse Robust Adversarial Reinforcement Learning

Deep reinforcement learning has recently made significant progress in solving computer games and robotic control tasks. A known problem, though, is that policies overfit to the training environment and may not avoid rare, catastrophic…

Machine Learning · Computer Science 2019-04-02 Xinlei Pan , Daniel Seita , Yang Gao , John Canny

Risk-Averse Model Uncertainty for Distributionally Robust Safe Reinforcement Learning

Many real-world domains require safe decision making in uncertain environments. In this work, we introduce a deep reinforcement learning framework for approaching this important problem. We consider a distribution over transition models,…

Machine Learning · Computer Science 2023-10-27 James Queeney , Mouhacine Benosman

Evolutionary Algorithms for Reinforcement Learning

There are two distinct approaches to solving reinforcement learning problems, namely, searching in value function space and searching in policy space. Temporal difference methods and evolutionary algorithms are well-known examples of these…

Machine Learning · Computer Science 2011-06-02 J. J. Grefenstette , D. E. Moriarty , A. C. Schultz

Epistemic Risk-Sensitive Reinforcement Learning

We develop a framework for interacting with uncertain environments in reinforcement learning (RL) by leveraging preferences in the form of utility functions. We claim that there is value in considering different risk measures during…

Machine Learning · Computer Science 2021-02-23 Hannes Eriksson , Christos Dimitrakakis

Risk-Aware Active Inverse Reinforcement Learning

Active learning from demonstration allows a robot to query a human for specific types of input to achieve efficient learning. Existing work has explored a variety of active query strategies; however, to our knowledge, none of these…

Machine Learning · Computer Science 2019-06-05 Daniel S. Brown , Yuchen Cui , Scott Niekum

Safe Reinforcement Learning with Contrastive Risk Prediction

As safety violations can lead to severe consequences in real-world robotic applications, the increasing deployment of Reinforcement Learning (RL) in robotic domains has propelled the study of safe exploration for reinforcement learning…

Artificial Intelligence · Computer Science 2022-09-21 Hanping Zhang , Yuhong Guo

Risk-averse learning with delayed feedback

In real-world scenarios, risk-averse learning is valuable for mitigating potential adverse outcomes. However, the delayed feedback makes it challenging to assess and manage risk effectively. In this paper, we investigate risk-averse…

Machine Learning · Computer Science 2025-08-06 Siyi Wang , Zifan Wang , Karl Henrik Johansson , Sandra Hirche

Risk-Averse Learning with Varying Risk Levels

In safety-critical decision-making, the environment may evolve over time, and the learner adjusts its risk level accordingly. This work investigates risk-averse online optimization in dynamic environments with varying risk levels, employing…

Optimization and Control · Mathematics 2025-12-30 Siyi Wang , Zifan Wang , Karl H. Johansson

Improving Robustness via Risk Averse Distributional Reinforcement Learning

One major obstacle that precludes the success of reinforcement learning in real-world applications is the lack of robustness, either to model uncertainties or external disturbances, of the trained policies. Robustness is critical when the…

Machine Learning · Computer Science 2020-05-05 Rahul Singh , Qinsheng Zhang , Yongxin Chen

Provably Correct Learning Algorithms in the Presence of Time-Varying Features Using a Variational Perspective

Features in machine learning problems are often time-varying and may be related to outputs in an algebraic or dynamical manner. The dynamic nature of these machine learning problems renders current higher order accelerated gradient descent…

Optimization and Control · Mathematics 2019-05-29 Joseph E. Gaudio , Travis E. Gibson , Anuradha M. Annaswamy , Michael A. Bolender

Distributional Method for Risk Averse Reinforcement Learning

We introduce a distributional method for learning the optimal policy in risk averse Markov decision process with finite state action spaces, latent costs, and stationary dynamics. We assume sequential observations of states, actions, and…

Machine Learning · Computer Science 2023-03-01 Ziteng Cheng , Sebastian Jaimungal , Nick Martin

Directly Estimating the Variance of the {\lambda}-Return Using Temporal-Difference Methods

This paper investigates estimating the variance of a temporal-difference learning agent's update target. Most reinforcement learning methods use an estimate of the value function, which captures how good it is for the agent to be in a…

Artificial Intelligence · Computer Science 2018-02-15 Craig Sherstan , Brendan Bennett , Kenny Young , Dylan R. Ashley , Adam White , Martha White , Richard S. Sutton

Risk-sensitive Reinforcement Learning Based on Convex Scoring Functions

We propose a reinforcement learning (RL) framework under a broad class of risk objectives, characterized by convex scoring functions. This class covers many common risk measures, such as variance, Expected Shortfall, entropic Value-at-Risk,…

Mathematical Finance · Quantitative Finance 2025-05-16 Shanyu Han , Yang Liu , Xiang Yu

Loss Dynamics of Temporal Difference Reinforcement Learning

Reinforcement learning has been successful across several applications in which agents have to learn to act in environments with sparse feedback. However, despite this empirical success there is still a lack of theoretical understanding of…

Machine Learning · Statistics 2023-11-08 Blake Bordelon , Paul Masset , Henry Kuo , Cengiz Pehlevan

Trajectory Planning for Autonomous Vehicle Using Iterative Reward Prediction in Reinforcement Learning

Traditional trajectory planning methods for autonomous vehicles have several limitations. For example, heuristic and explicit simple rules limit generalizability and hinder complex motions. These limitations can be addressed using…

Robotics · Computer Science 2024-05-14 Hyunwoo Park

Objective-aware Traffic Simulation via Inverse Reinforcement Learning

Traffic simulators act as an essential component in the operating and planning of transportation systems. Conventional traffic simulators usually employ a calibrated physical car-following model to describe vehicles' behaviors and their…

Artificial Intelligence · Computer Science 2022-07-12 Guanjie Zheng , Hanyang Liu , Kai Xu , Zhenhui Li