Related papers: Dynamically Augmented CVaR for MDPs

Risk-Sensitive and Robust Decision-Making: a CVaR Optimization Approach

In this paper we address the problem of decision making within a Markov decision process (MDP) framework where risk and modeling errors are taken into account. Our approach is to minimize a risk-sensitive conditional-value-at-risk (CVaR)…

Artificial Intelligence · Computer Science 2015-06-09 Yinlam Chow , Aviv Tamar , Shie Mannor , Marco Pavone

Algorithms for CVaR Optimization in MDPs

In many sequential decision-making problems we may want to manage risk by minimizing some measure of variability in costs in addition to minimizing a standard criterion. Conditional value-at-risk (CVaR) is a relatively new risk measure that…

Artificial Intelligence · Computer Science 2014-07-14 Yinlam Chow , Mohammad Ghavamzadeh

Robust Risk-Sensitive Reinforcement Learning with Conditional Value-at-Risk

Robust Markov Decision Processes (RMDPs) have received significant research interest, offering an alternative to standard Markov Decision Processes (MDPs) that often assume fixed transition probabilities. RMDPs address this by optimizing…

Machine Learning · Computer Science 2024-05-06 Xinyi Ni , Lifeng Lai

Risk-Sensitive Markov Decision Processes with Long-Run CVaR Criterion

CVaR (Conditional Value at Risk) is a risk metric widely used in finance. However, dynamically optimizing CVaR is difficult since it is not a standard Markov decision process (MDP) and the principle of dynamic programming fails. In this…

Optimization and Control · Mathematics 2022-10-18 Li Xia , Peter W. Glynn

On the Maximization of Long-Run Reward CVaR for Markov Decision Processes

This paper studies the optimization of Markov decision processes (MDPs) from a risk-seeking perspective, where the risk is measured by conditional value-at-risk (CVaR). The objective is to find a policy that maximizes the long-run CVaR of…

Optimization and Control · Mathematics 2023-12-05 Li Xia , Zhihui Yu , Peter W. Glynn

Markov Decision Processes with Value-at-Risk Criterion

Value-at-risk (VaR), also known as quantile, is a crucial risk measure in finance and other fields. However, optimizing VaR metrics in Markov decision processes (MDPs) is challenging because VaR is non-additive and the traditional dynamic…

Optimization and Control · Mathematics 2025-07-31 Li Xia , Jinyan Pan

Long-Run Conditional Value-at-Risk Reinforcement Learning

Conditional value-at-risk (CVaR) is a prominent risk measure in financial engineering, energy systems, and supply chain management. In these domains, Markov decision processes (MDPs) with a long-run CVaR criterion effectively mitigate cost…

Optimization and Control · Mathematics 2026-03-11 Qixin Wang , Hao Cao , Jian-Qiang Hu , Mingjie Hu , Li Xia

Provably Efficient CVaR RL in Low-rank MDPs

We study risk-sensitive Reinforcement Learning (RL), where we aim to maximize the Conditional Value at Risk (CVaR) with a fixed risk tolerance $\tau$. Prior theoretical work studying risk-sensitive RL focuses on the tabular Markov Decision…

Machine Learning · Computer Science 2023-11-21 Yulai Zhao , Wenhao Zhan , Xiaoyan Hu , Ho-fung Leung , Farzan Farnia , Wen Sun , Jason D. Lee

Accelerated Online Risk-Averse Policy Evaluation in POMDPs with Theoretical Guarantees and Novel CVaR Bounds

Risk-averse decision-making under uncertainty in partially observable domains is a central challenge in artificial intelligence and is essential for developing reliable autonomous agents. The formal framework for such problems is the…

Statistics Theory · Mathematics 2026-02-27 Yaacov Pariente , Vadim Indelman

On the Fundamental Limitations of Dual Static CVaR Decompositions in Markov Decision Processes

It was recently shown that dynamic programming (DP) methods for finding static CVaR-optimal policies in Markov Decision Processes (MDPs) can fail when based on the dual formulation, yet the root cause of this failure remains unclear. We…

Machine Learning · Computer Science 2026-04-16 Mathieu Godbout , Audrey Durand

Reward Redistribution for CVaR MDPs using a Bellman Operator on L-infinity

Tail-end risk measures such as static conditional value-at-risk (CVaR) are used in safety-critical applications to prevent rare, yet catastrophic events. Unlike risk-neutral objectives, the static CVaR of the return depends on entire…

Machine Learning · Computer Science 2026-02-04 Aneri Muni , Vincent Taboga , Esther Derman , Pierre-Luc Bacon , Erick Delage

An Asymptotic CVaR Measure of Risk for Markov Chains

Risk sensitive decision making finds important applications in current day use cases. Existing risk measures consider a single or finite collection of random variables, which do not account for the asymptotic behaviour of underlying…

Risk Management · Quantitative Finance 2024-05-24 Shivam Patel , Vivek Borkar

On Optimizing the Conditional Value-at-Risk of a Maximum Cost for Risk-Averse Safety Analysis

The popularity of Conditional Value-at-Risk (CVaR), a risk functional from finance, has been growing in the control systems community due to its intuitive interpretation and axiomatic foundation. We consider a nonstandard optimal control…

Systems and Control · Electrical Eng. & Systems 2022-06-22 Margaret P. Chapman , Michael Fauss , Kevin M. Smith

On Dynamic Programming Decompositions of Static Risk Measures in Markov Decision Processes

Optimizing static risk-averse objectives in Markov decision processes is difficult because they do not admit standard dynamic programming equations common in Reinforcement Learning (RL) algorithms. Dynamic programming decompositions that…

Optimization and Control · Mathematics 2024-07-04 Jia Lin Hau , Erick Delage , Mohammad Ghavamzadeh , Marek Petrik

Planning for Risk-Aversion and Expected Value in MDPs

Planning in Markov decision processes (MDPs) typically optimises the expected cost. However, optimising the expectation does not consider the risk that for any given run of the MDP, the total cost received may be unacceptably high. An…

Artificial Intelligence · Computer Science 2022-03-11 Marc Rigter , Paul Duckworth , Bruno Lacerda , Nick Hawes

Risk-Constrained Reinforcement Learning with Percentile Risk Criteria

In many sequential decision-making problems one is interested in minimizing an expected cumulative cost while taking into account \emph{risk}, i.e., increased awareness of events of small probability and high consequences. Accordingly, the…

Artificial Intelligence · Computer Science 2017-04-07 Yinlam Chow , Mohammad Ghavamzadeh , Lucas Janson , Marco Pavone

Risk-Averse Bayes-Adaptive Reinforcement Learning

In this work, we address risk-averse Bayes-adaptive reinforcement learning. We pose the problem of optimising the conditional value at risk (CVaR) of the total return in Bayes-adaptive Markov decision processes (MDPs). We show that a policy…

Machine Learning · Computer Science 2021-10-27 Marc Rigter , Bruno Lacerda , Nick Hawes

Gradient-based optimisation of the conditional-value-at-risk using the multi-level Monte Carlo method

In this work, we tackle the problem of minimising the Conditional-Value-at-Risk (CVaR) of output quantities of complex differential models with random input data, using gradient-based approaches in combination with the Multi-Level Monte…

Numerical Analysis · Mathematics 2023-10-16 Sundar Ganesh , Fabio Nobile

Dynamic CoVaR Modeling and Estimation

The popular systemic risk measure CoVaR (conditional Value-at-Risk) and its variants are widely used in economics and finance. In this article, we propose joint dynamic forecasting models for the Value-at-Risk (VaR) and CoVaR. The CoVaR…

Econometrics · Economics 2025-01-22 Timo Dimitriadis , Yannick Hoga

Multi periods mean-DCVaR optimization: a Recursive Neural Network resolution

We study a discrete-time multi-period portfolio optimization problem under an explicit constraint on the Deviation Conditional Value-at-Risk (DCVaR), defined as the excess of Conditional Value-at-Risk over expected terminal wealth. The…

Portfolio Management · Quantitative Finance 2026-04-17 Jérôme Lelong , Véronique Maume-Deschamps , William Thevenot