Related papers: A Block Coordinate Ascent Algorithm for Mean-Varia…

Risk-Sensitive Deep RL: Variance-Constrained Actor-Critic Provably Finds Globally Optimal Policy

While deep reinforcement learning has achieved tremendous successes in various applications, most existing works only focus on maximizing the expected value of total return and thus ignore its inherent stochasticity. Such stochasticity is…

Machine Learning · Computer Science 2023-09-19 Han Zhong , Xun Deng , Ethan X. Fang , Zhuoran Yang , Zhaoran Wang , Runze Li

Global Algorithms for Mean-Variance Optimization in Markov Decision Processes

Dynamic optimization of mean and variance in Markov decision processes (MDPs) is a long-standing challenge caused by the failure of dynamic programming. In this paper, we propose a new approach to find the globally optimal policy for…

Optimization and Control · Mathematics 2023-02-28 Li Xia , Shuai Ma

Mean-Variance Portfolio Management with Functional Optimization

This paper introduces a new functional optimization approach to portfolio optimization problems by treating the unknown weight vector as a function of past values instead of treating them as fixed unknown coefficients in the majority of…

Portfolio Management · Quantitative Finance 2020-12-10 Ka Wai Tsang , Zhaoyi He

A Frank-Wolfe Based Branch-and-Bound Algorithm for Mean-Risk Optimization

We present an exact algorithm for mean-risk optimization subject to a budget constraint, where decision variables may be continuous or integer. The risk is measured by the covariance matrix and weighted by an arbitrary monotone function,…

Optimization and Control · Mathematics 2017-05-08 Christoph Buchheim , Marianna De Santis , Francesco Rinaldi , Long Trieu

Mean-Variance Optimization and Algorithm for Finite-Horizon Markov Decision Processes

Multi-period mean-variance optimization is a long-standing problem, caused by the failure of dynamic programming principle. This paper studies the mean-variance optimization in a setting of finite-horizon discrete-time Markov decision…

Optimization and Control · Mathematics 2025-07-31 Li Xia , Zhihui Yu

Parallel Random Block-Coordinate Forward-Backward Algorithm: A Unified Convergence Analysis

We study the block-coordinate forward-backward algorithm in which the blocks are updated in a random and possibly parallel manner, according to arbitrary probabilities. The algorithm allows different stepsizes along the block-coordinates to…

Optimization and Control · Mathematics 2020-11-30 Saverio Salzo , Silvia Villa

A unified algorithm framework for mean-variance optimization in discounted Markov decision processes

This paper studies the risk-averse mean-variance optimization in infinite-horizon discounted Markov decision processes (MDPs). The involved variance metric concerns reward variability during the whole process, and future deviations are…

Optimization and Control · Mathematics 2022-01-19 Shuai Ma , Xiaoteng Ma , Li Xia

Mean-Variance Optimization of Discrete Time Discounted Markov Decision Processes

In this paper, we study a mean-variance optimization problem in an infinite horizon discrete time discounted Markov decision process (MDP). The objective is to minimize the variance of system rewards with the constraint of mean performance.…

Optimization and Control · Mathematics 2017-08-24 Li Xia

Stochastic Approximation for Risk-aware Markov Decision Processes

We develop a stochastic approximation-type algorithm to solve finite state/action, infinite-horizon, risk-aware Markov decision processes. Our algorithm has two loops. The inner loop computes the risk by solving a stochastic saddle-point…

Optimization and Control · Mathematics 2019-12-05 Wenjie Huang , William B. Haskell

Inexact Variable Metric Stochastic Block-Coordinate Descent for Regularized Optimization

Block-coordinate descent (BCD) is a popular framework for large-scale regularized optimization problems with block-separable structure. Existing methods have several limitations. They often assume that subproblems can be solved exactly at…

Optimization and Control · Mathematics 2019-11-05 Ching-pei Lee , Stephen J. Wright

Convergence rate of random scan Coordinate Ascent Variational Inference under log-concavity

The Coordinate Ascent Variational Inference scheme is a popular algorithm used to compute the mean-field approximation of a probability distribution of interest. We analyze its random scan version, under log-concavity assumptions on the…

Machine Learning · Statistics 2024-09-24 Hugo Lavenant , Giacomo Zanella

Theoretical and Computational Guarantees of Mean Field Variational Inference for Community Detection

The mean field variational Bayes method is becoming increasingly popular in statistics and machine learning. Its iterative Coordinate Ascent Variational Inference algorithm has been widely applied to large scale Bayesian inference. See Blei…

Statistics Theory · Mathematics 2017-12-12 Anderson Y. Zhang , Harrison H. Zhou

Risk-Sensitive Markov Decision Processes with Combined Metrics of Mean and Variance

This paper investigates the optimization problem of an infinite stage discrete time Markov decision process (MDP) with a long-run average metric considering both mean and variance of rewards together. Such performance metric is important…

Optimization and Control · Mathematics 2020-08-11 Li Xia

Accelerated Point-wise Maximum Approach to Approximate Dynamic Programming

We describe an approximate dynamic programming approach to compute lower bounds on the optimal value function for a discrete time, continuous space, infinite horizon setting. The approach iteratively constructs a family of lower bounding…

Systems and Control · Electrical Eng. & Systems 2024-12-20 Paul N. Beuchat , Joseph Warrington , John Lygeros

Optimal variance-reduced stochastic approximation in Banach spaces

We study the problem of estimating the fixed point of a contractive operator defined on a separable Banach space. Focusing on a stochastic query model that provides noisy evaluations of the operator, we analyze a variance-reduced stochastic…

Statistics Theory · Mathematics 2022-11-30 Wenlong Mou , Koulik Khamaru , Martin J. Wainwright , Peter L. Bartlett , Michael I. Jordan

Inexact Block Coordinate Descent Algorithms for Nonsmooth Nonconvex Optimization

In this paper, we propose an inexact block coordinate descent algorithm for large-scale nonsmooth nonconvex optimization problems. At each iteration, a particular block variable is selected and updated by inexactly solving the original…

Optimization and Control · Mathematics 2019-12-12 Yang Yang , Marius Pesavento , Zhi-Quan Luo , Björn Ottersten

Fixed-point equations solving Risk-sensitive MDP with constraint

There are no computationally feasible algorithms that provide solutions to the finite horizon Risk-sensitive Constrained Markov Decision Process (Risk-CMDP) problem, even for problems with moderate horizon. With an aim to design the same,…

Optimization and Control · Mathematics 2023-03-27 Vartika Singh , Veeraruna Kavitha

A Gradient Descent-Ascent Method for Continuous-Time Risk-Averse Optimal Control

In this paper, we consider continuous-time stochastic optimal control problems where the cost is evaluated through a coherent risk measure. We provide an explicit gradient descent-ascent algorithm which applies to problems subject to…

Optimization and Control · Mathematics 2023-06-23 Gabriel Velho , Jean Auriol , Riccardo Bonalli

Policy Gradient for Continuing Tasks in Non-stationary Markov Decision Processes

Reinforcement learning considers the problem of finding policies that maximize an expected cumulative reward in a Markov decision process with unknown transition probabilities. In this paper we consider the problem of finding optimal…

Machine Learning · Computer Science 2020-10-19 Santiago Paternain , Juan Andres Bazerque , Alejandro Ribeiro

Deep neural networks algorithms for stochastic control problems on finite horizon: convergence analysis

This paper develops algorithms for high-dimensional stochastic control problems based on deep learning and dynamic programming. Unlike classical approximate dynamic programming approaches, we first approximate the optimal policy by means of…

Probability · Mathematics 2021-09-21 Côme Huré , Huyên Pham , Achref Bachouch , Nicolas Langrené