Related papers: Distributed Recursion Revisited

A Deep Reinforcement Learning Approach to Efficient Distributed Optimization

In distributed optimization, the practical problem-solving performance is essentially sensitive to algorithm selection, parameter setting, problem type and data pattern. Thus, it is often laborious to acquire a highly efficient method for a…

Optimization and Control · Mathematics 2024-01-04 Daokuan Zhu, Tianqi Xu, Jie Lu

On Policy Evaluation Algorithms in Distributional Reinforcement Learning

We introduce a novel class of algorithms to efficiently approximate the unknown return distributions in policy evaluation problems from distributional reinforcement learning (DRL). The proposed distributional dynamic programming algorithms…

Machine Learning · Statistics 2024-07-22 Julian Gerstenberg, Ralph Neininger, Denis Spiegel

Solving Quadratic Programs via Deep Unrolled Douglas-Rachford Splitting

Convex quadratic programs (QPs) are fundamental to numerous applications, including finance, engineering, and energy systems. Among the various methods for solving them, the Douglas-Rachford (DR) splitting algorithm is notable for its…

Optimization and Control · Mathematics 2025-08-19 Jinxin Xiong, Xi Gao, Linxin Yang, Jiang Xue, Xiaodong Luo, Akang Wang

Distributed Optimization via Adaptive Regularization for Large Problems with Separable Constraints

Many practical applications require solving an optimization over large and high-dimensional data sets, which makes these problems hard to solve and prohibitively time consuming. In this paper, we propose a parallel distributed algorithm…

Distributed, Parallel, and Cluster Computing · Computer Science 2012-12-03 Elad Gilboa, Phani Chavali, Peng Yang, Arye Nehorai

Reducing the dimensionality of data using tempered distributions

We reformulate unsupervised dimension reduction problem (UDR) in the language of tempered distributions, i.e. as a problem of approximating an empirical probability density function by another tempered distribution, supported in a…

Statistics Theory · Mathematics 2022-11-08 Rustem Takhanov

A Dynamic Penalty Function Approach for Constraints-Handling in Reinforcement Learning

Reinforcement learning (RL) is attracting attention as an effective way to solve sequential optimization problems that involve high dimensional state/action space and stochastic uncertainties. Many such problems involve constraints…

Machine Learning · Computer Science 2021-04-01 Haeun Yoo, Victor M. Zavala, Jay H. Lee

Aligning Distributionally Robust Optimization with Practical Deep Learning Needs

While traditional Deep Learning (DL) optimization methods treat all training samples equally, Distributionally Robust Optimization (DRO) adaptively assigns importance weights to different samples. However, a significant gap exists between…

Machine Learning · Computer Science 2025-09-26 Dmitrii Feoktistov, Igor Ignashin, Andrey Veprikov, Nikita Borovko, Alexander Bogdanov, Savelii Chezhegov, Aleksandr Beznosikov

Policy Evaluation in Distributional LQR (Extended Version)

Distributional reinforcement learning (DRL) enhances the understanding of the effects of the randomness in the environment by letting agents learn the distribution of a random return, rather than its expected value as in standard…

Optimization and Control · Mathematics 2024-03-26 Zifan Wang, Yulong Gao, Siyi Wang, Michael M. Zavlanos, Alessandro Abate, Karl H. Johansson

Optimized Dimensionality Reduction for Moment-based Distributionally Robust Optimization

Moment-based distributionally robust optimization (DRO) provides an optimization framework to integrate statistical information with traditional optimization approaches. Under this framework, one assumes that the underlying joint…

Optimization and Control · Mathematics 2023-11-01 Shiyi Jiang, Jianqiang Cheng, Kai Pan, Zuo-Jun Max Shen

Statistics and Samples in Distributional Reinforcement Learning

We present a unifying framework for designing and analysing distributional reinforcement learning (DRL) algorithms in terms of recursively estimating statistics of the return distribution. Our key insight is that DRL algorithms can be…

Machine Learning · Statistics 2019-02-22 Mark Rowland, Robert Dadashi, Saurabh Kumar, Rémi Munos, Marc G. Bellemare, Will Dabney

Policy Evaluation in Distributional LQR

Distributional reinforcement learning (DRL) enhances the understanding of the effects of the randomness in the environment by letting agents learn the distribution of a random return, rather than its expected value as in standard RL. At the…

Optimization and Control · Mathematics 2023-03-27 Zifan Wang, Yulong Gao, Siyi Wang, Michael M. Zavlanos, Alessandro Abate, Karl H. Johansson

Network Optimization via Smooth Exact Penalty Functions Enabled by Distributed Gradient Computation

This paper proposes a distributed algorithm for a network of agents to solve an optimization problem with separable objective function and locally coupled constraints. Our strategy is based on reformulating the original constrained problem…

Optimization and Control · Mathematics 2021-03-12 Priyank Srivastava, Jorge Cortes

Sample-based Distributional Policy Gradient

Distributional reinforcement learning (DRL) is a recent reinforcement learning framework whose success has been supported by various empirical studies. It relies on the key idea of replacing the expected return with the return distribution,…

Machine Learning · Computer Science 2020-01-09 Rahul Singh, Keuntaek Lee, Yongxin Chen

Extended reverse-convex programming: an approximate enumeration approach to global optimization

A new approach to solving a large class of factorable nonlinear programming (NLP) problems to global optimality is presented in this paper. Unlike the traditional strategy of partitioning the decision-variable space employed in many…

Optimization and Control · Mathematics 2015-04-28 Gene A. Bunin

Distributed Constrained Optimization by Consensus-Based Primal-Dual Perturbation Method

Various distributed optimization methods have been developed for solving problems which have simple local constraint sets and whose objective function is the sum of local cost functions of distributed agents in a network. Motivated by…

Systems and Control · Computer Science 2016-11-17 Tsung-Hui Chang, Angelia Nedić, Anna Scaglione

Distribution Matching Distillation Meets Reinforcement Learning

Distribution Matching Distillation (DMD) facilitates efficient inference by distilling multi-step diffusion models into few-step variants. Concurrently, Reinforcement Learning (RL) has emerged as a vital tool for aligning generative models…

Computer Vision and Pattern Recognition · Computer Science 2026-03-26 Dengyang Jiang, Dongyang Liu, Zanyi Wang, Qilong Wu, Liuzhuozheng Li, Hengzhuang Li, Xin Jin, David Liu, Changsheng Lu, Zhen Li, Bo Zhang, Mengmeng Wang, Steven Hoi, Peng Gao, Harry Yang

Optimistic Distributionally Robust Policy Optimization

Trust Region Policy Optimization (TRPO) and Proximal Policy Optimization (PPO), as the widely employed policy based reinforcement learning (RL) methods, are prone to converge to a sub-optimal solution as they limit the policy representation…

Machine Learning · Computer Science 2020-06-16 Jun Song, Chaoyue Zhao

Partitioned Linear Programming Approximations for MDPs

Approximate linear programming (ALP) is an efficient approach to solving large factored Markov decision processes (MDPs). The main idea of the method is to approximate the optimal value function by a set of basis functions and optimize…

Artificial Intelligence · Computer Science 2012-06-18 Branislav Kveton, Milos Hauskrecht

A Rollout-Based Algorithm and Reward Function for Resource Allocation in Business Processes

Resource allocation plays a critical role in minimizing cycle time and improving the efficiency of business processes. Recently, Deep Reinforcement Learning (DRL) has emerged as a powerful technique to optimize resource allocation policies…

Machine Learning · Computer Science 2025-09-03 Jeroen Middelhuis, Zaharah Bukhsh, Ivo Adan, Remco Dijkman

Towards Efficient Large-Scale Network Slicing: An LP Dynamic Rounding-and-Refinement Approach

In this paper, we propose an efficient algorithm for the network slicing problem which attempts to map multiple customized virtual network requests (also called services) to a common shared network infrastructure and allocate network…

Information Theory · Computer Science 2023-02-14 Wei-Kun Chen, Ya-Feng Liu, Fan Liu, Yu-Hong Dai, Zhi-Quan Luo