Related papers: DSA: Decentralized Double Stochastic Averaging Gra…

A unitary distributed subgradient method for multi-agent optimization with different coupling sources

In this work, we first consider distributed convex constrained optimization problems where the objective function is encoded by multiple local and possibly nonsmooth objectives privately held by a group of agents, and propose a distributed…

Optimization and Control · Mathematics 2020-02-20 Changxin Liu, Huiping Li, Yang Shi

Decentralized Composite Optimization in Stochastic Networks: A Dual Averaging Approach with Linear Convergence

Decentralized optimization, particularly the class of decentralized composite convex optimization (DCCO) problems, has found many applications. Due to ubiquitous communication congestion and random dropouts in practice, it is highly…

Optimization and Control · Mathematics 2022-10-12 Changxin Liu, Zirui Zhou, Jian Pei, Yong Zhang, Yang Shi

Accelerated Dual Averaging Methods for Decentralized Constrained Optimization

In this work, we study decentralized convex constrained optimization problems in networks. We focus on the dual averaging-based algorithmic framework that is well-documented to be superior in handling constraints and complex communication…

Optimization and Control · Mathematics 2022-08-16 Changxin Liu, Yang Shi, Huiping Li, Wenli Du

Dual Averaging for Distributed Optimization: Convergence Analysis and Network Scaling

The goal of decentralized optimization over a network is to optimize a global objective formed by a sum of local (possibly nonsmooth) convex functions using only local computation and communication. It arises in various application domains,…

Optimization and Control · Mathematics 2015-03-17 John Duchi, Alekh Agarwal, Martin Wainwright

Dynamic Stochastic Approximation for Multi-stage Stochastic Optimization

In this paper, we consider multi-stage stochastic optimization problems with convex objectives and conic constraints at each stage. We present a new stochastic first-order method, namely the dynamic stochastic approximation (DSA) algorithm,…

Optimization and Control · Mathematics 2019-08-22 Guanghui Lan, Zhiqiang Zhou

Balancing Rates and Variance via Adaptive Batch-Size for Stochastic Optimization Problems

Stochastic gradient descent is a canonical tool for addressing stochastic optimization problems, and forms the bedrock of modern machine learning and statistics. In this work, we seek to balance the fact that attenuating step-size is…

Signal Processing · Electrical Eng. & Systems 2020-07-10 Zhan Gao, Alec Koppel, Alejandro Ribeiro

Distributed Stochastic Gradient Method for Non-Convex Problems with Applications in Supervised Learning

We develop a distributed stochastic gradient descent algorithm for solving non-convex optimization problems under the assumption that the local objective functions are twice continuously differentiable with Lipschitz continuous gradients…

Optimization and Control · Mathematics 2019-08-20 Jemin George, Tao Yang, He Bai, Prudhvi Gurram

S-DIGing: A Stochastic Gradient Tracking Algorithm for Distributed Optimization

In this paper, we study convex optimization problems where agents of a network cooperatively minimize the global objective function which consists of multiple local objective functions. Different from most of the existing works, the local…

Optimization and Control · Mathematics 2024-10-30 Huaqing Li, Lifeng Zheng, Zheng Wang, Yu Yan, Liping Feng, Jing Guo

A Unified and Refined Convergence Analysis for Non-Convex Decentralized Learning

We study the consensus decentralized optimization problem where the objective function is the average of $n$ agents' private non-convex cost functions; moreover, the agents can only communicate with their neighbors on a given network topology.…

Distributed, Parallel, and Cluster Computing · Computer Science 2022-07-20 Sulaiman A. Alghunaim, Kun Yuan

Gradient Descent Averaging and Primal-dual Averaging for Strongly Convex Optimization

Averaging schemes have attracted extensive attention in deep learning as well as traditional machine learning. They achieve theoretically optimal convergence and also improve empirical model performance. However, there is still a lack of…

Machine Learning · Computer Science 2021-01-19 Wei Tao, Wei Li, Zhisong Pan, Qing Tao

Decentralized Non-convex Stochastic Optimization with Heterogeneous Variance

Decentralized optimization is critical for solving large-scale machine learning problems over distributed networks, where multiple nodes collaborate through local communication. In practice, the variances of stochastic gradient estimators…

Optimization and Control · Mathematics 2026-02-13 Hongxu Chen, Ke Wei, Luo Luo

Double Averaging and Gradient Projection: Convergence Guarantees for Decentralized Constrained Optimization

We consider a generic decentralized constrained optimization problem over static, directed communication networks, where each agent has exclusive access to only one convex, differentiable, local objective term and one convex constraint set.…

Optimization and Control · Mathematics 2023-11-09 Firooz Shahriari-Mehr, Ashkan Panahi

Distributed Gradient Methods for Nonconvex Optimization: Local and Global Convergence Guarantees

The article discusses distributed gradient-descent algorithms for computing local and global minima in nonconvex optimization. For local optimization, we focus on distributed stochastic gradient descent (D-SGD)--a simple network-based…

Optimization and Control · Mathematics 2020-09-17 Brian Swenson, Soummya Kar, H. Vincent Poor, José M. F. Moura, Aaron Jaech

Asynchronous Decentralized Successive Convex Approximation

We study decentralized asynchronous multiagent optimization over networks, modeled as static (possibly directed) graphs. The optimization problem consists of minimizing a (possibly nonconvex) smooth function--the sum of the agents' local…

Optimization and Control · Mathematics 2020-02-03 Ye Tian, Ying Sun, Gesualdo Scutari

Dual Averaging Converges for Nonconvex Smooth Stochastic Optimization

Dual averaging and gradient descent with their stochastic variants stand as the two canonical recipe books for first-order optimization: Every modern variant can be viewed as a descendant of one or the other. In the convex regime, these…

Optimization and Control · Mathematics 2025-05-28 Tuo Liu, El Mehdi Saad, Wojciech Kotłowski, Francesco Orabona

Improving the Sample and Communication Complexity for Decentralized Non-Convex Optimization: A Joint Gradient Estimation and Tracking Approach

Many modern large-scale machine learning problems benefit from decentralized and stochastic optimization. Recent works have shown that utilizing both decentralized computing and local stochastic gradient estimates can outperform…

Optimization and Control · Mathematics 2020-11-06 Haoran Sun, Songtao Lu, Mingyi Hong

DSPG: Decentralized Simultaneous Perturbations Gradient Descent Scheme

Distributed descent-based methods are an essential toolset for solving optimization problems in multi-agent system scenarios. Here the agents seek to optimize a global objective function through mutual cooperation. Oftentimes, cooperation is…

Optimization and Control · Mathematics 2019-08-28 Arunselvan Ramaswamy

Stability and Generalization of the Decentralized Stochastic Gradient Descent Ascent Algorithm

The growing size of available data has attracted increasing interest in solving minimax problems in a decentralized manner for various machine learning tasks. Previous theoretical research has primarily focused on the convergence rate and…

Machine Learning · Computer Science 2023-11-01 Miaoxi Zhu, Li Shen, Bo Du, Dacheng Tao

DADA: Dual Averaging with Distance Adaptation

We present a novel universal gradient method for solving convex optimization problems. Our algorithm, Dual Averaging with Distance Adaptation (DADA), is based on the classical scheme of dual averaging and dynamically adjusts its…

Optimization and Control · Mathematics 2026-04-22 Mohammad Moshtaghifar, Anton Rodomanov, Daniil Vankov, Sebastian Stich

Stochastic DCA for minimizing a large sum of DC functions with application to Multi-class Logistic Regression

We consider the problem of minimizing a large sum of DC (Difference of Convex) functions, which appears in several different areas, especially in stochastic optimization and machine learning. Two DCA (DC Algorithm) based algorithms are proposed:…

Optimization and Control · Mathematics 2019-11-12 Hoai An Le Thi, Hoai Minh Le, Duy Nhat Phan, Bach Tran