Related papers: A Primal-Dual SGD Algorithm for Distributed Noncon…

Accelerated Primal-Dual Algorithm for Distributed Non-convex Optimization

This paper investigates accelerating the convergence of distributed optimization algorithms on non-convex problems. We propose a distributed primal-dual stochastic gradient descent (SGD) algorithm equipped with the "powerball" method to accelerate convergence. We…

Optimization and Control · Mathematics 2021-10-15 Shengjun Zhang, Colleen P. Bailey

A Communication-Efficient Stochastic Gradient Descent Algorithm for Distributed Nonconvex Optimization

This paper studies distributed nonconvex optimization problems with stochastic gradients for a multi-agent system, in which each agent aims to minimize the sum of all agents' cost functions by using local compressed information exchange. We…

Optimization and Control · Mathematics 2024-03-05 Antai Xie, Xinlei Yi, Xiaofan Wang, Ming Cao, Xiaoqiang Ren

Linear Convergence of First- and Zeroth-Order Primal-Dual Algorithms for Distributed Nonconvex Optimization

This paper considers the distributed nonconvex optimization problem of minimizing a global cost function formed by a sum of local cost functions by using local information exchange. We first consider a distributed first-order primal-dual…

Optimization and Control · Mathematics 2021-08-26 Xinlei Yi, Shengjun Zhang, Tao Yang, Tianyou Chai, Karl H. Johansson

Scaling up Stochastic Gradient Descent for Non-convex Optimisation

Stochastic gradient descent (SGD) is a widely adopted iterative method for optimizing differentiable objective functions. In this paper, we propose and discuss a novel approach to scale up SGD in applications involving non-convex functions…

Machine Learning · Statistics 2022-10-07 Saad Mohamad, Hamad Alamri, Abdelhamid Bouchachia

Quantized Distributed Nonconvex Optimization Algorithms with Linear Convergence under the Polyak-Łojasiewicz Condition

This paper considers distributed optimization for minimizing the average of local nonconvex cost functions, by using local information exchange over undirected communication networks. To reduce the required communication capacity, we…

Optimization and Control · Mathematics 2025-03-03 Lei Xu, Xinlei Yi, Jiayue Sun, Yang Shi, Karl H. Johansson, Tao Yang

Convergence Analysis of Nonconvex Distributed Stochastic Zeroth-order Coordinate Method

This paper investigates the stochastic distributed nonconvex optimization problem of minimizing a global cost function formed by the summation of $n$ local cost functions. We solve such a problem by involving zeroth-order (ZO) information…

Optimization and Control · Mathematics 2021-10-15 Shengjun Zhang, Yunlong Dong, Dong Xie, Lisha Yao, Colleen P. Bailey, Shengli Fu

Asynchronous Distributed Optimization via Randomized Dual Proximal Gradient

In this paper we consider distributed optimization problems in which the cost function is separable, i.e., a sum of possibly non-smooth functions all sharing a common variable, and can be split into a strongly convex term and a convex one.…

Systems and Control · Computer Science 2016-06-27 Ivano Notarnicola, Giuseppe Notarstefano

Better Theory for SGD in the Nonconvex World

Large-scale nonconvex optimization problems are ubiquitous in modern machine learning, and among practitioners interested in solving them, Stochastic Gradient Descent (SGD) reigns supreme. We revisit the analysis of SGD in the nonconvex…

Optimization and Control · Mathematics 2020-07-27 Ahmed Khaled, Peter Richtárik

A Smooth Double Proximal Primal-Dual Algorithm for a Class of Distributed Nonsmooth Optimization Problem

This technical note studies a class of distributed nonsmooth convex consensus optimization problems. The cost function is a summation of local cost functions, which are convex but nonsmooth. Each of the local cost functions consists of a…

Optimization and Control · Mathematics 2018-08-17 Yue Wei, Hao Fang, Xianlin Zeng, Jie Chen, Panos M. Pardalos

Distributed Gradient Methods for Nonconvex Optimization: Local and Global Convergence Guarantees

The article discusses distributed gradient-descent algorithms for computing local and global minima in nonconvex optimization. For local optimization, we focus on distributed stochastic gradient descent (D-SGD)--a simple network-based…

Optimization and Control · Mathematics 2020-09-17 Brian Swenson, Soummya Kar, H. Vincent Poor, José M. F. Moura, Aaron Jaech

Zeroth-Order Algorithms for Stochastic Distributed Nonconvex Optimization

In this paper, we consider a stochastic distributed nonconvex optimization problem with the cost function being distributed over $n$ agents having access only to zeroth-order (ZO) information of the cost. This problem has various machine…

Optimization and Control · Mathematics 2022-01-11 Xinlei Yi, Shengjun Zhang, Tao Yang, Karl H. Johansson

Local Stochastic Gradient Descent Ascent: Convergence Analysis and Communication Efficiency

Local SGD is a promising approach to overcome the communication overhead in distributed learning by reducing the synchronization frequency among worker nodes. Despite the recent theoretical advances of local SGD in empirical risk…

Machine Learning · Computer Science 2021-03-01 Yuyang Deng, Mehrdad Mahdavi

Distributed Optimization with Coupled Constraints over Time-Varying Digraph

In this paper, we develop a distributed algorithm for solving a class of distributed convex optimization problems where the local objective functions can be a general nonsmooth function, and all equalities and inequalities are network-wide…

Optimization and Control · Mathematics 2026-04-14 Yeong-Ung Kim, Hyo-Sung Ahn

Distributed Asynchronous Dual Free Stochastic Dual Coordinate Ascent

The primal-dual distributed optimization methods have broad large-scale machine learning applications. Previous primal-dual distributed methods are not applicable when the dual formulation is not available, e.g. the sum-of-non-convex…

Machine Learning · Computer Science 2017-10-30 Zhouyuan Huo, Heng Huang

Compressed Distributed Stochastic Nonconvex Optimization with Differential Privacy

This paper studies distributed stochastic nonconvex optimization problems with compressed communication and differential privacy, in which each agent aims to minimize the sum of all agents' cost functions by using local compressed…

Optimization and Control · Mathematics 2026-03-24 Antai Xie, Xiaoqiang Ren, Xinlei Yi, Tao Yang, Xiaofan Wang

Stochastic Subgradient Algorithms for Strongly Convex Optimization over Distributed Networks

We study diffusion and consensus based optimization of a sum of unknown convex objective functions over distributed networks. The only access to these functions is through stochastic gradient oracles, each of which is only available at a…

Numerical Analysis · Computer Science 2015-09-01 N. Denizcan Vanli, Muhammed O. Sayin, Suleyman S. Kozat

signProx: One-Bit Proximal Algorithm for Nonconvex Stochastic Optimization

Stochastic gradient descent (SGD) is one of the most widely used optimization methods for parallel and distributed processing of large datasets. One of the key limitations of distributed SGD is the need to regularly communicate the…

Optimization and Control · Mathematics 2018-10-25 Xiaojian Xu, Ulugbek S. Kamilov

A Hybrid-Order Distributed SGD Method for Non-Convex Optimization to Balance Communication Overhead, Computational Complexity, and Convergence Rate

In this paper, we propose a method of distributed stochastic gradient descent (SGD), with low communication load and computational complexity, and still fast convergence. To reduce the communication load, at each iteration of the algorithm,…

Machine Learning · Computer Science 2020-03-30 Naeimeh Omidvar, Mohammad Ali Maddah-Ali, Hamed Mahdavi

Tackling benign nonconvexity with smoothing and stochastic gradients

Non-convex optimization problems are ubiquitous in machine learning, especially in Deep Learning. While such complex problems can often be successfully optimized in practice by using stochastic gradient descent (SGD), theoretical analysis…

Machine Learning · Computer Science 2022-02-21 Harsh Vardhan, Sebastian U. Stich

Local SGD with Periodic Averaging: Tighter Analysis and Adaptive Synchronization

Communication overhead is one of the key challenges that hinders the scalability of distributed optimization algorithms. In this paper, we study local distributed SGD, where data is partitioned among computation nodes, and the computation…

Machine Learning · Computer Science 2020-05-15 Farzin Haddadpour, Mohammad Mahdi Kamani, Mehrdad Mahdavi, Viveck R. Cadambe