Related papers: Delayed Stochastic Algorithms for Distributed Weak…

Distributed stochastic optimization with large delays

One of the most widely used methods for solving large-scale stochastic optimization problems is distributed asynchronous stochastic gradient descent (DASGD), a family of algorithms that result from parallelizing stochastic gradient descent…

Optimization and Control · Mathematics 2021-07-08 Zhengyuan Zhou , Panayotis Mertikopoulos , Nicholas Bambos , Peter W. Glynn , Yinyu Ye

Sharper Convergence Guarantees for Asynchronous SGD for Distributed and Federated Learning

We study the asynchronous stochastic gradient descent algorithm for distributed training over $n$ workers which have varying computation and communication frequency over time. In this algorithm, workers compute stochastic gradients in…

Machine Learning · Computer Science 2022-06-17 Anastasia Koloskova , Sebastian U. Stich , Martin Jaggi

Asynchronous Stochastic Optimization Robust to Arbitrary Delays

We consider stochastic optimization with delayed gradients where, at each time step $t$, the algorithm makes an update using a stale stochastic gradient from step $t - d_t$ for some arbitrary delay $d_t$. This setting abstracts asynchronous…

Optimization and Control · Mathematics 2021-11-16 Alon Cohen , Amit Daniely , Yoel Drori , Tomer Koren , Mariano Schain

Distributed stochastic inertial-accelerated methods with delayed derivatives for nonconvex problems

Stochastic gradient methods (SGMs) are predominant approaches for solving stochastic optimization. On smooth nonconvex problems, a few acceleration techniques have been applied to improve the convergence rate of SGMs. However, little…

Optimization and Control · Mathematics 2021-12-24 Yangyang Xu , Yibo Xu , Yonggui Yan , Jie Chen

Distributed Delayed Stochastic Optimization

We analyze the convergence of gradient-based optimization algorithms that base their updates on delayed stochastic gradient information. The main application of our results is to the development of gradient-based distributed optimization…

Optimization and Control · Mathematics 2011-05-02 Alekh Agarwal , John C. Duchi

A Tight Convergence Analysis for Stochastic Gradient Descent with Delayed Updates

We provide tight finite-time convergence bounds for gradient descent and stochastic gradient descent on quadratic functions, when the gradients are delayed and reflect iterates from $\tau$ rounds ago. First, we show that without stochastic…

Optimization and Control · Mathematics 2018-06-28 Yossi Arjevani , Ohad Shamir , Nathan Srebro

AdaDelay: Delay Adaptive Distributed Stochastic Convex Optimization

We study distributed stochastic convex optimization under the delayed gradient model where the server nodes perform parameter updates, while the worker nodes compute stochastic gradients. We discuss, analyze, and experiment with a setup…

Machine Learning · Statistics 2015-08-21 Suvrit Sra , Adams Wei Yu , Mu Li , Alexander J. Smola

A Distributed Flexible Delay-tolerant Proximal Gradient Algorithm

We develop and analyze an asynchronous algorithm for distributed convex optimization when the objective writes a sum of smooth functions, local to each worker, and a non-smooth function. Unlike many existing methods, our distributed…

Optimization and Control · Mathematics 2019-12-13 Konstantin Mishchenko , Franck Iutzeler , Jérôme Malick

Fixed-Point Delayed Subgradient Methods for Nonsmooth Convex Optimization Problems

In this paper, we consider the nonsmooth convex optimization problems over the fixed point constraint sets of firmly nonexpansive operators. To find an optimal solution of the problem, we present an iterative method based on the hybrid…

Optimization and Control · Mathematics 2026-03-23 Ontima Pankoon , Nimit Nimana , Yeol Je Cho

Distributed Dual Subgradient Methods with Averaging and Applications to Grid Optimization

We study finite-time performance of a recently proposed distributed dual subgradient (DDSG) method for convex constrained multi-agent optimization problems. The algorithm enjoys performance guarantees on the last primal iterate, as opposed…

Optimization and Control · Mathematics 2023-07-28 Subhonmesh Bose , Hoa Dinh Nguyen , Haitian Liu , Ye Guo , Thinh T. Doan , Carolyn L. Beck

On Distributed Non-convex Optimization: Projected Subgradient Method For Weakly Convex Problems in Networks

The stochastic subgradient method is a widely-used algorithm for solving large-scale optimization problems arising in machine learning. Often these problems are neither smooth nor convex. Recently, Davis et al. [1-2] characterized the…

Optimization and Control · Mathematics 2021-02-25 Shixiang Chen , Alfredo Garcia , Shahin Shahrampour

Delayed Projection Techniques for Linearly Constrained Problems: Convergence Rates, Acceleration, and Applications

In this work, we study a novel class of projection-based algorithms for linearly constrained problems (LCPs) which have a lot of applications in statistics, optimization, and machine learning. Conventional primal gradient-based methods for…

Optimization and Control · Mathematics 2021-01-06 Xiang Li , Zhihua Zhang

Distributed Constrained Optimization with Delayed Subgradient Information over Time-Varying Network under Adaptive Quantization

In this paper, we consider a distributed constrained optimization problem with delayed subgradient information over the time-varying communication network, where each agent can only communicate with its neighbors and the communication…

Optimization and Control · Mathematics 2021-06-16 Jie Liu , Zhan Yu , Daniel W. C. Ho

Decentralized Stochastic Constrained Optimization via Prox-Linearization

This paper studies consensus-based decentralized stochastic optimization for minimizing possibly non-convex expected objectives with convex non-smooth regularizers and nonlinear functional inequality constraints. We reformulate the…

Optimization and Control · Mathematics 2026-01-29 Shivangi Dubey Sharma , Basil M. Idrees , Lavish Arora , Ketan Rajawat

Stochastic Subgradient Algorithms for Strongly Convex Optimization over Distributed Networks

We study diffusion and consensus based optimization of a sum of unknown convex objective functions over distributed networks. The only access to these functions is through stochastic gradient oracles, each of which is only available at a…

Numerical Analysis · Computer Science 2015-09-01 N. Denizcan Vanli , Muhammed O. Sayin , Suleyman S. Kozat

Linearly Convergent Algorithm with Variance Reduction for Distributed Stochastic Optimization

This paper considers a distributed stochastic strongly convex optimization, where agents connected over a network aim to cooperatively minimize the average of all agents' local cost functions. Due to the stochasticity of gradient estimation…

Optimization and Control · Mathematics 2020-02-17 Jinlong Lei , Peng Yi , Jie Chen , Yiguang Hong

Distributed Regularized Dual Gradient Algorithm for Constrained Convex Optimization over Time-Varying Directed Graphs

We investigate a distributed optimization problem over a cooperative multi-agent time-varying network, where each agent has its own decision variables that should be set so as to minimize its individual objective subject to local…

Optimization and Control · Mathematics 2018-05-24 Chuanye Gu , Zhiyou Wu , Jueyou Li

Asynchronous Distributed Optimization with Delay-free Parameters

Existing asynchronous distributed optimization algorithms often use diminishing step-sizes that cause slow practical convergence, or use fixed step-sizes that depend on and decrease with an upper bound of the delays. Not only are such delay…

Optimization and Control · Mathematics 2024-11-08 Xuyang Wu , Changxin Liu , Sindri Magnusson , Mikael Johansson

Scaling up Stochastic Gradient Descent for Non-convex Optimisation

Stochastic gradient descent (SGD) is a widely adopted iterative method for optimizing differentiable objective functions. In this paper, we propose and discuss a novel approach to scale up SGD in applications involving non-convex functions…

Machine Learning · Statistics 2022-10-07 Saad Mohamad , Hamad Alamri , Abdelhamid Bouchachia

Compressed Decentralized Proximal Stochastic Gradient Method for Nonconvex Composite Problems with Heterogeneous Data

We first propose a decentralized proximal stochastic gradient tracking method (DProxSGT) for nonconvex stochastic composite problems, with data heterogeneously distributed on multiple workers in a decentralized connected network. To save…

Optimization and Control · Mathematics 2023-03-01 Yonggui Yan , Jie Chen , Pin-Yu Chen , Xiaodong Cui , Songtao Lu , Yangyang Xu