Related papers: Nested Distributed Gradient Methods with Adaptive …

On the Convergence of Nested Decentralized Gradient Methods with Multiple Consensus and Gradient Steps

In this paper, we consider minimizing a sum of local convex objective functions in a distributed setting, where the cost of communication and/or computation can be expensive. We extend and generalize the analysis for a class of nested…

Optimization and Control · Mathematics 2021-09-01 Albert S. Berahas, Raghu Bollapragada, Ermin Wei

Nested Distributed Gradient Methods with Stochastic Computation Errors

In this work, we consider the problem of a network of agents collectively minimizing a sum of convex functions. The agents in our setting can only access their local objective functions and exchange information with their immediate…

Optimization and Control · Mathematics 2019-10-01 Charikleia Iakovidou, Ermin Wei

DQ-SGD: Dynamic Quantization in SGD for Communication-Efficient Distributed Learning

Gradient quantization is an emerging technique in reducing communication costs in distributed learning. Existing gradient quantization algorithms often rely on engineering heuristics or empirical observations, lacking a systematic approach…

Machine Learning · Computer Science 2021-08-02 Guangfeng Yan, Shao-Lun Huang, Tian Lan, Linqi Song

Fast Convergence Rates of Distributed Subgradient Methods with Adaptive Quantization

We study distributed optimization problems over a network when the communication between the nodes is constrained, and so information that is exchanged between the nodes must be quantized. Recent advances using the distributed gradient…

Optimization and Control · Mathematics 2019-05-14 Thinh T. Doan, Siva Theja Maguluri, Justin Romberg

Balancing Communication and Computation in Distributed Optimization

Methods for distributed optimization have received significant attention in recent years owing to their wide applicability in various domains. A distributed optimization method typically consists of two key components: communication and…

Optimization and Control · Mathematics 2018-06-04 Albert S. Berahas, Raghu Bollapragada, Nitish Shirish Keskar, Ermin Wei

S-NEAR-DGD: A Flexible Distributed Stochastic Gradient Method for Inexact Communication

We present and analyze a stochastic distributed method (S-NEAR-DGD) that can tolerate inexact computation and inaccurate information exchange to alleviate the problems of costly gradient evaluations and bandwidth-limited communication in…

Optimization and Control · Mathematics 2021-02-02 Charikleia Iakovidou, Ermin Wei

Error Compensated Quantized SGD and its Applications to Large-scale Distributed Optimization

Large-scale distributed optimization is of great importance in various applications. For data-parallel based distributed learning, the inter-node gradient communication often becomes the performance bottleneck. In this paper, we propose the…

Computer Vision and Pattern Recognition · Computer Science 2018-06-22 Jiaxiang Wu, Weidong Huang, Junzhou Huang, Tong Zhang

Quantized Distributed Gradient Tracking Algorithm with Linear Convergence in Directed Networks

Communication efficiency is a major bottleneck in the applications of distributed networks. To address this, quantized distributed optimization has attracted a lot of attention. However, most of the existing quantized…

Optimization and Control · Mathematics 2022-11-01 Yongyang Xiong, Ligang Wu, Keyou You, Lihua Xie

Practical Newton-Type Distributed Learning using Gradient Based Approximations

We study distributed algorithms for expected loss minimization where the datasets are large and have to be stored on different machines. Often we deal with minimizing the average of a set of convex functions where each function is the…

Machine Learning · Computer Science 2019-07-24 Samira Sheikhi

Communication-Efficient Distributed Learning via Lazily Aggregated Quantized Gradients

The present paper develops a novel aggregated gradient approach for distributed machine learning that adaptively compresses the gradient communication. The key idea is to first quantize the computed gradients, and then skip less informative…

Machine Learning · Computer Science 2019-09-18 Jun Sun, Tianyi Chen, Georgios B. Giannakis, Zaiyue Yang

Nested Dithered Quantization for Communication Reduction in Distributed Training

In distributed training, the communication cost due to the transmission of gradients or the parameters of the deep model is a major bottleneck in scaling up the number of processing nodes. To address this issue, we propose dithered…

Distributed, Parallel, and Cluster Computing · Computer Science 2019-04-03 Afshin Abdi, Faramarz Fekri

Quantized Adam with Error Feedback

In this paper, we present a distributed variant of adaptive stochastic gradient method for training deep neural networks in the parameter-server model. To reduce the communication cost among the workers and server, we incorporate two types…

Machine Learning · Computer Science 2021-06-16 Congliang Chen, Li Shen, Haozhi Huang, Wei Liu

Convergence results of a nested decentralized gradient method for non-strongly convex problems

We are concerned with the convergence of NEAR-DGD$^+$ (Nested Exact Alternating Recursion Distributed Gradient Descent) method introduced to solve the distributed optimization problems. Under the assumption of the strong convexity of local…

Optimization and Control · Mathematics 2022-06-28 Woocheol Choi, Doheon Kim, Seok-Bae Yun

A Fast Distributed Proximal-Gradient Method

We present a distributed proximal-gradient method for optimizing the average of convex functions, each of which is the private local objective of an agent in a network with time-varying topology. The local objectives have distinct…

Distributed, Parallel, and Cluster Computing · Computer Science 2012-10-09 Annie I. Chen, Asuman Ozdaglar

QSGD: Communication-Efficient SGD via Gradient Quantization and Encoding

Parallel implementations of stochastic gradient descent (SGD) have received significant research attention, thanks to excellent scalability properties of this algorithm, and to its efficiency in the context of training deep neural networks.…

Machine Learning · Computer Science 2017-12-07 Dan Alistarh, Demjan Grubic, Jerry Li, Ryota Tomioka, Milan Vojnovic

Convergence Rates of Distributed Nesterov-like Gradient Methods on Random Networks

We consider distributed optimization in random networks where $N$ nodes cooperatively minimize the sum $\sum_{i=1}^N f_i(x)$ of their individual convex costs. Existing literature proposes distributed gradient-like methods that are…

Information Theory · Computer Science 2023-07-19 Dusan Jakovetic, Joao Xavier, Jose M. F. Moura

Convergence of Limited Communications Gradient Methods

Distributed optimization increasingly plays a central role in economical and sustainable operation of cyber-physical systems. Nevertheless, the complete potential of the technology has not yet been fully exploited in practice due to…

Optimization and Control · Mathematics 2017-10-24 Sindri Magnusson, Chinwendu Enyioha, Na Li, Carlo Fischione, Vahid Tarokh

Distributed Optimization via Gradient Descent with Event-Triggered Zooming over Quantized Communication

In this paper, we study unconstrained, strongly convex distributed optimization problems, in which the exchange of information in the network is captured by a directed graph topology over digital channels that have limited capacity (and…

Systems and Control · Electrical Eng. & Systems 2023-09-12 Apostolos I. Rikos, Wei Jiang, Themistoklis Charalambous, Karl H. Johansson

A Distributed Training Algorithm of Generative Adversarial Networks with Quantized Gradients

Training generative adversarial networks (GANs) in a distributed fashion is a promising technology, since it helps train GANs efficiently on massive amounts of data in real-world applications. However, GANs are known to be difficult to…

Machine Learning · Computer Science 2020-10-27 Xiaojun Chen, Shu Yang, Li Shen, Xuanrong Pang

Communication-Compressed Adaptive Gradient Method for Distributed Nonconvex Optimization

Due to the explosion in the size of the training datasets, distributed learning has received growing interest in recent years. One of the major bottlenecks is the large communication cost between the central server and the local workers.…

Machine Learning · Computer Science 2022-02-25 Yujia Wang, Lu Lin, Jinghui Chen