相关论文: Communication-Efficient Approximate Gradient Codin…

Approximate Gradient Coding for Distributed Learning with Heterogeneous Stragglers

In this paper, we propose an optimally structured gradient coding scheme to mitigate the straggler problem in distributed learning. Conventional gradient coding methods often assume homogeneous straggler models or rely on excessive data…

系统与控制 · 电气工程与系统科学 2025-10-28 Heekang Song , Wan Choi

Communication-Efficient Gradient Coding for Straggler Mitigation in Distributed Learning

Distributed implementations of gradient-based methods, wherein a server distributes gradient computations across worker machines, need to overcome two limitations: delays caused by slow running machines called 'stragglers', and…

信息论 · 计算机科学 2020-05-15 Swanand Kadhe , O. Ozan Koyluoglu , Kannan Ramchandran

Optimal Communication-Computation Trade-Off in Heterogeneous Gradient Coding

Gradient coding allows a master node to derive the aggregate of the partial gradients, calculated by some worker nodes over the local data sets, with minimum communication cost, and in the presence of stragglers. In this paper, for gradient…

信息论 · 计算机科学 2021-03-03 Tayyebeh Jahani-Nezhad , Mohammad Ali Maddah-Ali

Gradient Sparsification for Communication-Efficient Distributed Optimization

Modern large scale machine learning applications require stochastic optimization algorithms to be implemented on distributed computational architectures. A key bottleneck is the communication overhead for exchanging information such as…

机器学习 · 计算机科学 2017-10-31 Jianqiao Wangni , Jialei Wang , Ji Liu , Tong Zhang

Communication-Efficient Approximate Gradient Coding

Large-scale distributed learning aims at minimizing a loss function $L$ that depends on a training dataset with respect to a $d$-length parameter vector. The distributed cluster typically consists of a parameter server (PS) and multiple…

信息论 · 计算机科学 2026-03-25 Sifat Munim , Aditya Ramamoorthy

Communication-Computation Efficient Gradient Coding

This paper develops coding techniques to reduce the running time of distributed learning tasks. It characterizes the fundamental tradeoff to compute gradients (and more generally vector summations) in terms of three parameters: computation…

机器学习 · 统计学 2018-02-13 Min Ye , Emmanuel Abbe

Communication-efficient Variance-reduced Stochastic Gradient Descent

We consider the problem of communication efficient distributed optimization where multiple nodes exchange important algorithm information in every iteration to solve large problems. In particular, we focus on the stochastic variance-reduced…

机器学习 · 计算机科学 2020-03-16 Hossein S. Ghadikolaei , Sindri Magnusson

Approximate Gradient Coding with Optimal Decoding

In distributed optimization problems, a technique called gradient coding, which involves replicating data points, has been used to mitigate the effect of straggling machines. Recent work has studied approximate gradient coding, which…

机器学习 · 统计学 2021-08-09 Margalit Glasgow , Mary Wootters

Toward Communication Efficient Adaptive Gradient Method

In recent years, distributed optimization is proven to be an effective approach to accelerate training of large scale machine learning models such as deep neural networks. With the increasing computation power of GPUs, the bottleneck of…

机器学习 · 计算机科学 2021-09-14 Xiangyi Chen , Xiaoyun Li , Ping Li

Heterogeneity-aware Gradient Coding for Straggler Tolerance

Gradient descent algorithms are widely used in machine learning. In order to deal with huge volume of data, we consider the implementation of gradient descent algorithms in a distributed computing setting where multiple workers compute the…

分布式、并行与集群计算 · 计算机科学 2019-01-29 Haozhao Wang , Song Guo , Bin Tang , Ruixuan Li , Chengjie Li

Straggler-Robust Distributed Optimization with the Parameter Server Utilizing Coded Gradient

Optimization in distributed networks plays a central role in almost all distributed machine learning problems. In principle, the use of distributed task allocation has reduced the computational time, allowing better response rates and…

最优化与控制 · 数学 2020-07-28 Elie Atallah , Nazanin Rahnavard , Chinwendu Enyioha

Biased Compression in Gradient Coding for Distributed Learning

Communication bottlenecks and the presence of stragglers pose significant challenges in distributed learning (DL). To deal with these challenges, recent advances leverage unbiased compression functions and gradient coding. However, the…

分布式、并行与集群计算 · 计算机科学 2026-03-18 Chengxi Li , Ming Xiao , Mikael Skoglund

A Distributed Optimization Algorithm over Time-Varying Graphs with Efficient Gradient Evaluations

We propose an algorithm for distributed optimization over time-varying communication networks. Our algorithm uses an optimized ratio between the number of rounds of communication and gradient evaluations to achieve fast convergence. The…

最优化与控制 · 数学 2020-01-08 Bryan Van Scoy , Laurent Lessard

CodedReduce: A Fast and Robust Framework for Gradient Aggregation in Distributed Learning

We focus on the commonly used synchronous Gradient Descent paradigm for large-scale distributed learning, for which there has been a growing interest to develop efficient and robust gradient aggregation strategies that overcome two key…

机器学习 · 统计学 2021-09-30 Amirhossein Reisizadeh , Saurav Prakash , Ramtin Pedarsani , Amir Salman Avestimehr

Communication-Efficient Distributed Learning via Sparse and Adaptive Stochastic Gradient

Gradient-based optimization methods implemented on distributed computing architectures are increasingly used to tackle large-scale machine learning applications. A key bottleneck in such distributed systems is the high communication…

分布式、并行与集群计算 · 计算机科学 2024-06-11 Xiaoge Deng , Dongsheng Li , Tao Sun , Xicheng Lu

Approximate Gradient Coding via Sparse Random Graphs

Distributed algorithms are often beset by the straggler effect, where the slowest compute nodes in the system dictate the overall running time. Coding-theoretic techniques have been recently proposed to mitigate stragglers via algorithmic…

机器学习 · 统计学 2017-11-21 Zachary Charles , Dimitris Papailiopoulos , Jordan Ellenberg

Linearly Convergent Algorithm with Variance Reduction for Distributed Stochastic Optimization

This paper considers a distributed stochastic strongly convex optimization, where agents connected over a network aim to cooperatively minimize the average of all agents' local cost functions. Due to the stochasticity of gradient estimation…

最优化与控制 · 数学 2020-02-17 Jinlong Lei , Peng Yi , Jie Chen , Yiguang Hong

Robust and Communication-Efficient Collaborative Learning

We consider a decentralized learning problem, where a set of computing nodes aim at solving a non-convex optimization problem collaboratively. It is well-known that decentralized optimization schemes face two major system bottlenecks:…

机器学习 · 计算机科学 2019-11-04 Amirhossein Reisizadeh , Hossein Taheri , Aryan Mokhtari , Hamed Hassani , Ramtin Pedarsani

Meta-learning Optimizers for Communication-Efficient Learning

Communication-efficient variants of SGD, specifically local SGD, have received a great deal of interest in recent years. These approaches compute multiple gradient steps locally on each worker, before averaging model parameters, helping…

机器学习 · 计算机科学 2025-06-13 Charles-Étienne Joseph , Benjamin Thérien , Abhinav Moudgil , Boris Knyazev , Eugene Belilovsky

Communication-Efficient Algorithms For Distributed Optimization

This thesis is concerned with the design of distributed algorithms for solving optimization problems. We consider networks where each node has exclusive access to a cost function, and design algorithms that make all nodes cooperate to find…

最优化与控制 · 数学 2013-12-03 João F. C. Mota