中文
相关论文

相关论文: Communication-Efficient Approximate Gradient Codin…

200 篇论文

In this paper, we propose an optimally structured gradient coding scheme to mitigate the straggler problem in distributed learning. Conventional gradient coding methods often assume homogeneous straggler models or rely on excessive data…

系统与控制 · 电气工程与系统科学 2025-10-28 Heekang Song , Wan Choi

Distributed implementations of gradient-based methods, wherein a server distributes gradient computations across worker machines, need to overcome two limitations: delays caused by slow running machines called 'stragglers', and…

信息论 · 计算机科学 2020-05-15 Swanand Kadhe , O. Ozan Koyluoglu , Kannan Ramchandran

Gradient coding allows a master node to derive the aggregate of the partial gradients, calculated by some worker nodes over the local data sets, with minimum communication cost, and in the presence of stragglers. In this paper, for gradient…

信息论 · 计算机科学 2021-03-03 Tayyebeh Jahani-Nezhad , Mohammad Ali Maddah-Ali

Modern large scale machine learning applications require stochastic optimization algorithms to be implemented on distributed computational architectures. A key bottleneck is the communication overhead for exchanging information such as…

机器学习 · 计算机科学 2017-10-31 Jianqiao Wangni , Jialei Wang , Ji Liu , Tong Zhang

Large-scale distributed learning aims at minimizing a loss function $L$ that depends on a training dataset with respect to a $d$-length parameter vector. The distributed cluster typically consists of a parameter server (PS) and multiple…

信息论 · 计算机科学 2026-03-25 Sifat Munim , Aditya Ramamoorthy

This paper develops coding techniques to reduce the running time of distributed learning tasks. It characterizes the fundamental tradeoff to compute gradients (and more generally vector summations) in terms of three parameters: computation…

机器学习 · 统计学 2018-02-13 Min Ye , Emmanuel Abbe

We consider the problem of communication efficient distributed optimization where multiple nodes exchange important algorithm information in every iteration to solve large problems. In particular, we focus on the stochastic variance-reduced…

机器学习 · 计算机科学 2020-03-16 Hossein S. Ghadikolaei , Sindri Magnusson

In distributed optimization problems, a technique called gradient coding, which involves replicating data points, has been used to mitigate the effect of straggling machines. Recent work has studied approximate gradient coding, which…

机器学习 · 统计学 2021-08-09 Margalit Glasgow , Mary Wootters

In recent years, distributed optimization is proven to be an effective approach to accelerate training of large scale machine learning models such as deep neural networks. With the increasing computation power of GPUs, the bottleneck of…

机器学习 · 计算机科学 2021-09-14 Xiangyi Chen , Xiaoyun Li , Ping Li

Gradient descent algorithms are widely used in machine learning. In order to deal with huge volume of data, we consider the implementation of gradient descent algorithms in a distributed computing setting where multiple workers compute the…

分布式、并行与集群计算 · 计算机科学 2019-01-29 Haozhao Wang , Song Guo , Bin Tang , Ruixuan Li , Chengjie Li

Optimization in distributed networks plays a central role in almost all distributed machine learning problems. In principle, the use of distributed task allocation has reduced the computational time, allowing better response rates and…

最优化与控制 · 数学 2020-07-28 Elie Atallah , Nazanin Rahnavard , Chinwendu Enyioha

Communication bottlenecks and the presence of stragglers pose significant challenges in distributed learning (DL). To deal with these challenges, recent advances leverage unbiased compression functions and gradient coding. However, the…

分布式、并行与集群计算 · 计算机科学 2026-03-18 Chengxi Li , Ming Xiao , Mikael Skoglund

We propose an algorithm for distributed optimization over time-varying communication networks. Our algorithm uses an optimized ratio between the number of rounds of communication and gradient evaluations to achieve fast convergence. The…

最优化与控制 · 数学 2020-01-08 Bryan Van Scoy , Laurent Lessard

We focus on the commonly used synchronous Gradient Descent paradigm for large-scale distributed learning, for which there has been a growing interest to develop efficient and robust gradient aggregation strategies that overcome two key…

Gradient-based optimization methods implemented on distributed computing architectures are increasingly used to tackle large-scale machine learning applications. A key bottleneck in such distributed systems is the high communication…

分布式、并行与集群计算 · 计算机科学 2024-06-11 Xiaoge Deng , Dongsheng Li , Tao Sun , Xicheng Lu

Distributed algorithms are often beset by the straggler effect, where the slowest compute nodes in the system dictate the overall running time. Coding-theoretic techniques have been recently proposed to mitigate stragglers via algorithmic…

机器学习 · 统计学 2017-11-21 Zachary Charles , Dimitris Papailiopoulos , Jordan Ellenberg

This paper considers a distributed stochastic strongly convex optimization, where agents connected over a network aim to cooperatively minimize the average of all agents' local cost functions. Due to the stochasticity of gradient estimation…

最优化与控制 · 数学 2020-02-17 Jinlong Lei , Peng Yi , Jie Chen , Yiguang Hong

We consider a decentralized learning problem, where a set of computing nodes aim at solving a non-convex optimization problem collaboratively. It is well-known that decentralized optimization schemes face two major system bottlenecks:…

机器学习 · 计算机科学 2019-11-04 Amirhossein Reisizadeh , Hossein Taheri , Aryan Mokhtari , Hamed Hassani , Ramtin Pedarsani

Communication-efficient variants of SGD, specifically local SGD, have received a great deal of interest in recent years. These approaches compute multiple gradient steps locally on each worker, before averaging model parameters, helping…

机器学习 · 计算机科学 2025-06-13 Charles-Étienne Joseph , Benjamin Thérien , Abhinav Moudgil , Boris Knyazev , Eugene Belilovsky

This thesis is concerned with the design of distributed algorithms for solving optimization problems. We consider networks where each node has exclusive access to a cost function, and design algorithms that make all nodes cooperate to find…

最优化与控制 · 数学 2013-12-03 João F. C. Mota
‹ 上一页 1 2 3 10 下一页 ›