Related papers: Decentralized Deep Learning with Arbitrary Communi…

PowerGossip: Practical Low-Rank Communication Compression in Decentralized Deep Learning

Lossy gradient compression has become a practical tool to overcome the communication bottleneck in centrally coordinated distributed training of machine learning models. However, algorithms for decentralized training with compressed…

Machine Learning · Computer Science 2020-10-20 Thijs Vogels , Sai Praneeth Karimireddy , Martin Jaggi

Low Precision Decentralized Distributed Training over IID and non-IID Data

Decentralized distributed learning is the key to enabling large-scale machine learning (training) on edge devices utilizing private user-generated local data, without relying on the cloud. However, the practical realization of such…

Machine Learning · Computer Science 2022-09-13 Sai Aparna Aketi , Sangamesh Kodge , Kaushik Roy

Communication Compression for Decentralized Training

Optimizing distributed learning systems is an art of balancing between computation and communication. There have been two lines of research that try to deal with slower networks: {\em communication compression} for low bandwidth networks,…

Machine Learning · Computer Science 2019-02-04 Hanlin Tang , Shaoduo Gan , Ce Zhang , Tong Zhang , Ji Liu

Taming Latency and Bandwidth: A Theoretical Framework and Adaptive Algorithm for Communication-Constrained Training

Regional energy caps limit the growth of any single data center used for large-scale model training. This single-center training paradigm works when model size remains manageable, but exponential growth in the model size and computational…

Machine Learning · Computer Science 2025-12-18 Rongwei Lu , Jingyan Jiang , Chunyang Li , Xingguang Wei , Zhi Wang

Adaptive Serverless Learning

With the emergence of distributed data, training machine learning models in the serverless manner has attracted increasing attention in recent years. Numerous training approaches have been proposed in this regime, such as decentralized SGD.…

Distributed, Parallel, and Cluster Computing · Computer Science 2020-08-25 Hongchang Gao , Heng Huang

DRACO: Decentralized Asynchronous Federated Learning over Row-Stochastic Wireless Networks

Recent developments and emerging use cases, such as smart Internet of Things (IoT) and Edge AI, have sparked considerable interest in the training of neural networks over fully decentralized (serverless) networks. One of the major…

Machine Learning · Computer Science 2025-01-30 Eunjeong Jeong , Marios Kountouris

Communication-Efficient Decentralized Learning with Sparsification and Adaptive Peer Selection

Distributed learning techniques such as federated learning have enabled multiple workers to train machine learning models together to reduce the overall training time. However, current distributed training algorithms (centralized or…

Machine Learning · Computer Science 2020-02-25 Zhenheng Tang , Shaohuai Shi , Xiaowen Chu

Communication-Efficient Distributionally Robust Decentralized Learning

Decentralized learning algorithms empower interconnected devices to share data and computational resources to collaboratively train a machine learning model without the aid of a central coordinator. In the case of heterogeneous data…

Machine Learning · Computer Science 2023-01-16 Matteo Zecchin , Marios Kountouris , David Gesbert

CoDeC: Communication-Efficient Decentralized Continual Learning

Training at the edge utilizes continuously evolving data generated at different locations. Privacy concerns prohibit the co-location of this spatially as well as temporally distributed data, deeming it crucial to design training algorithms…

Machine Learning · Computer Science 2023-03-28 Sakshi Choudhary , Sai Aparna Aketi , Gobinda Saha , Kaushik Roy

Quasi-Global Momentum: Accelerating Decentralized Deep Learning on Heterogeneous Data

Decentralized training of deep learning models is a key element for enabling data privacy and on-device learning over networks. In realistic learning scenarios, the presence of heterogeneity across different clients' local datasets poses an…

Machine Learning · Computer Science 2021-06-21 Tao Lin , Sai Praneeth Karimireddy , Sebastian U. Stich , Martin Jaggi

Communication-Efficient Learning of Deep Networks from Decentralized Data

Modern mobile devices have access to a wealth of data suitable for learning models, which in turn can greatly improve the user experience on the device. For example, language models can improve speech recognition and text entry, and image…

Machine Learning · Computer Science 2023-01-30 H. Brendan McMahan , Eider Moore , Daniel Ramage , Seth Hampson , Blaise Agüera y Arcas

Efficient Decentralized Deep Learning by Dynamic Model Averaging

We propose an efficient protocol for decentralized training of deep neural networks from distributed data sources. The proposed protocol allows to handle different phases of model training equally well and to quickly adapt to concept…

Machine Learning · Computer Science 2018-11-14 Michael Kamp , Linara Adilova , Joachim Sicking , Fabian Hüger , Peter Schlicht , Tim Wirtz , Stefan Wrobel

Protocol Models: Scaling Decentralized Training with Communication-Efficient Model Parallelism

Scaling models has led to significant advancements in deep learning, but training these models in decentralized settings remains challenging due to communication bottlenecks. While existing compression techniques are effective in…

Machine Learning · Computer Science 2025-06-03 Sameera Ramasinghe , Thalaiyasingam Ajanthan , Gil Avraham , Yan Zuo , Alexander Long

Decentralized Composite Optimization with Compression

Decentralized optimization and communication compression have exhibited their great potential in accelerating distributed machine learning by mitigating the communication bottleneck in practice. While existing decentralized algorithms with…

Machine Learning · Computer Science 2021-08-13 Yao Li , Xiaorui Liu , Jiliang Tang , Ming Yan , Kun Yuan

A Linearly Convergent Algorithm for Decentralized Optimization: Sending Less Bits for Free!

Decentralized optimization methods enable on-device training of machine learning models without a central coordinator. In many scenarios communication between devices is energy demanding and time consuming and forms the bottleneck of the…

Optimization and Control · Mathematics 2020-11-04 Dmitry Kovalev , Anastasia Koloskova , Martin Jaggi , Peter Richtarik , Sebastian U. Stich

AdaGossip: Adaptive Consensus Step-size for Decentralized Deep Learning with Communication Compression

Decentralized learning is crucial in supporting on-device learning over large distributed datasets, eliminating the need for a central server. However, the communication overhead remains a major bottleneck for the practical realization of…

Machine Learning · Computer Science 2024-04-10 Sai Aparna Aketi , Abolfazl Hashemi , Kaushik Roy

SADDLe: Sharpness-Aware Decentralized Deep Learning with Heterogeneous Data

Decentralized training enables learning with distributed datasets generated at different locations without relying on a central server. In realistic scenarios, the data distribution across these sparsely connected learning agents can be…

Machine Learning · Computer Science 2025-02-27 Sakshi Choudhary , Sai Aparna Aketi , Kaushik Roy

Enhancing Parallelism in Decentralized Stochastic Convex Optimization

Decentralized learning has emerged as a powerful approach for handling large datasets across multiple machines in a communication-efficient manner. However, such methods often face scalability limitations, as increasing the number of…

Machine Learning · Computer Science 2025-06-03 Ofri Eisen , Ron Dorfman , Kfir Y. Levy

SQuARM-SGD: Communication-Efficient Momentum SGD for Decentralized Optimization

In this paper, we propose and analyze SQuARM-SGD, a communication-efficient algorithm for decentralized training of large-scale machine learning models over a network. In SQuARM-SGD, each node performs a fixed number of local SGD steps…

Machine Learning · Computer Science 2021-10-12 Navjot Singh , Deepesh Data , Jemin George , Suhas Diggavi

$\texttt{DeepSqueeze}$: Decentralization Meets Error-Compensated Compression

Communication is a key bottleneck in distributed training. Recently, an \emph{error-compensated} compression technology was particularly designed for the \emph{centralized} learning and receives huge successes, by showing significant…

Distributed, Parallel, and Cluster Computing · Computer Science 2019-08-06 Hanlin Tang , Xiangru Lian , Shuang Qiu , Lei Yuan , Ce Zhang , Tong Zhang , Ji Liu