English
Related papers

Related papers: Decentralized Deep Learning with Arbitrary Communi…

200 papers

Lossy gradient compression has become a practical tool to overcome the communication bottleneck in centrally coordinated distributed training of machine learning models. However, algorithms for decentralized training with compressed…

Machine Learning · Computer Science 2020-10-20 Thijs Vogels , Sai Praneeth Karimireddy , Martin Jaggi

Decentralized distributed learning is the key to enabling large-scale machine learning (training) on edge devices utilizing private user-generated local data, without relying on the cloud. However, the practical realization of such…

Machine Learning · Computer Science 2022-09-13 Sai Aparna Aketi , Sangamesh Kodge , Kaushik Roy

Optimizing distributed learning systems is an art of balancing between computation and communication. There have been two lines of research that try to deal with slower networks: {\em communication compression} for low bandwidth networks,…

Machine Learning · Computer Science 2019-02-04 Hanlin Tang , Shaoduo Gan , Ce Zhang , Tong Zhang , Ji Liu

Regional energy caps limit the growth of any single data center used for large-scale model training. This single-center training paradigm works when model size remains manageable, but exponential growth in the model size and computational…

Machine Learning · Computer Science 2025-12-18 Rongwei Lu , Jingyan Jiang , Chunyang Li , Xingguang Wei , Zhi Wang

With the emergence of distributed data, training machine learning models in the serverless manner has attracted increasing attention in recent years. Numerous training approaches have been proposed in this regime, such as decentralized SGD.…

Distributed, Parallel, and Cluster Computing · Computer Science 2020-08-25 Hongchang Gao , Heng Huang

Recent developments and emerging use cases, such as smart Internet of Things (IoT) and Edge AI, have sparked considerable interest in the training of neural networks over fully decentralized (serverless) networks. One of the major…

Machine Learning · Computer Science 2025-01-30 Eunjeong Jeong , Marios Kountouris

Distributed learning techniques such as federated learning have enabled multiple workers to train machine learning models together to reduce the overall training time. However, current distributed training algorithms (centralized or…

Machine Learning · Computer Science 2020-02-25 Zhenheng Tang , Shaohuai Shi , Xiaowen Chu

Decentralized learning algorithms empower interconnected devices to share data and computational resources to collaboratively train a machine learning model without the aid of a central coordinator. In the case of heterogeneous data…

Machine Learning · Computer Science 2023-01-16 Matteo Zecchin , Marios Kountouris , David Gesbert

Training at the edge utilizes continuously evolving data generated at different locations. Privacy concerns prohibit the co-location of this spatially as well as temporally distributed data, deeming it crucial to design training algorithms…

Machine Learning · Computer Science 2023-03-28 Sakshi Choudhary , Sai Aparna Aketi , Gobinda Saha , Kaushik Roy

Decentralized training of deep learning models is a key element for enabling data privacy and on-device learning over networks. In realistic learning scenarios, the presence of heterogeneity across different clients' local datasets poses an…

Machine Learning · Computer Science 2021-06-21 Tao Lin , Sai Praneeth Karimireddy , Sebastian U. Stich , Martin Jaggi

Modern mobile devices have access to a wealth of data suitable for learning models, which in turn can greatly improve the user experience on the device. For example, language models can improve speech recognition and text entry, and image…

Machine Learning · Computer Science 2023-01-30 H. Brendan McMahan , Eider Moore , Daniel Ramage , Seth Hampson , Blaise Agüera y Arcas

We propose an efficient protocol for decentralized training of deep neural networks from distributed data sources. The proposed protocol allows to handle different phases of model training equally well and to quickly adapt to concept…

Machine Learning · Computer Science 2018-11-14 Michael Kamp , Linara Adilova , Joachim Sicking , Fabian Hüger , Peter Schlicht , Tim Wirtz , Stefan Wrobel

Scaling models has led to significant advancements in deep learning, but training these models in decentralized settings remains challenging due to communication bottlenecks. While existing compression techniques are effective in…

Machine Learning · Computer Science 2025-06-03 Sameera Ramasinghe , Thalaiyasingam Ajanthan , Gil Avraham , Yan Zuo , Alexander Long

Decentralized optimization and communication compression have exhibited their great potential in accelerating distributed machine learning by mitigating the communication bottleneck in practice. While existing decentralized algorithms with…

Machine Learning · Computer Science 2021-08-13 Yao Li , Xiaorui Liu , Jiliang Tang , Ming Yan , Kun Yuan

Decentralized optimization methods enable on-device training of machine learning models without a central coordinator. In many scenarios communication between devices is energy demanding and time consuming and forms the bottleneck of the…

Optimization and Control · Mathematics 2020-11-04 Dmitry Kovalev , Anastasia Koloskova , Martin Jaggi , Peter Richtarik , Sebastian U. Stich

Decentralized learning is crucial in supporting on-device learning over large distributed datasets, eliminating the need for a central server. However, the communication overhead remains a major bottleneck for the practical realization of…

Machine Learning · Computer Science 2024-04-10 Sai Aparna Aketi , Abolfazl Hashemi , Kaushik Roy

Decentralized training enables learning with distributed datasets generated at different locations without relying on a central server. In realistic scenarios, the data distribution across these sparsely connected learning agents can be…

Machine Learning · Computer Science 2025-02-27 Sakshi Choudhary , Sai Aparna Aketi , Kaushik Roy

Decentralized learning has emerged as a powerful approach for handling large datasets across multiple machines in a communication-efficient manner. However, such methods often face scalability limitations, as increasing the number of…

Machine Learning · Computer Science 2025-06-03 Ofri Eisen , Ron Dorfman , Kfir Y. Levy

In this paper, we propose and analyze SQuARM-SGD, a communication-efficient algorithm for decentralized training of large-scale machine learning models over a network. In SQuARM-SGD, each node performs a fixed number of local SGD steps…

Machine Learning · Computer Science 2021-10-12 Navjot Singh , Deepesh Data , Jemin George , Suhas Diggavi

Communication is a key bottleneck in distributed training. Recently, an \emph{error-compensated} compression technology was particularly designed for the \emph{centralized} learning and receives huge successes, by showing significant…

Distributed, Parallel, and Cluster Computing · Computer Science 2019-08-06 Hanlin Tang , Xiangru Lian , Shuang Qiu , Lei Yuan , Ce Zhang , Tong Zhang , Ji Liu
‹ Prev 1 2 3 10 Next ›