Related papers: Consensus Control for Decentralized Deep Learning

Efficient Decentralized Deep Learning by Dynamic Model Averaging

We propose an efficient protocol for decentralized training of deep neural networks from distributed data sources. The proposed protocol makes it possible to handle different phases of model training equally well and to quickly adapt to concept…

Machine Learning · Computer Science 2018-11-14 Michael Kamp, Linara Adilova, Joachim Sicking, Fabian Hüger, Peter Schlicht, Tim Wirtz, Stefan Wrobel

Consensus Driven Learning

As the complexity of our neural network models grows, so too do the data and computation requirements for successful training. One proposed solution to this problem is training on a distributed network of computational devices, thus…

Machine Learning · Computer Science 2020-05-22 Kyle Crandall, Dustin Webb

Controlled disagreement improves generalization in decentralized training

Decentralized training is often regarded as inferior to centralized training because the consensus errors between workers are thought to undermine convergence and generalization, even with homogeneous data distributions. This work…

Machine Learning · Computer Science 2026-02-04 Zesen Wang, Mikael Johansson

Accelerating Optimization and Machine Learning through Decentralization

Decentralized optimization enables multiple devices to learn a global machine learning model while each individual device only has access to its local dataset. By avoiding the need for training data to leave individual users' devices, it…

Machine Learning · Computer Science 2026-04-22 Ziqin Chen, Zuang Wang, Yongqiang Wang

Communication-Efficient Learning of Deep Networks from Decentralized Data

Modern mobile devices have access to a wealth of data suitable for learning models, which in turn can greatly improve the user experience on the device. For example, language models can improve speech recognition and text entry, and image…

Machine Learning · Computer Science 2023-01-30 H. Brendan McMahan, Eider Moore, Daniel Ramage, Seth Hampson, Blaise Agüera y Arcas

On the Surprising Effectiveness of a Single Global Merging in Decentralized Learning

Decentralized learning provides a scalable alternative to parameter-server-based training, yet its performance is often hindered by limited peer-to-peer communication. In this paper, we study how communication should be scheduled over time,…

Machine Learning · Computer Science 2026-04-28 Tongtian Zhu, Tianyu Zhang, Mingze Wang, Zhanpeng Zhou, Can Wang

Quasi-Global Momentum: Accelerating Decentralized Deep Learning on Heterogeneous Data

Decentralized training of deep learning models is a key element for enabling data privacy and on-device learning over networks. In realistic learning scenarios, the presence of heterogeneity across different clients' local datasets poses an…

Machine Learning · Computer Science 2021-06-21 Tao Lin, Sai Praneeth Karimireddy, Sebastian U. Stich, Martin Jaggi

Analyzing the benefits of communication channels between deep learning models

As artificial intelligence systems spread to more diverse and larger tasks in many domains, the machine learning algorithms, and in particular the deep learning models and the databases required to train them are getting bigger themselves.…

Machine Learning · Computer Science 2019-04-22 Philippe Lacaille

Collaborative Deep Learning Across Multiple Data Centers

Valuable training data is often owned by independent organizations and located in multiple data centers. Most deep learning approaches require centralizing the multi-datacenter data for performance reasons. In practice, however, it is…

Machine Learning · Computer Science 2018-10-17 Kele Xu, Haibo Mi, Dawei Feng, Huaimin Wang, Chuan Chen, Zibin Zheng, Xu Lan

On Generalization of Decentralized Learning with Separable Data

Decentralized learning offers privacy and communication efficiency when data are naturally distributed among agents communicating over an underlying graph. Motivated by overparameterized learning settings, in which models are trained to…

Machine Learning · Computer Science 2023-03-28 Hossein Taheri, Christos Thrampoulidis

AdaGossip: Adaptive Consensus Step-size for Decentralized Deep Learning with Communication Compression

Decentralized learning is crucial in supporting on-device learning over large distributed datasets, eliminating the need for a central server. However, the communication overhead remains a major bottleneck for the practical realization of…

Machine Learning · Computer Science 2024-04-10 Sai Aparna Aketi, Abolfazl Hashemi, Kaushik Roy

Decentralized Deep Learning using Momentum-Accelerated Consensus

We consider the problem of decentralized deep learning where multiple agents collaborate to learn from a distributed dataset. While there exist several decentralized deep learning approaches, the majority consider a central parameter-server…

Machine Learning · Computer Science 2020-12-01 Aditya Balu, Zhanhong Jiang, Sin Yong Tan, Chinmay Hedge, Young M Lee, Soumik Sarkar

Exploring the Robustness of Decentralized Training for Large Language Models

Decentralized training of large language models has emerged as an effective way to democratize this technology. However, the potential threats associated with this approach have not been carefully discussed, which would hinder the…

Machine Learning · Computer Science 2023-12-05 Lin Lu, Chenxi Dai, Wangcheng Tao, Binhang Yuan, Yanan Sun, Pan Zhou

From promise to practice: realizing high-performance decentralized training

Decentralized training of deep neural networks has attracted significant attention for its theoretically superior scalability over synchronous data-parallel methods like All-Reduce. However, realizing this potential in multi-node training…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-10-17 Zesen Wang, Jiaojiao Zhang, Xuyang Wu, Mikael Johansson

Decentralized Consensus Algorithm with Delayed and Stochastic Gradients

We analyze the convergence of a decentralized consensus algorithm with delayed gradient information across the network. The nodes in the network privately hold parts of the objective function and collaboratively solve for the consensus…

Optimization and Control · Mathematics 2018-01-17 Benjamin Sirb, Xiaojing Ye

DecentLaM: Decentralized Momentum SGD for Large-batch Deep Training

The scale of deep learning nowadays calls for efficient distributed training algorithms. Decentralized momentum SGD (DmSGD), in which each node averages only with its neighbors, is more communication efficient than vanilla Parallel momentum…

Machine Learning · Computer Science 2021-04-27 Kun Yuan, Yiming Chen, Xinmeng Huang, Yingya Zhang, Pan Pan, Yinghui Xu, Wotao Yin

Decentralized Learning Made Practical with Client Sampling

Decentralized learning (DL) leverages edge devices for collaborative model training while avoiding coordination by a central server. Due to privacy concerns, DL has become an attractive alternative to centralized learning schemes since…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-05-08 Martijn de Vos, Akash Dhasade, Anne-Marie Kermarrec, Erick Lavoie, Johan Pouwelse, Rishi Sharma

On the Convergence of Decentralized Stochastic Gradient-Tracking with Finite-Time Consensus

Algorithms for decentralized optimization and learning rely on local optimization steps coupled with combination steps over a graph. Recent works have demonstrated that using a time-varying sequence of matrices that achieves finite-time…

Optimization and Control · Mathematics 2026-02-17 Aaron Fainman, Stefan Vlaski

Scaling Up Data Parallelism in Decentralized Deep Learning

Although it has been extensively explored in theory, decentralized learning is not yet green-lighted for production use, largely due to a lack of stability, scalability, and generality in large scale DNN training. To shed light on the…

Machine Learning · Computer Science 2025-09-17 Bing Xie, Junqi Yin, Zhenyu Zhou, Sarp Oral, Feiyi Wang

Linear Regression with Distributed Learning: A Generalization Error Perspective

Distributed learning provides an attractive framework for scaling the learning task by sharing the computational load over multiple nodes in a network. Here, we investigate the performance of distributed learning for large-scale linear…

Machine Learning · Statistics 2021-11-03 Martin Hellkvist, Ayça Özçelikkale, Anders Ahlén