Related papers: Scale-Robust Timely Asynchronous Decentralized Lea…

Finite-Time Consensus Learning for Decentralized Optimization with Nonlinear Gossiping

Distributed learning has become an integral tool for scaling up machine learning and addressing the growing need for data privacy. Although more robust to the network topology, decentralized learning schemes have not gained the same level…

Machine Learning · Computer Science 2021-11-16 Junya Chen, Sijia Wang, Lawrence Carin, Chenyang Tao

Fully Decentralized Joint Learning of Personalized Models and Collaboration Graphs

We consider the fully decentralized machine learning scenario where many users with personal datasets collaborate to learn models through local peer-to-peer exchanges, without a central coordinator. We propose to train personalized models…

Machine Learning · Computer Science 2024-12-20 Valentina Zantedeschi, Aurélien Bellet, Marc Tommasi

Decentralized Collaborative Learning of Personalized Models over Networks

We consider a set of learning agents in a collaborative peer-to-peer network, where each agent learns a personalized model according to its own learning objective. The question addressed in this paper is: how can agents improve upon their…

Machine Learning · Computer Science 2019-01-25 Paul Vanhaesebrouck, Aurélien Bellet, Marc Tommasi

Accelerating Optimization and Machine Learning through Decentralization

Decentralized optimization enables multiple devices to learn a global machine learning model while each individual device only has access to its local dataset. By avoiding the need for training data to leave individual users' devices, it…

Machine Learning · Computer Science 2026-04-22 Ziqin Chen, Zuang Wang, Yongqiang Wang

How to scale distributed deep learning?

Training time on large datasets for deep neural networks is the principal workflow bottleneck in a number of important applications of deep learning, such as object classification and detection in automatic driver assistance systems (ADAS).…

Machine Learning · Computer Science 2016-11-15 Peter H. Jin, Qiaochu Yuan, Forrest Iandola, Kurt Keutzer

On the Surprising Effectiveness of a Single Global Merging in Decentralized Learning

Decentralized learning provides a scalable alternative to parameter-server-based training, yet its performance is often hindered by limited peer-to-peer communication. In this paper, we study how communication should be scheduled over time,…

Machine Learning · Computer Science 2026-04-28 Tongtian Zhu, Tianyu Zhang, Mingze Wang, Zhanpeng Zhou, Can Wang

Delay-adaptive step-sizes for asynchronous learning

In scalable machine learning systems, model training is often parallelized over multiple nodes that run without tight synchronization. Most analysis results for the related asynchronous algorithms use an upper bound on the information…

Machine Learning · Computer Science 2022-04-12 Xuyang Wu, Sindri Magnusson, Hamid Reza Feyzmahdavian, Mikael Johansson

Asynchronous Decentralized Learning over Unreliable Wireless Networks

Decentralized learning enables edge users to collaboratively train models by exchanging information via device-to-device communication, yet prior works have been limited to wireless networks with fixed topologies and reliable workers. In…

Information Theory · Computer Science 2022-02-03 Eunjeong Jeong, Matteo Zecchin, Marios Kountouris

Decentralized Learning Made Practical with Client Sampling

Decentralized learning (DL) leverages edge devices for collaborative model training while avoiding coordination by a central server. Due to privacy concerns, DL has become an attractive alternative to centralized learning schemes since…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-05-08 Martijn de Vos, Akash Dhasade, Anne-Marie Kermarrec, Erick Lavoie, Johan Pouwelse, Rishi Sharma

DIGEST: Fast and Communication Efficient Decentralized Learning with Local Updates

Two widely considered decentralized learning algorithms are Gossip and random-walk-based learning. Gossip algorithms (both synchronous and asynchronous versions) suffer from high communication cost, while random-walk-based learning…

Machine Learning · Computer Science 2024-05-14 Peyman Gholami, Hulya Seferoglu

DRACO: Decentralized Asynchronous Federated Learning over Row-Stochastic Wireless Networks

Recent developments and emerging use cases, such as smart Internet of Things (IoT) and Edge AI, have sparked considerable interest in the training of neural networks over fully decentralized (serverless) networks. One of the major…

Machine Learning · Computer Science 2025-01-30 Eunjeong Jeong, Marios Kountouris

Graph-based Gossiping for Communication Efficiency in Decentralized Federated Learning

Federated learning has emerged as a privacy-preserving technique for collaborative model training across heterogeneously distributed silos. Yet, its reliance on a single central server introduces potential bottlenecks and risks of…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-06-13 Huong Nguyen, Hong-Tri Nguyen, Praveen Kumar Donta, Susanna Pirttikangas, Lauri Lovén

Scaling Up Data Parallelism in Decentralized Deep Learning

Although it has been extensively explored in theory, decentralized learning is not yet green-lighted for production use, largely due to a lack of stability, scalability, and generality in large scale DNN training. To shed light on the…

Machine Learning · Computer Science 2025-09-17 Bing Xie, Junqi Yin, Zhenyu Zhou, Sarp Oral, Feiyi Wang

Ravnest: Decentralized Asynchronous Training on Heterogeneous Devices

Modern deep learning models, growing larger and more complex, have demonstrated exceptional generalization and accuracy due to training on huge datasets. This trend is expected to continue. However, the increasing size of these models poses…

Machine Learning · Computer Science 2024-05-24 Anirudh Rajiv Menon, Unnikrishnan Menon, Kailash Ahirwar

Enhancing Parallelism in Decentralized Stochastic Convex Optimization

Decentralized learning has emerged as a powerful approach for handling large datasets across multiple machines in a communication-efficient manner. However, such methods often face scalability limitations, as increasing the number of…

Machine Learning · Computer Science 2025-06-03 Ofri Eisen, Ron Dorfman, Kfir Y. Levy

Faster Asynchronous SGD

Asynchronous distributed stochastic gradient descent methods have trouble converging because of stale gradients. A gradient update sent to a parameter server by a client is stale if the parameters used to calculate that gradient have since…

Machine Learning · Statistics 2016-01-18 Augustus Odena

Consensus Driven Learning

As the complexity of our neural network models grows, so too do the data and computation requirements for successful training. One proposed solution to this problem is training on a distributed network of computational devices, thus…

Machine Learning · Computer Science 2020-05-22 Kyle Crandall, Dustin Webb

Asynchronous Decentralized SGD with Quantized and Local Updates

Decentralized optimization is emerging as a viable alternative for scalable distributed machine learning, but also introduces new challenges in terms of synchronization costs. To this end, several communication-reduction techniques, such as…

Machine Learning · Computer Science 2022-03-28 Giorgi Nadiradze, Amirmojtaba Sabour, Peter Davies, Shigang Li, Dan Alistarh

Gossip Learning with Linear Models on Fully Distributed Data

Machine learning over fully distributed data poses an important problem in peer-to-peer (P2P) applications. In this model we have one data record at each network node, but without the possibility to move raw data due to privacy…

Machine Learning · Computer Science 2012-06-07 Róbert Ormándi, István Hegedüs, Márk Jelasity

Decentralized Learning in Online Queuing Systems

Motivated by packet routing in computer networks, online queuing systems are composed of queues receiving packets at different rates. Repeatedly, they send packets to servers, each of them treating only at most one packet at a time. In the…

Machine Learning · Statistics 2021-11-05 Flore Sentenac, Etienne Boursier, Vianney Perchet