Related papers: Distributed Deep Learning with Event-Triggered Com…

Distributed Stochastic Gradient Method for Non-Convex Problems with Applications in Supervised Learning

We develop a distributed stochastic gradient descent algorithm for solving non-convex optimization problems under the assumption that the local objective functions are twice continuously differentiable with Lipschitz continuous gradients…

Optimization and Control · Mathematics 2019-08-20 Jemin George , Tao Yang , He Bai , Prudhvi Gurram

EventGraD: Event-Triggered Communication in Parallel Machine Learning

Communication in parallel systems imposes significant overhead which often turns out to be a bottleneck in parallel machine learning. To relieve some of this overhead, in this paper, we present EventGraD - an algorithm with event-triggered…

Machine Learning · Computer Science 2021-12-10 Soumyadip Ghosh , Bernardo Aquino , Vijay Gupta

Leader Stochastic Gradient Descent for Distributed Training of Deep Learning Models: Extension

We consider distributed optimization under communication constraints for training deep learning models. We propose a new algorithm, whose parameter updates rely on two forces: a regular gradient step, and a corrective direction dictated by…

Machine Learning · Computer Science 2022-04-29 Yunfei Teng , Wenbo Gao , Francois Chalus , Anna Choromanska , Donald Goldfarb , Adrian Weller

Decentralized Asynchronous Non-convex Stochastic Optimization on Directed Graphs

Distributed Optimization is an increasingly important subject area with the rise of multi-agent control and optimization. We consider a decentralized stochastic optimization problem where the agents on a graph aim to asynchronously optimize…

Optimization and Control · Mathematics 2021-10-22 Vyacheslav Kungurtsev , Mahdi Morafah , Tara Javidi , Gesualdo Scutari

CoDGraD: A Code-based Distributed Gradient Descent Scheme for Decentralized Convex Optimization

In this paper, we consider a large network containing many regions such that each region is equipped with a worker with some data processing and communication capability. For such a network, some workers may become stragglers due to the…

Systems and Control · Electrical Eng. & Systems 2022-04-14 Elie Atallah , Nazanin Rahnavard , Qiyu Sun

Optimal Statistical Rates for Decentralised Non-Parametric Regression with Linear Speed-Up

We analyse the learning performance of Distributed Gradient Descent in the context of multi-agent decentralised non-parametric regression with the square loss function when i.i.d. samples are assigned to agents. We show that if agents hold…

Machine Learning · Statistics 2019-11-14 Dominic Richards , Patrick Rebeschini

Communication-Efficient Distributionally Robust Decentralized Learning

Decentralized learning algorithms empower interconnected devices to share data and computational resources to collaboratively train a machine learning model without the aid of a central coordinator. In the case of heterogeneous data…

Machine Learning · Computer Science 2023-01-16 Matteo Zecchin , Marios Kountouris , David Gesbert

Decentralized Stochastic Proximal Gradient Descent with Variance Reduction over Time-varying Networks

In decentralized learning, a network of nodes cooperate to minimize an overall objective function that is usually the finite-sum of their local objectives, and incorporates a non-smooth regularization term for the better generalization…

Machine Learning · Computer Science 2022-01-25 Xuanjie Li , Yuedong Xu , Jessie Hui Wang , Xin Wang , John C. S. Lui

Scaling up Stochastic Gradient Descent for Non-convex Optimisation

Stochastic gradient descent (SGD) is a widely adopted iterative method for optimizing differentiable objective functions. In this paper, we propose and discuss a novel approach to scale up SGD in applications involving non-convex functions…

Machine Learning · Statistics 2022-10-07 Saad Mohamad , Hamad Alamri , Abdelhamid Bouchachia

A Communication-Efficient Distributed Gradient Clipping Algorithm for Training Deep Neural Networks

In distributed training of deep neural networks, people usually run Stochastic Gradient Descent (SGD) or its variants on each machine and communicate with other machines periodically. However, SGD might converge slowly in training some deep…

Machine Learning · Computer Science 2022-10-14 Mingrui Liu , Zhenxun Zhuang , Yunwei Lei , Chunyang Liao

Event-Triggered Distributed Estimation With Decaying Communication Rate

We study distributed estimation of a high-dimensional static parameter vector through a group of sensors whose communication network is modeled by a fixed directed graph. Different from existing time-triggered communication schemes, an…

Systems and Control · Electrical Eng. & Systems 2021-08-10 Xingkang He , Yu Xing , Junfeng Wu , Karl H. Johansson

DSPG: Decentralized Simultaneous Perturbations Gradient Descent Scheme

Distributed descent-based methods are an essential toolset to solving optimization problems in multi-agent system scenarios. Here the agents seek to optimize a global objective function through mutual cooperation. Oftentimes, cooperation is…

Optimization and Control · Mathematics 2019-08-28 Arunselvan Ramaswamy

Decentralized Federated Learning with Gradient Tracking over Time-Varying Directed Networks

We investigate the problem of agent-to-agent interaction in decentralized (federated) learning over time-varying directed graphs, and, in doing so, propose a consensus-based algorithm called DSGTm-TV. The proposed algorithm incorporates…

Optimization and Control · Mathematics 2024-09-27 Duong Thuy Anh Nguyen , Su Wang , Duong Tung Nguyen , Angelia Nedich , H. Vincent Poor

Communication-Efficient Stochastic Distributed Learning

We address distributed learning problems, both nonconvex and convex, over undirected networks. In particular, we design a novel algorithm based on the distributed Alternating Direction Method of Multipliers (ADMM) to address the challenges…

Machine Learning · Computer Science 2026-03-23 Xiaoxing Ren , Nicola Bastianello , Karl H. Johansson , Thomas Parisini

Distributed State Estimation with Deep Neural Networks for Uncertain Nonlinear Systems under Event-Triggered Communication

Distributed state estimation is examined for a sensor network tasked with reconstructing a system's state through the use of a distributed and event-triggered observer. Each agent in the sensor network employs a deep neural network (DNN) to…

Systems and Control · Electrical Eng. & Systems 2022-02-07 Federico M. Zegers , Runhan Sun , Girish Chowdhary , Warren E. Dixon

On the Distributed Optimization over Directed Networks

In this paper, we propose a distributed algorithm, called Directed-Distributed Gradient Descent (D-DGD), to solve multi-agent optimization problems over directed graphs. Existing algorithms mostly deal with similar problems under the…

Optimization and Control · Mathematics 2016-02-02 Chenguang Xi , Qiong Wu , Usman A. Khan

Asynchronous Decentralized Learning over Unreliable Wireless Networks

Decentralized learning enables edge users to collaboratively train models by exchanging information via device-to-device communication, yet prior works have been limited to wireless networks with fixed topologies and reliable workers. In…

Information Theory · Computer Science 2022-02-03 Eunjeong Jeong , Matteo Zecchin , Marios Kountouris

Distributed Learning in Non-Convex Environments -- Part I: Agreement at a Linear Rate

Driven by the need to solve increasingly complex optimization problems in signal processing and machine learning, there has been increasing interest in understanding the behavior of gradient-descent algorithms in non-convex environments.…

Optimization and Control · Mathematics 2019-07-04 Stefan Vlaski , Ali H. Sayed

Distributed Mirror Descent over Directed Graphs

In this paper, we propose Distributed Mirror Descent (DMD) algorithm for constrained convex optimization problems on a (strongly-)connected multi-agent network. We assume that each agent has a private objective function and a constraint…

Optimization and Control · Mathematics 2015-04-28 Chenguang Xi , Qiong Wu , Usman A. Khan

Distributed Event-Based Learning via ADMM

We consider a distributed learning problem, where agents minimize a global objective function by exchanging information over a network. Our approach has two distinct features: (i) It substantially reduces communication by triggering…

Machine Learning · Computer Science 2025-11-20 Guner Dilsad Er , Sebastian Trimpe , Michael Muehlebach