Related papers: Accelerated Distributed Optimization with Compress…

Towards Faster Decentralized Stochastic Optimization with Communication Compression

Communication efficiency has garnered significant attention as it is considered the main bottleneck for large-scale decentralized Machine Learning applications in distributed and federated settings. In this regime, clients are restricted to…

Machine Learning · Computer Science 2024-11-26 Rustem Islamov, Yuan Gao, Sebastian U. Stich

A Better Alternative to Error Feedback for Communication-Efficient Distributed Learning

Modern large-scale machine learning applications require stochastic optimization algorithms to be implemented on distributed compute systems. A key bottleneck of such systems is the communication overhead for exchanging information across…

Machine Learning · Computer Science 2021-03-16 Samuel Horváth, Peter Richtárik

Contractive error feedback for gradient compression

On-device memory concerns in distributed deep learning have become severe due to (i) the growth of model size in multi-GPU training, and (ii) the wide adoption of deep neural networks for federated learning on IoT devices which have limited…

Machine Learning · Computer Science 2023-12-15 Bingcong Li, Shuai Zheng, Parameswaran Raman, Anshumali Shrivastava, Georgios B. Giannakis

Modular Distributed Nonconvex Learning with Error Feedback

In this paper, we design a novel distributed learning algorithm using stochastic compressed communications. In detail, we pursue a modular approach, merging ADMM and a gradient-based approach, benefiting from the robustness of the former…

Optimization and Control · Mathematics 2025-07-01 Guido Carnevale, Nicola Bastianello

Safe-EF: Error Feedback for Nonsmooth Constrained Optimization

Federated learning faces severe communication bottlenecks due to the high dimensionality of model updates. Communication compression with contractive compressors (e.g., Top-K) is often preferable in practice but can degrade performance…

Machine Learning · Computer Science 2025-06-04 Rustem Islamov, Yarden As, Ilyas Fatkhullin

Accelerated Methods with Compressed Communications for Distributed Optimization Problems under Data Similarity

In recent years, as data and problem sizes have increased, distributed learning has become an essential tool for training high-performance models. However, the communication bottleneck, especially for high-dimensional data, is a challenge.…

Optimization and Control · Mathematics 2025-04-28 Dmitry Bylinkin, Aleksandr Beznosikov

Communication-Compressed Adaptive Gradient Method for Distributed Nonconvex Optimization

Due to the explosion in the size of the training datasets, distributed learning has received growing interest in recent years. One of the major bottlenecks is the large communication cost between the central server and the local workers.…

Machine Learning · Computer Science 2022-02-25 Yujia Wang, Lu Lin, Jinghui Chen

EF21: A New, Simpler, Theoretically Better, and Practically Faster Error Feedback

Error feedback (EF), also known as error compensation, is an immensely popular convergence stabilization mechanism in the context of distributed training of supervised machine learning models enhanced by the use of contractive communication…

Machine Learning · Computer Science 2021-06-10 Peter Richtárik, Igor Sokolov, Ilyas Fatkhullin

Analysis of Error Feedback in Federated Non-Convex Optimization with Biased Compression

In federated learning (FL) systems, e.g., wireless networks, the communication cost between the clients and the central server can often be a bottleneck. To reduce the communication cost, the paradigm of communication compression has become…

Machine Learning · Statistics 2022-11-28 Xiaoyun Li, Ping Li

Compressed Proximal Federated Learning for Non-Convex Composite Optimization on Heterogeneous Data

Federated Composite Optimization (FCO) has emerged as a promising framework for training models with structural constraints (e.g., sparsity) in distributed edge networks. However, simultaneously achieving communication efficiency and…

Optimization and Control · Mathematics 2026-03-10 Pu Qiu, Chen Ouyang, Yongyang Xiong, Keyou You, Wanquan Liu, Yang Shi

Compressed Decentralized Momentum Stochastic Gradient Methods for Nonconvex Optimization

In this paper, we design two compressed decentralized algorithms for solving nonconvex stochastic optimization under two different scenarios. Both algorithms adopt a momentum technique to achieve fast convergence and a message-compression…

Machine Learning · Computer Science 2025-08-08 Wei Liu, Anweshit Panda, Ujwal Pandey, Christopher Brissette, Yikang Shen, George M. Slota, Naigang Wang, Jie Chen, Yangyang Xu

Accelerated Sparsified SGD with Error Feedback

A stochastic gradient method for synchronous distributed optimization is studied. To reduce communication cost, we focus in particular on compressing the communicated gradients. Several works have shown that sparsified…

Optimization and Control · Mathematics 2020-06-22 Tomoya Murata, Taiji Suzuki

EControl: Fast Distributed Optimization with Compression and Error Control

Modern distributed training relies heavily on communication compression to reduce the communication overhead. In this work, we study algorithms employing a popular class of contractive compressors in order to reduce communication overhead.…

Optimization and Control · Mathematics 2023-11-13 Yuan Gao, Rustem Islamov, Sebastian Stich

Communication Compression for Distributed Learning without Control Variates

Distributed learning algorithms, such as the ones employed in Federated Learning (FL), require communication compression to reduce the cost of client uploads. The compression methods used in practice are often biased, making error feedback…

Machine Learning · Computer Science 2025-09-12 Tomas Ortega, Chun-Yin Huang, Xiaoxiao Li, Hamid Jafarkhani

CompressedScaffnew: The First Theoretical Double Acceleration of Communication from Local Training and Compression in Distributed Optimization

In distributed optimization, a large number of machines alternate between local computations and communication with a coordinating server. Communication, which can be slow and costly, is the main bottleneck in this setting. To reduce this…

Machine Learning · Computer Science 2026-04-03 Laurent Condat, Ivan Agarský, Peter Richtárik

Distributed Stochastic Consensus Optimization with Momentum for Nonconvex Nonsmooth Problems

While many distributed optimization algorithms have been proposed for solving smooth or convex problems over the networks, few of them can handle non-convex and non-smooth problems. Based on a proximal primal-dual approach, this paper…

Optimization and Control · Mathematics 2021-09-01 Zhiguo Wang, Jiawei Zhang, Tsung-Hui Chang, Jian Li, Zhi-Quan Luo

EF21 with Bells & Whistles: Six Algorithmic Extensions of Modern Error Feedback

First proposed by Seide et al. (2014) as a heuristic, error feedback (EF) is a very popular mechanism for enforcing convergence of distributed gradient-based optimization methods enhanced with communication compression strategies based on the…

Machine Learning · Computer Science 2025-06-23 Ilyas Fatkhullin, Igor Sokolov, Eduard Gorbunov, Zhize Li, Peter Richtárik

Technical Report: A Totally Asynchronous Nesterov's Accelerated Gradient Method for Convex Optimization

We present a totally asynchronous algorithm for convex optimization that is based on a novel generalization of Nesterov's accelerated gradient method. This algorithm is developed for fast convergence under "total asynchrony," i.e., allowing…

Optimization and Control · Mathematics 2024-06-17 Ellie Pond, April Sebok, Zachary Bell, Matthew Hale

Composite Optimization with Error Feedback: the Dual Averaging Approach

Communication efficiency is a central challenge in distributed machine learning training, and message compression is a widely used solution. However, standard Error Feedback (EF) methods (Seide et al., 2014), though effective for smooth…

Optimization and Control · Mathematics 2025-10-07 Yuan Gao, Anton Rodomanov, Jeremy Rack, Sebastian Stich

On Arbitrary Compression for Decentralized Consensus and Stochastic Optimization over Directed Networks

We study the decentralized consensus and stochastic optimization problems with compressed communications over static directed graphs. We propose an iterative gradient-based algorithm that compresses messages according to a desired…

Optimization and Control · Mathematics 2022-04-19 Mohammad Taha Toghani, César A. Uribe