Related papers: Securing Distributed Gradient Descent in High Dime…

Distributed Statistical Machine Learning in Adversarial Settings: Byzantine Gradient Descent

We consider the problem of distributed statistical machine learning in adversarial settings, where some unknown and time-varying subset of working machines may be compromised and behave arbitrarily to prevent an accurate model from being…

Distributed, Parallel, and Cluster Computing · Computer Science 2017-10-24 Yudong Chen , Lili Su , Jiaming Xu

Byzantine-Robust Distributed Learning: Towards Optimal Statistical Rates

In large-scale distributed learning, security issues have become increasingly important. Particularly in a decentralized environment, some computing units may behave abnormally, or even exhibit Byzantine failures -- arbitrary and…

Machine Learning · Computer Science 2021-02-26 Dong Yin , Yudong Chen , Kannan Ramchandran , Peter Bartlett

Byzantine-Resilient SGD in High Dimensions on Heterogeneous Data

We study distributed stochastic gradient descent (SGD) in the master-worker architecture under Byzantine attacks. We consider the heterogeneous data model, where different workers may have different local datasets, and we do not make any…

Machine Learning · Statistics 2020-05-19 Deepesh Data , Suhas Diggavi

Communication-Efficient and Byzantine-Robust Distributed Learning with Error Feedback

We develop a communication-efficient distributed learning algorithm that is robust against Byzantine worker machines. We propose and analyze a distributed gradient-descent algorithm that performs a simple thresholding based on gradient…

Machine Learning · Computer Science 2021-08-17 Avishek Ghosh , Raj Kumar Maity , Swanand Kadhe , Arya Mazumdar , Kannan Ramchandran

High Dimensional Distributed Gradient Descent with Arbitrary Number of Byzantine Attackers

Adversarial attacks pose a major challenge to distributed learning systems, prompting the development of numerous robust learning methods. However, most existing approaches suffer from the curse of dimensionality, i.e. the error increases…

Machine Learning · Computer Science 2025-11-19 Wenyu Liu , Tianqiang Huang , Pengfei Zhang , Zong Ke , Minghui Min , Puning Zhao

Unveiling High-Probability Generalization in Decentralized SGD

Decentralized stochastic gradient descent (D-SGD) is an efficient method for large-scale distributed learning. Existing generalization studies mainly address expected results, achieving rates limited to $\mathcal{O}\left(\frac{1}{\delta…

Machine Learning · Computer Science 2026-05-12 Jiahuan Wang , Ping Luo , Ziqing Wen , Dongsheng Li , Tao Sun

Distributed Byzantine Tolerant Stochastic Gradient Descent in the Era of Big Data

The recent advances in sensor technologies and smart devices enable the collaborative collection of a sheer volume of data from multiple information sources. As a promising tool to efficiently extract useful information from such big data,…

Machine Learning · Computer Science 2019-03-08 Richeng Jin , Xiaofan He , Huaiyu Dai

Aspis: Robust Detection for Distributed Learning

State-of-the-art machine learning models are routinely trained on large-scale distributed clusters. Crucially, such systems can be compromised when some of the computing devices exhibit abnormal (Byzantine) behavior and return arbitrary…

Machine Learning · Computer Science 2022-01-25 Konstantinos Konstantinidis , Aditya Ramamoorthy

Detection and Mitigation of Byzantine Attacks in Distributed Training

A plethora of modern machine learning tasks require the utilization of large-scale distributed clusters as a critical component of the training pipeline. However, abnormal Byzantine behavior of the worker nodes can derail the training and…

Machine Learning · Computer Science 2023-05-16 Konstantinos Konstantinidis , Namrata Vaswani , Aditya Ramamoorthy

Resilient Two-Time-Scale Local Stochastic Gradient Descent for Byzantine Federated Learning

We study local stochastic gradient descent methods for solving federated optimization over a network of agents communicating indirectly through a centralized coordinator. We are interested in the Byzantine setting where there is a subset of…

Optimization and Control · Mathematics 2024-09-06 Amit Dutta , Thinh T. Doan

dSTAR: Straggler Tolerant and Byzantine Resilient Distributed SGD

Distributed model training needs to be adapted to challenges such as the straggler effect and Byzantine attacks. When coordinating the training process with multiple computing nodes, ensuring timely and reliable gradient aggregation amidst…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-12-11 Jiahe Yan , Pratik Chaudhari , Leonard Kleinrock

Efficient Byzantine-Resilient Stochastic Gradient Desce

Distributed Learning often suffers from Byzantine failures, and there have been a number of works studying the problem of distributed stochastic optimization under Byzantine failures, where only a portion of workers, instead of all the…

Distributed, Parallel, and Cluster Computing · Computer Science 2021-08-17 Kaiyun Li , Xiaojun Chen , Ye Dong , Peng Zhang , Dakui Wang , Shuai Zen

DRACO: Byzantine-resilient Distributed Training via Redundant Gradients

Distributed model training is vulnerable to byzantine system failures and adversarial compute nodes, i.e., nodes that use malicious updates to corrupt the global model stored at a parameter server (PS). To guarantee some form of robustness,…

Machine Learning · Statistics 2018-06-25 Lingjiao Chen , Hongyi Wang , Zachary Charles , Dimitris Papailiopoulos

Gradient Manipulation in Distributed Stochastic Gradient Descent with Strategic Agents: Truthful Incentives with Convergence Guarantees

Distributed learning has gained significant attention due to its advantages in scalability, privacy, and fault tolerance.In this paradigm, multiple agents collaboratively train a global model by exchanging parameters only with their…

Machine Learning · Computer Science 2026-03-31 Ziqin Chen , Yongqiang Wang

Byzantine Robustness and Partial Participation Can Be Achieved at Once: Just Clip Gradient Differences

Distributed learning has emerged as a leading paradigm for training large machine learning models. However, in real-world scenarios, participants may be unreliable or malicious, posing a significant challenge to the integrity and accuracy…

Machine Learning · Computer Science 2024-06-10 Grigory Malinovsky , Peter Richtárik , Samuel Horváth , Eduard Gorbunov

The Hidden Vulnerability of Distributed Learning in Byzantium

While machine learning is going through an era of celebrated success, concerns have been raised about the vulnerability of its backbone: stochastic gradient descent (SGD). Recent approaches have been proposed to ensure the robustness of…

Machine Learning · Statistics 2018-07-19 El Mahdi El Mhamdi , Rachid Guerraoui , Sébastien Rouault

Byzantine-Robust Variance-Reduced Federated Learning over Distributed Non-i.i.d. Data

We consider the federated learning problem where data on workers are not independent and identically distributed (i.i.d.). During the learning process, an unknown number of Byzantine workers may send malicious messages to the central node,…

Machine Learning · Computer Science 2021-08-31 Jie Peng , Zhaoxian Wu , Qing Ling , Tianyi Chen

Interactive Byzantine-Resilient Gradient Coding for General Data Assignments

We tackle the problem of Byzantine errors in distributed gradient descent within the Byzantine-resilient gradient coding framework. Our proposed solution can recover the exact full gradient in the presence of $s$ malicious workers with a…

Information Theory · Computer Science 2024-01-31 Shreyas Jain , Luis Maßny , Christoph Hofmeister , Eitan Yaakobi , Rawad Bitar

ByRDiE: Byzantine-resilient distributed coordinate descent for decentralized learning

Distributed machine learning algorithms enable learning of models from datasets that are distributed over a network without gathering the data at a centralized location. While efficient distributed algorithms have been developed under the…

Machine Learning · Computer Science 2020-07-07 Zhixiong Yang , Waheed U. Bajwa

Defending Against Saddle Point Attack in Byzantine-Robust Distributed Learning

We study robust distributed learning that involves minimizing a non-convex loss function with saddle points. We consider the Byzantine setting where some worker machines have abnormal or even arbitrary and adversarial behavior. In this…

Machine Learning · Computer Science 2020-07-30 Dong Yin , Yudong Chen , Kannan Ramchandran , Peter Bartlett