Related papers: Aspis: Robust Detection for Distributed Learning

Detection and Mitigation of Byzantine Attacks in Distributed Training

A plethora of modern machine learning tasks require the utilization of large-scale distributed clusters as a critical component of the training pipeline. However, abnormal Byzantine behavior of the worker nodes can derail the training and…

Machine Learning · Computer Science 2023-05-16 Konstantinos Konstantinidis , Namrata Vaswani , Aditya Ramamoorthy

ByzShield: An Efficient and Robust System for Distributed Training

Training of large scale models on distributed clusters is a critical component of the machine learning pipeline. However, this training can easily be made to fail if some workers behave in an adversarial (Byzantine) fashion whereby they…

Machine Learning · Computer Science 2021-03-05 Konstantinos Konstantinidis , Aditya Ramamoorthy

Distributed Statistical Machine Learning in Adversarial Settings: Byzantine Gradient Descent

We consider the problem of distributed statistical machine learning in adversarial settings, where some unknown and time-varying subset of working machines may be compromised and behave arbitrarily to prevent an accurate model from being…

Distributed, Parallel, and Cluster Computing · Computer Science 2017-10-24 Yudong Chen , Lili Su , Jiaming Xu

Robust Distributed Learning Against Both Distributional Shifts and Byzantine Attacks

In distributed learning systems, robustness issues may arise from two sources. On one hand, due to distributional shifts between training data and test data, the trained model could exhibit poor out-of-sample performance. On the other hand,…

Machine Learning · Computer Science 2022-11-01 Guanqiang Zhou , Ping Xu , Yue Wang , Zhi Tian

DRACO: Byzantine-resilient Distributed Training via Redundant Gradients

Distributed model training is vulnerable to byzantine system failures and adversarial compute nodes, i.e., nodes that use malicious updates to corrupt the global model stored at a parameter server (PS). To guarantee some form of robustness,…

Machine Learning · Statistics 2018-06-25 Lingjiao Chen , Hongyi Wang , Zachary Charles , Dimitris Papailiopoulos

Byzantine-Robust Distributed Learning: Towards Optimal Statistical Rates

In large-scale distributed learning, security issues have become increasingly important. Particularly in a decentralized environment, some computing units may behave abnormally, or even exhibit Byzantine failures -- arbitrary and…

Machine Learning · Computer Science 2021-02-26 Dong Yin , Yudong Chen , Kannan Ramchandran , Peter Bartlett

ByGARS: Byzantine SGD with Arbitrary Number of Attackers

We propose two novel stochastic gradient descent algorithms, ByGARS and ByGARS++, for distributed machine learning in the presence of any number of Byzantine adversaries. In these algorithms, reputation scores of workers are computed using…

Machine Learning · Computer Science 2020-12-09 Jayanth Regatti , Hao Chen , Abhishek Gupta

Tolerating Adversarial Attacks and Byzantine Faults in Distributed Machine Learning

Adversarial attacks attempt to disrupt the training, retraining and utilizing of artificial intelligence and machine learning models in large-scale distributed machine learning systems. This causes security risks on its prediction outcome.…

Cryptography and Security · Computer Science 2021-09-07 Yusen Wu , Hao Chen , Xin Wang , Chao Liu , Phuong Nguyen , Yelena Yesha

RSA: Byzantine-Robust Stochastic Aggregation Methods for Distributed Learning from Heterogeneous Datasets

In this paper, we propose a class of robust stochastic subgradient methods for distributed learning from heterogeneous datasets at presence of an unknown number of Byzantine workers. The Byzantine workers, during the learning process, may…

Machine Learning · Computer Science 2019-11-12 Liping Li , Wei Xu , Tianyi Chen , Georgios B. Giannakis , Qing Ling

Interactive Byzantine-Resilient Gradient Coding for General Data Assignments

We tackle the problem of Byzantine errors in distributed gradient descent within the Byzantine-resilient gradient coding framework. Our proposed solution can recover the exact full gradient in the presence of $s$ malicious workers with a…

Information Theory · Computer Science 2024-01-31 Shreyas Jain , Luis Maßny , Christoph Hofmeister , Eitan Yaakobi , Rawad Bitar

From Inexact Gradients to Byzantine Robustness: Acceleration and Optimization under Similarity

Standard federated learning algorithms are vulnerable to adversarial nodes, a.k.a. Byzantine failures. To solve this issue, robust distributed learning algorithms have been developed, which typically replace parameter averaging by robust…

Machine Learning · Computer Science 2026-02-04 Renaud Gaucher , Aymeric Dieuleveut , Hadrien Hendrikx

Byzantine-Robust Variance-Reduced Federated Learning over Distributed Non-i.i.d. Data

We consider the federated learning problem where data on workers are not independent and identically distributed (i.i.d.). During the learning process, an unknown number of Byzantine workers may send malicious messages to the central node,…

Machine Learning · Computer Science 2021-08-31 Jie Peng , Zhaoxian Wu , Qing Ling , Tianyi Chen

Byzantine Robustness and Partial Participation Can Be Achieved at Once: Just Clip Gradient Differences

Distributed learning has emerged as a leading paradigm for training large machine learning models. However, in real-world scenarios, participants may be unreliable or malicious, posing a significant challenge to the integrity and accuracy…

Machine Learning · Computer Science 2024-06-10 Grigory Malinovsky , Peter Richtárik , Samuel Horváth , Eduard Gorbunov

Defending Against Saddle Point Attack in Byzantine-Robust Distributed Learning

We study robust distributed learning that involves minimizing a non-convex loss function with saddle points. We consider the Byzantine setting where some worker machines have abnormal or even arbitrary and adversarial behavior. In this…

Machine Learning · Computer Science 2020-07-30 Dong Yin , Yudong Chen , Kannan Ramchandran , Peter Bartlett

Securing Distributed Gradient Descent in High Dimensional Statistical Learning

We consider unreliable distributed learning systems wherein the training data is kept confidential by external workers, and the learner has to interact closely with those workers to train a model. In particular, we assume that there exists…

Distributed, Parallel, and Cluster Computing · Computer Science 2019-05-10 Lili Su , Jiaming Xu

Byzantine-robust Federated Learning through Collaborative Malicious Gradient Filtering

Gradient-based training in federated learning is known to be vulnerable to faulty/malicious clients, which are often modeled as Byzantine clients. To this end, previous work either makes use of auxiliary data at parameter server to verify…

Machine Learning · Computer Science 2023-05-02 Jian Xu , Shao-Lun Huang , Linqi Song , Tian Lan

A Little Is Enough: Circumventing Defenses For Distributed Learning

Distributed learning is central for large-scale training of deep-learning models. However, they are exposed to a security threat in which Byzantine participants can interrupt or control the learning process. Previous attack models and their…

Machine Learning · Computer Science 2021-11-16 Moran Baruch , Gilad Baruch , Yoav Goldberg

Distributed Byzantine Tolerant Stochastic Gradient Descent in the Era of Big Data

The recent advances in sensor technologies and smart devices enable the collaborative collection of a sheer volume of data from multiple information sources. As a promising tool to efficiently extract useful information from such big data,…

Machine Learning · Computer Science 2019-03-08 Richeng Jin , Xiaofan He , Huaiyu Dai

dSTAR: Straggler Tolerant and Byzantine Resilient Distributed SGD

Distributed model training needs to be adapted to challenges such as the straggler effect and Byzantine attacks. When coordinating the training process with multiple computing nodes, ensuring timely and reliable gradient aggregation amidst…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-12-11 Jiahe Yan , Pratik Chaudhari , Leonard Kleinrock

The Hidden Vulnerability of Distributed Learning in Byzantium

While machine learning is going through an era of celebrated success, concerns have been raised about the vulnerability of its backbone: stochastic gradient descent (SGD). Recent approaches have been proposed to ensure the robustness of…

Machine Learning · Statistics 2018-07-19 El Mahdi El Mhamdi , Rachid Guerraoui , Sébastien Rouault