Related papers: Robust learning under clean-label attack

Robustly-reliable learners under poisoning attacks

Data poisoning attacks, in which an adversary corrupts a training set with the goal of inducing specific desired mistakes, have raised substantial concern: even just the possibility of such an attack can make a user no longer trust the…

Machine Learning · Computer Science 2022-03-09 Maria-Florina Balcan , Avrim Blum , Steve Hanneke , Dravyansh Sharma

A Characterization of Semi-Supervised Adversarially-Robust PAC Learnability

We study the problem of learning an adversarially robust predictor to test time attacks in the semi-supervised PAC model. We address the question of how many labeled and unlabeled examples are required to ensure learning. We show that…

Machine Learning · Computer Science 2024-05-07 Idan Attias , Steve Hanneke , Yishay Mansour

On Robustness of Linear Classifiers to Targeted Data Poisoning

Data poisoning is a training-time attack that undermines the trustworthiness of learned models. In a targeted data poisoning attack, an adversary manipulates the training dataset to alter the classification of a targeted test point. Given…

Machine Learning · Computer Science 2025-11-18 Nakshatra Gupta , Sumanth Prabhu , Supratik Chakraborty , R Venkatesh

Wicked Oddities: Selectively Poisoning for Effective Clean-Label Backdoor Attacks

Deep neural networks are vulnerable to backdoor attacks, a type of adversarial attack that poisons the training data to manipulate the behavior of models trained on such data. Clean-label attacks are a more stealthy form of backdoor attacks…

Machine Learning · Computer Science 2024-07-17 Quang H. Nguyen , Nguyen Ngoc-Hieu , The-Anh Ta , Thanh Nguyen-Tang , Kok-Seng Wong , Hoang Thanh-Tung , Khoa D. Doan

Lower Bounds for Adversarially Robust PAC Learning

In this work, we initiate a formal study of probably approximately correct (PAC) learning under evasion attacks, where the adversary's goal is to \emph{misclassify} the adversarially perturbed sample point $\widetilde{x}$, i.e.,…

Machine Learning · Computer Science 2019-06-14 Dimitrios I. Diochnos , Saeed Mahloujifar , Mohammad Mahmoody

Proper Learnability and the Role of Unlabeled Data

Proper learning refers to the setting in which learners must emit predictors in the underlying hypothesis class $H$, and often leads to learners with simple algorithmic forms (e.g. empirical risk minimization (ERM), structural risk…

Machine Learning · Computer Science 2025-12-10 Julian Asilis , Siddartha Devic , Shaddin Dughmi , Vatsal Sharan , Shang-Hua Teng

Poisoning the Unlabeled Dataset of Semi-Supervised Learning

Semi-supervised machine learning models learn from a (small) set of labeled training examples, and a (large) set of unlabeled training examples. State-of-the-art models can reach within a few percentage points of fully-supervised training,…

Machine Learning · Computer Science 2021-08-11 Nicholas Carlini

Learning and Certification under Instance-targeted Poisoning

In this paper, we study PAC learnability and certification of predictions under instance-targeted poisoning attacks, where the adversary who knows the test instance may change a fraction of the training set with the goal of fooling the…

Machine Learning · Computer Science 2021-08-10 Ji Gao , Amin Karbasi , Mohammad Mahmoody

How Robust are Randomized Smoothing based Defenses to Data Poisoning?

Predictions of certifiably robust classifiers remain constant in a neighborhood of a point, making them resilient to test-time attacks with a guarantee. In this work, we present a previously unrecognized threat to robust machine learning…

Machine Learning · Computer Science 2021-03-31 Akshay Mehra , Bhavya Kailkhura , Pin-Yu Chen , Jihun Hamm

On the Hardness of Robust Classification

It is becoming increasingly important to understand the vulnerability of machine learning models to adversarial attacks. In this paper we study the feasibility of robust learning from the perspective of computational learning theory,…

Machine Learning · Computer Science 2019-09-13 Pascale Gourdeau , Varun Kanade , Marta Kwiatkowska , James Worrell

PACOL: Poisoning Attacks Against Continual Learners

Continual learning algorithms are typically exposed to untrusted sources that contain training data inserted by adversaries and bad actors. An adversary can insert a small number of poisoned samples, such as mislabeled samples from…

Machine Learning · Computer Science 2023-11-21 Huayu Li , Gregory Ditzler

Transductive Robust Learning Guarantees

We study the problem of adversarially robust learning in the transductive setting. For classes $\mathcal{H}$ of bounded VC dimension, we propose a simple transductive learner that when presented with a set of labeled training examples and a…

Machine Learning · Computer Science 2021-10-22 Omar Montasser , Steve Hanneke , Nathan Srebro

What Distributions are Robust to Indiscriminate Poisoning Attacks for Linear Learners?

We study indiscriminate poisoning for linear learners where an adversary injects a few crafted examples into the training data with the goal of forcing the induced model to incur higher test error. Inspired by the observation that linear…

Machine Learning · Computer Science 2023-11-13 Fnu Suya , Xiao Zhang , Yuan Tian , David Evans

Certified Robustness to Label-Flipping Attacks via Randomized Smoothing

Machine learning algorithms are known to be susceptible to data poisoning attacks, where an adversary manipulates the training data to degrade performance of the resulting classifier. In this work, we present a unifying view of randomized…

Machine Learning · Computer Science 2021-02-24 Elan Rosenfeld , Ezra Winston , Pradeep Ravikumar , J. Zico Kolter

Regularized Robustly Reliable Learners and Instance Targeted Attacks

Instance-targeted data poisoning attacks, where an adversary corrupts a training set to induce errors on specific test points, have raised significant concerns. Balcan et al (2022) proposed an approach to addressing this challenge by…

Machine Learning · Computer Science 2025-05-09 Avrim Blum , Donya Saless

Robust Online Learning

We study the problem of learning robust classifiers where the classifier will receive a perturbed input. Unlike robust PAC learning studied in prior work, here the clean data and its label are also adversarially chosen. We formulate this…

Machine Learning · Computer Science 2026-03-02 Sajad Ashkezari

On Proper Learnability between Average- and Worst-case Robustness

Recently, Montasser et al. [2019] showed that finite VC dimension is not sufficient for proper adversarially robust PAC learning. In light of this hardness, there is a growing effort to study what type of relaxations to the adversarially…

Machine Learning · Computer Science 2023-05-26 Vinod Raman , Unique Subedi , Ambuj Tewari

Learning under $p$-Tampering Attacks

Recently, Mahloujifar and Mahmoody (TCC'17) studied attacks against learning algorithms using a special case of Valiant's malicious noise, called $p$-tampering, in which the adversary gets to change any training example with independent…

Machine Learning · Computer Science 2018-11-28 Saeed Mahloujifar , Dimitrios I. Diochnos , Mohammad Mahmoody

On Adversarial Bias and the Robustness of Fair Machine Learning

Optimizing prediction accuracy can come at the expense of fairness. Towards minimizing discrimination against a group, fair machine learning algorithms strive to equalize the behavior of a model across different groups, by imposing a…

Machine Learning · Statistics 2020-06-17 Hongyan Chang , Ta Duy Nguyen , Sasi Kumar Murakonda , Ehsan Kazemi , Reza Shokri

Broadly Applicable Targeted Data Sample Omission Attacks

We introduce a novel clean-label targeted poisoning attack on learning mechanisms. While classical poisoning attacks typically corrupt data via addition, modification and omission, our attack focuses on data omission only. Our attack…

Machine Learning · Computer Science 2021-05-06 Guy Barash , Eitan Farchi , Sarit Kraus , Onn Shehory