English
Related papers

Related papers: Robust learning under clean-label attack

200 papers

Data poisoning attacks, in which an adversary corrupts a training set with the goal of inducing specific desired mistakes, have raised substantial concern: even just the possibility of such an attack can make a user no longer trust the…

Machine Learning · Computer Science 2022-03-09 Maria-Florina Balcan , Avrim Blum , Steve Hanneke , Dravyansh Sharma

We study the problem of learning an adversarially robust predictor to test time attacks in the semi-supervised PAC model. We address the question of how many labeled and unlabeled examples are required to ensure learning. We show that…

Machine Learning · Computer Science 2024-05-07 Idan Attias , Steve Hanneke , Yishay Mansour

Data poisoning is a training-time attack that undermines the trustworthiness of learned models. In a targeted data poisoning attack, an adversary manipulates the training dataset to alter the classification of a targeted test point. Given…

Machine Learning · Computer Science 2025-11-18 Nakshatra Gupta , Sumanth Prabhu , Supratik Chakraborty , R Venkatesh

Deep neural networks are vulnerable to backdoor attacks, a type of adversarial attack that poisons the training data to manipulate the behavior of models trained on such data. Clean-label attacks are a more stealthy form of backdoor attacks…

Machine Learning · Computer Science 2024-07-17 Quang H. Nguyen , Nguyen Ngoc-Hieu , The-Anh Ta , Thanh Nguyen-Tang , Kok-Seng Wong , Hoang Thanh-Tung , Khoa D. Doan

In this work, we initiate a formal study of probably approximately correct (PAC) learning under evasion attacks, where the adversary's goal is to \emph{misclassify} the adversarially perturbed sample point $\widetilde{x}$, i.e.,…

Machine Learning · Computer Science 2019-06-14 Dimitrios I. Diochnos , Saeed Mahloujifar , Mohammad Mahmoody

Proper learning refers to the setting in which learners must emit predictors in the underlying hypothesis class $H$, and often leads to learners with simple algorithmic forms (e.g. empirical risk minimization (ERM), structural risk…

Machine Learning · Computer Science 2025-12-10 Julian Asilis , Siddartha Devic , Shaddin Dughmi , Vatsal Sharan , Shang-Hua Teng

Semi-supervised machine learning models learn from a (small) set of labeled training examples, and a (large) set of unlabeled training examples. State-of-the-art models can reach within a few percentage points of fully-supervised training,…

Machine Learning · Computer Science 2021-08-11 Nicholas Carlini

In this paper, we study PAC learnability and certification of predictions under instance-targeted poisoning attacks, where the adversary who knows the test instance may change a fraction of the training set with the goal of fooling the…

Machine Learning · Computer Science 2021-08-10 Ji Gao , Amin Karbasi , Mohammad Mahmoody

Predictions of certifiably robust classifiers remain constant in a neighborhood of a point, making them resilient to test-time attacks with a guarantee. In this work, we present a previously unrecognized threat to robust machine learning…

Machine Learning · Computer Science 2021-03-31 Akshay Mehra , Bhavya Kailkhura , Pin-Yu Chen , Jihun Hamm

It is becoming increasingly important to understand the vulnerability of machine learning models to adversarial attacks. In this paper we study the feasibility of robust learning from the perspective of computational learning theory,…

Machine Learning · Computer Science 2019-09-13 Pascale Gourdeau , Varun Kanade , Marta Kwiatkowska , James Worrell

Continual learning algorithms are typically exposed to untrusted sources that contain training data inserted by adversaries and bad actors. An adversary can insert a small number of poisoned samples, such as mislabeled samples from…

Machine Learning · Computer Science 2023-11-21 Huayu Li , Gregory Ditzler

We study the problem of adversarially robust learning in the transductive setting. For classes $\mathcal{H}$ of bounded VC dimension, we propose a simple transductive learner that when presented with a set of labeled training examples and a…

Machine Learning · Computer Science 2021-10-22 Omar Montasser , Steve Hanneke , Nathan Srebro

We study indiscriminate poisoning for linear learners where an adversary injects a few crafted examples into the training data with the goal of forcing the induced model to incur higher test error. Inspired by the observation that linear…

Machine Learning · Computer Science 2023-11-13 Fnu Suya , Xiao Zhang , Yuan Tian , David Evans

Machine learning algorithms are known to be susceptible to data poisoning attacks, where an adversary manipulates the training data to degrade performance of the resulting classifier. In this work, we present a unifying view of randomized…

Machine Learning · Computer Science 2021-02-24 Elan Rosenfeld , Ezra Winston , Pradeep Ravikumar , J. Zico Kolter

Instance-targeted data poisoning attacks, where an adversary corrupts a training set to induce errors on specific test points, have raised significant concerns. Balcan et al (2022) proposed an approach to addressing this challenge by…

Machine Learning · Computer Science 2025-05-09 Avrim Blum , Donya Saless

We study the problem of learning robust classifiers where the classifier will receive a perturbed input. Unlike robust PAC learning studied in prior work, here the clean data and its label are also adversarially chosen. We formulate this…

Machine Learning · Computer Science 2026-03-02 Sajad Ashkezari

Recently, Montasser et al. [2019] showed that finite VC dimension is not sufficient for proper adversarially robust PAC learning. In light of this hardness, there is a growing effort to study what type of relaxations to the adversarially…

Machine Learning · Computer Science 2023-05-26 Vinod Raman , Unique Subedi , Ambuj Tewari

Recently, Mahloujifar and Mahmoody (TCC'17) studied attacks against learning algorithms using a special case of Valiant's malicious noise, called $p$-tampering, in which the adversary gets to change any training example with independent…

Machine Learning · Computer Science 2018-11-28 Saeed Mahloujifar , Dimitrios I. Diochnos , Mohammad Mahmoody

Optimizing prediction accuracy can come at the expense of fairness. Towards minimizing discrimination against a group, fair machine learning algorithms strive to equalize the behavior of a model across different groups, by imposing a…

Machine Learning · Statistics 2020-06-17 Hongyan Chang , Ta Duy Nguyen , Sasi Kumar Murakonda , Ehsan Kazemi , Reza Shokri

We introduce a novel clean-label targeted poisoning attack on learning mechanisms. While classical poisoning attacks typically corrupt data via addition, modification and omission, our attack focuses on data omission only. Our attack…

Machine Learning · Computer Science 2021-05-06 Guy Barash , Eitan Farchi , Sarit Kraus , Onn Shehory
‹ Prev 1 2 3 10 Next ›