Related papers: Poisoning Attacks on Algorithmic Fairness

Poisoning Attacks on Fair Machine Learning

Both fair machine learning and adversarial learning have been extensively studied. However, attacking fair machine learning models has received less attention. In this paper, we present a framework that seeks to effectively generate…

Machine Learning · Computer Science 2021-10-19 Minh-Hao Van, Wei Du, Xintao Wu, Aidong Lu

Adversarial Bias: Data Poisoning Attacks on Fairness

With the growing adoption of AI and machine learning systems in real-world applications, ensuring their fairness has become increasingly critical. The majority of the work in algorithmic fairness focuses on assessing and improving the…

Machine Learning · Computer Science 2025-11-12 Eunice Chan, Hanghang Tong

Exacerbating Algorithmic Bias through Fairness Attacks

Algorithmic fairness has attracted significant attention in recent years, with many quantitative measures suggested for characterizing the fairness of different machine learning algorithms. Despite this interest, the robustness of those…

Machine Learning · Computer Science 2020-12-17 Ninareh Mehrabi, Muhammad Naveed, Fred Morstatter, Aram Galstyan

Towards Class-Oriented Poisoning Attacks Against Neural Networks

Poisoning attacks on machine learning systems compromise the model performance by deliberately injecting malicious samples in the training dataset to influence the training process. Prior works focus on either availability attacks (i.e.,…

Machine Learning · Computer Science 2021-10-13 Bingyin Zhao, Yingjie Lao

Towards Poisoning of Deep Learning Algorithms with Back-gradient Optimization

A number of online services nowadays rely upon machine learning to extract valuable information from data collected in the wild. This exposes learning algorithms to the threat of data poisoning, i.e., a coordinated attack in which a fraction…

Machine Learning · Computer Science 2017-08-30 Luis Muñoz-González, Battista Biggio, Ambra Demontis, Andrea Paudice, Vasin Wongrassamee, Emil C. Lupu, Fabio Roli

Poisoning Attacks with Generative Adversarial Nets

Machine learning algorithms are vulnerable to poisoning attacks: An adversary can inject malicious points in the training dataset to influence the learning process and degrade the algorithm's performance. Optimal poisoning attacks have…

Machine Learning · Computer Science 2019-09-26 Luis Muñoz-González, Bjarne Pfitzner, Matteo Russo, Javier Carnerero-Cano, Emil C. Lupu

Online Poisoning Attack Against Reinforcement Learning under Black-box Environments

This paper proposes an online environment poisoning algorithm tailored for reinforcement learning agents operating in a black-box setting, where an adversary deliberately manipulates training data to lead the agent toward a mischievous…

Machine Learning · Computer Science 2024-12-03 Jianhui Li, Bokang Zhang, Junfeng Wu

On Adversarial Bias and the Robustness of Fair Machine Learning

Optimizing prediction accuracy can come at the expense of fairness. Towards minimizing discrimination against a group, fair machine learning algorithms strive to equalize the behavior of a model across different groups, by imposing a…

Machine Learning · Statistics 2020-06-17 Hongyan Chang, Ta Duy Nguyen, Sasi Kumar Murakonda, Ehsan Kazemi, Reza Shokri

Towards Fair Classification against Poisoning Attacks

Fair classification aims to stress the classification models to achieve the equality (treatment or prediction quality) among different sensitive groups. However, fair classification can be under the risk of poisoning attacks that…

Machine Learning · Computer Science 2022-10-19 Han Xu, Xiaorui Liu, Yuxuan Wan, Jiliang Tang

Detection of Adversarial Training Examples in Poisoning Attacks through Anomaly Detection

Machine learning has become an important component for many systems and applications including computer vision, spam filtering, malware and network intrusion detection, among others. Despite the capabilities of machine learning algorithms…

Machine Learning · Statistics 2018-02-14 Andrea Paudice, Luis Muñoz-González, Andras Gyorgy, Emil C. Lupu

Not All Poisons are Created Equal: Robust Training against Data Poisoning

Data poisoning causes misclassification of test time target examples by injecting maliciously crafted samples in the training data. Existing defenses are often effective only against a specific type of targeted attack, significantly degrade…

Machine Learning · Computer Science 2022-10-19 Yu Yang, Tian Yu Liu, Baharan Mirzasoleiman

Transferable Availability Poisoning Attacks

We consider availability data poisoning attacks, where an adversary aims to degrade the overall test accuracy of a machine learning model by crafting small perturbations to its training data. Existing poisoning strategies can achieve the…

Cryptography and Security · Computer Science 2024-06-07 Yiyong Liu, Michael Backes, Xiao Zhang

Data Poisoning Attacks against Online Learning

We consider data poisoning attacks, a class of adversarial attacks on machine learning where an adversary has the power to alter a small fraction of the training data in order to make the trained classifier satisfy certain objectives. While…

Machine Learning · Computer Science 2018-08-29 Yizhen Wang, Kamalika Chaudhuri

Poisoning Attacks and Defenses on Artificial Intelligence: A Survey

Machine learning models have been widely adopted in several fields. However, most recent studies have shown several vulnerabilities from attacks with a potential to jeopardize the integrity of the model, presenting a new window of research…

Cryptography and Security · Computer Science 2022-02-23 Miguel A. Ramirez, Song-Kyoo Kim, Hussam Al Hamadi, Ernesto Damiani, Young-Ji Byon, Tae-Yeon Kim, Chung-Suk Cho, Chan Yeob Yeun

Witches' Brew: Industrial Scale Data Poisoning via Gradient Matching

Data Poisoning attacks modify training data to maliciously control a model trained on such data. In this work, we focus on targeted poisoning attacks which cause a reclassification of an unmodified test image and as such breach model…

Computer Vision and Pattern Recognition · Computer Science 2021-05-11 Jonas Geiping, Liam Fowl, W. Ronny Huang, Wojciech Czaja, Gavin Taylor, Michael Moeller, Tom Goldstein

Manipulating Machine Learning: Poisoning Attacks and Countermeasures for Regression Learning

As machine learning becomes widely used for automated decisions, attackers have strong incentives to manipulate the results and models generated by machine learning algorithms. In this paper, we perform the first systematic study of…

Cryptography and Security · Computer Science 2021-09-29 Matthew Jagielski, Alina Oprea, Battista Biggio, Chang Liu, Cristina Nita-Rotaru, Bo Li

Model-Targeted Poisoning Attacks with Provable Convergence

In a poisoning attack, an adversary with control over a small fraction of the training data attempts to select that data in a way that induces a corrupted model that misbehaves in favor of the adversary. We consider poisoning attacks…

Machine Learning · Computer Science 2021-04-22 Fnu Suya, Saeed Mahloujifar, Anshuman Suri, David Evans, Yuan Tian

Outlier-Oriented Poisoning Attack: A Grey-box Approach to Disturb Decision Boundaries by Perturbing Outliers in Multiclass Learning

Poisoning attacks are a primary threat to machine learning models, aiming to compromise their performance and reliability by manipulating training datasets. This paper introduces a novel attack - Outlier-Oriented Poisoning (OOP) attack,…

Machine Learning · Computer Science 2024-11-04 Anum Paracha, Junaid Arshad, Mohamed Ben Farah, Khalid Ismail

Inverting Gradient Attacks Makes Powerful Data Poisoning

Gradient attacks and data poisoning tamper with the training of machine learning algorithms to maliciously alter them and have been proven to be equivalent in convex settings. The extent of harm these attacks can produce in non-convex…

Machine Learning · Computer Science 2024-12-12 Wassim Bouaziz, El-Mahdi El-Mhamdi, Nicolas Usunier

Indiscriminate Data Poisoning Attacks on Neural Networks

Data poisoning attacks, in which a malicious adversary aims to influence a model by injecting "poisoned" data into the training process, have attracted significant recent attention. In this work, we take a closer look at existing poisoning…

Machine Learning · Computer Science 2024-02-16 Yiwei Lu, Gautam Kamath, Yaoliang Yu