Related papers: Adversarial Attacks on Data Attribution

Adversarial Attacks for Tabular Data: Application to Fraud Detection and Imbalanced Data

Guaranteeing the security of transactional systems is a crucial priority of all institutions that process transactions, in order to protect their businesses against cyberattacks and fraudulent attempts. Adversarial attacks are novel…

Cryptography and Security · Computer Science 2021-01-21 Francesco Cartella, Orlando Anunciacao, Yuki Funabiki, Daisuke Yamaguchi, Toru Akishita, Olivier Elshocht

Adversarial training for tabular data with attack propagation

Adversarial attacks are a major concern in security-centered applications, where malicious actors continuously try to mislead Machine Learning (ML) models into wrongly classifying fraudulent activity as legitimate, whereas system…

Machine Learning · Computer Science 2023-07-31 Tiago Leon Melo, João Bravo, Marco O. P. Sampaio, Paolo Romano, Hugo Ferreira, João Tiago Ascensão, Pedro Bizarro

A Data-Centric Approach for Improving Adversarial Training Through the Lens of Out-of-Distribution Detection

Current machine learning models achieve super-human performance in many real-world applications. Still, they are susceptible to imperceptible adversarial perturbations. The most effective solution for this problem is adversarial…

Machine Learning · Computer Science 2023-01-26 Mohammad Azizmalayeri, Arman Zarei, Alireza Isavand, Mohammad Taghi Manzuri, Mohammad Hossein Rohban

On the Fragility of Data Attribution When Learning Is Distributed

Data attribution has become an important component of pricing, auditing, and governance in machine learning pipelines, yet most attribution methods implicitly assume that attribution values faithfully reflect participants' contributions. We…

Machine Learning · Computer Science 2026-05-18 Xian Gao, Bo Hui, Min-Te Sun, Wei-Shinn Ku

The Adversarial Discount -- AI, Signal Correlation, and the Cybersecurity Arms Race

We study a contest-theoretic model of adversarial investment in which an attacker and a defender allocate resources to AI-augmented capabilities across multiple attack surfaces. The attacker's investment operates through two channels: it…

Theoretical Economics · Economics 2026-05-18 James W. Bono

Adversarial Attack Attribution: Discovering Attributable Signals in Adversarial ML Attacks

Machine Learning (ML) models are known to be vulnerable to adversarial inputs and researchers have demonstrated that even production systems, such as self-driving cars and ML-as-a-service offerings, are susceptible. These systems represent…

Machine Learning · Computer Science 2021-01-11 Marissa Dotter, Sherry Xie, Keith Manville, Josh Harguess, Colin Busho, Mikel Rodriguez

Adversarial attacks to image classification systems using evolutionary algorithms

Image classification currently faces significant security challenges due to adversarial attacks, which consist of intentional alterations designed to deceive classification models based on artificial intelligence. This article explores an…

Neural and Evolutionary Computing · Computer Science 2025-07-18 Sergio Nesmachnow, Jamal Toutouh

Attack Agnostic Statistical Method for Adversarial Detection

Deep Learning based AI systems have shown great promise in various domains such as vision, audio, autonomous systems (vehicles, drones), etc. Recent research on neural networks has shown the susceptibility of deep networks to adversarial…

Machine Learning · Computer Science 2019-11-25 Sambuddha Saha, Aashish Kumar, Pratyush Sahay, George Jose, Srinivas Kruthiventi, Harikrishna Muralidhara

On Adversarial Examples and Stealth Attacks in Artificial Intelligence Systems

In this work we present a formal theoretical framework for assessing and analyzing two classes of malevolent action towards generic Artificial Intelligence (AI) systems. Our results apply to general multi-class classifiers that map from an…

Machine Learning · Computer Science 2021-01-01 Ivan Y. Tyukin, Desmond J. Higham, Alexander N. Gorban

Sampling Attacks: Amplification of Membership Inference Attacks by Repeated Queries

Machine learning models have been shown to leak information violating the privacy of their training set. We focus on membership inference attacks on machine learning models which aim to determine whether a data point was used to train the…

Cryptography and Security · Computer Science 2020-09-02 Shadi Rahimian, Tribhuvanesh Orekondy, Mario Fritz

Extending Adversarial Attacks to Produce Adversarial Class Probability Distributions

Despite the remarkable performance and generalization levels of deep learning models in a wide range of artificial intelligence tasks, it has been demonstrated that these models can be easily fooled by the addition of imperceptible yet…

Machine Learning · Computer Science 2023-01-27 Jon Vadillo, Roberto Santana, Jose A. Lozano

Contributor-Aware Defenses Against Adversarial Backdoor Attacks

Deep neural networks for image classification are well-known to be vulnerable to adversarial attacks. One such attack that has garnered recent attention is the adversarial backdoor attack, which has demonstrated the capability to perform…

Cryptography and Security · Computer Science 2022-06-09 Glenn Dawson, Muhammad Umer, Robi Polikar

Better the Devil you Know: An Analysis of Evasion Attacks using Out-of-Distribution Adversarial Examples

A large body of recent work has investigated the phenomenon of evasion attacks using adversarial examples for deep learning systems, where the addition of norm-bounded perturbations to the test inputs leads to incorrect output…

Machine Learning · Computer Science 2019-05-07 Vikash Sehwag, Arjun Nitin Bhagoji, Liwei Song, Chawin Sitawarin, Daniel Cullina, Mung Chiang, Prateek Mittal

Game Theory for Adversarial Attacks and Defenses

Adversarial attacks can generate adversarial inputs by applying small but intentionally worst-case perturbations to samples from the dataset, which leads to even state-of-the-art deep neural networks outputting incorrect answers with high…

Machine Learning · Computer Science 2024-01-08 Shorya Sharma

Are Attribute Inference Attacks Just Imputation?

Models can expose sensitive information about their training data. In an attribute inference attack, an adversary has partial knowledge of some training records and access to a model trained on those records, and infers the unknown values…

Cryptography and Security · Computer Science 2022-09-07 Bargav Jayaraman, David Evans

ATOM: Robustifying Out-of-distribution Detection Using Outlier Mining

Detecting out-of-distribution (OOD) inputs is critical for safely deploying deep learning models in an open-world setting. However, existing OOD detection solutions can be brittle in the open world, facing various types of adversarial OOD…

Machine Learning · Computer Science 2021-07-01 Jiefeng Chen, Yixuan Li, Xi Wu, Yingyu Liang, Somesh Jha

Adversarial Attacks and Defenses on 3D Point Cloud Classification: A Survey

Deep learning has successfully solved a wide range of tasks in 2D vision as a dominant AI technique. Recently, deep learning on 3D point clouds is becoming increasingly popular for addressing various tasks in this field. Despite remarkable…

Computer Vision and Pattern Recognition · Computer Science 2023-12-04 Hanieh Naderi, Ivan V. Bajić

ML-LOO: Detecting Adversarial Examples with Feature Attribution

Deep neural networks obtain state-of-the-art performance on a series of tasks. However, they are easily fooled by adding a small adversarial perturbation to input. The perturbation is often human imperceptible on image data. We observe a…

Machine Learning · Computer Science 2019-06-11 Puyudi Yang, Jianbo Chen, Cho-Jui Hsieh, Jane-Ling Wang, Michael I. Jordan

Adversarial Distributional Training for Robust Deep Learning

Adversarial training (AT) is among the most effective techniques to improve model robustness by augmenting training data with adversarial examples. However, most existing AT methods adopt a specific attack to craft adversarial examples,…

Machine Learning · Computer Science 2020-11-20 Yinpeng Dong, Zhijie Deng, Tianyu Pang, Hang Su, Jun Zhu

Identifying Adversarial Attacks on Text Classifiers

The landscape of adversarial attacks against text classifiers continues to grow, with new attacks developed every year and many of them available in standard toolkits, such as TextAttack and OpenAttack. In response, there is a growing body…

Computation and Language · Computer Science 2022-01-24 Zhouhang Xie, Jonathan Brophy, Adam Noack, Wencong You, Kalyani Asthana, Carter Perkins, Sabrina Reis, Sameer Singh, Daniel Lowd