Related papers: Radioactive data: tracing through training

Certified Robustness to Data Poisoning in Gradient-Based Training

Modern machine learning pipelines leverage large amounts of public data, making it infeasible to guarantee data quality and leaving models open to poisoning and backdoor attacks. Provably bounding model behavior under such attacks remains…

Machine Learning · Computer Science 2024-10-31 Philip Sosnin , Mark N. Müller , Maximilian Baader , Calvin Tsay , Matthew Wicker

Reverse Engineering Imperceptible Backdoor Attacks on Deep Neural Networks for Detection and Training Set Cleansing

Backdoor data poisoning is an emerging form of adversarial attack usually against deep neural network image classifiers. The attacker poisons the training set with a relatively small set of images from one (or several) source class(es),…

Machine Learning · Computer Science 2020-10-16 Zhen Xiang , David J. Miller , George Kesidis

Reconstructing Training Data from Model Gradient, Provably

Understanding when and how much a model gradient leaks information about the training sample is an important question in privacy. In this paper, we present a surprising result: even without training or memorizing the data, we can fully…

Machine Learning · Computer Science 2023-06-13 Zihan Wang , Jason D. Lee , Qi Lei

DAD: Data-free Adversarial Defense at Test Time

Deep models are highly susceptible to adversarial attacks. Such attacks are carefully crafted imperceptible noises that can fool the network and can cause severe consequences when deployed. To encounter them, the model requires training…

Machine Learning · Computer Science 2022-04-11 Gaurav Kumar Nayak , Ruchit Rawal , Anirban Chakraborty

On the Effectiveness of Dataset Watermarking in Adversarial Settings

In a data-driven world, datasets constitute a significant economic value. Dataset owners who spend time and money to collect and curate the data are incentivized to ensure that their datasets are not used in ways that they did not…

Cryptography and Security · Computer Science 2022-02-28 Buse Gul Atli Tekgul , N. Asokan

Spectral Signatures in Backdoor Attacks

A recent line of work has uncovered a new form of data poisoning: so-called \emph{backdoor} attacks. These attacks are particularly dangerous because they do not affect a network's behavior on typical, benign data. Rather, the network only…

Machine Learning · Computer Science 2018-11-05 Brandon Tran , Jerry Li , Aleksander Madry

D\'ej\`a Vu: an empirical evaluation of the memorization properties of ConvNets

Convolutional neural networks memorize part of their training data, which is why strategies such as data augmentation and drop-out are employed to mitigate overfitting. This paper considers the related question of "membership inference",…

Computer Vision and Pattern Recognition · Computer Science 2018-09-19 Alexandre Sablayrolles , Matthijs Douze , Cordelia Schmid , Hervé Jégou

Anonymity Unveiled: A Practical Framework for Auditing Data Use in Deep Learning Models

The rise of deep learning (DL) has led to a surging demand for training data, which incentivizes the creators of DL models to trawl through the Internet for training materials. Meanwhile, users often have limited control over whether their…

Cryptography and Security · Computer Science 2025-05-23 Zitao Chen , Karthik Pattabiraman

Detection under Privileged Information

For well over a quarter century, detection systems have been driven by models learned from input features collected from real or simulated environments. An artifact (e.g., network event, potential malware sample, suspicious email) is deemed…

Cryptography and Security · Computer Science 2018-04-03 Z. Berkay Celik , Patrick McDaniel , Rauf Izmailov , Nicolas Papernot , Ryan Sheatsley , Raquel Alvarez , Ananthram Swami

Revealing Perceptible Backdoors, without the Training Set, via the Maximum Achievable Misclassification Fraction Statistic

Recently, a backdoor data poisoning attack was proposed, which adds mislabeled examples to the training set, with an embedded backdoor pattern, aiming to have the classifier learn to classify to a target class whenever the backdoor pattern…

Machine Learning · Computer Science 2023-07-17 Zhen Xiang , David J. Miller , Hang Wang , George Kesidis

Property Inference From Poisoning

Property inference attacks consider an adversary who has access to the trained model and tries to extract some global statistics of the training data. In this work, we study property inference in scenarios where the adversary can…

Machine Learning · Computer Science 2021-01-28 Melissa Chase , Esha Ghosh , Saeed Mahloujifar

Deep Probabilistic Models to Detect Data Poisoning Attacks

Data poisoning attacks compromise the integrity of machine-learning models by introducing malicious training samples to influence the results during test time. In this work, we investigate backdoor data poisoning attack on deep neural…

Machine Learning · Computer Science 2019-12-04 Mahesh Subedar , Nilesh Ahuja , Ranganath Krishnan , Ibrahima J. Ndiour , Omesh Tickoo

Quantifying the Privacy Risks of Learning High-Dimensional Graphical Models

Models leak information about their training data. This enables attackers to infer sensitive information about their training sets, notably determine if a data sample was part of the model's training set. The existing works empirically show…

Machine Learning · Statistics 2021-02-18 Sasi Kumar Murakonda , Reza Shokri , George Theodorakopoulos

Unlearning Traces the Influential Training Data of Language Models

Identifying the training datasets that influence a language model's outputs is essential for minimizing the generation of harmful content and enhancing its performance. Ideally, we can measure the influence of each dataset by removing it…

Computation and Language · Computer Science 2024-06-14 Masaru Isonuma , Ivan Titov

Better Membership Inference Privacy Measurement through Discrepancy

Membership Inference Attacks have emerged as a dominant method for empirically measuring privacy leakage from machine learning models. Here, privacy is measured by the {\em{advantage}} or gap between a score or a function computed on the…

Machine Learning · Computer Science 2024-05-27 Ruihan Wu , Pengrun Huang , Kamalika Chaudhuri

Privacy-Preserving Eye-tracking Using Deep Learning

The expanding usage of complex machine learning methods like deep learning has led to an explosion in human activity recognition, particularly applied to health. In particular, as part of a larger body sensor network system, face and…

Computer Vision and Pattern Recognition · Computer Science 2021-06-24 Salman Seyedi , Zifan Jiang , Allan Levey , Gari D. Clifford

Confidence Is All You Need for MI Attacks

In this evolving era of machine learning security, membership inference attacks have emerged as a potent threat to the confidentiality of sensitive data. In this attack, adversaries aim to determine whether a particular point was used…

Machine Learning · Computer Science 2024-06-21 Abhishek Sinha , Himanshi Tibrewal , Mansi Gupta , Nikhar Waghela , Shivank Garg

Transfer Learning-Based Model Protection With Secret Key

We propose a novel method for protecting trained models with a secret key so that unauthorized users without the correct key cannot get the correct inference. By taking advantage of transfer learning, the proposed method enables us to train…

Machine Learning · Computer Science 2021-03-08 MaungMaung AprilPyone , Hitoshi Kiya

Decoupling Inherent Risk and Early Cancer Signs in Image-based Breast Cancer Risk Models

The ability to accurately estimate risk of developing breast cancer would be invaluable for clinical decision-making. One promising new approach is to integrate image-based risk models based on deep neural networks. However, one must take…

Image and Video Processing · Electrical Eng. & Systems 2020-09-17 Yue Liu , Hossein Azizpour , Fredrik Strand , Kevin Smith

Securing Traffic Sign Recognition Systems in Autonomous Vehicles

Deep Neural Networks (DNNs) are widely used for traffic sign recognition because they can automatically extract high-level features from images. These DNNs are trained on large-scale datasets obtained from unknown sources. Therefore, it is…

Computer Vision and Pattern Recognition · Computer Science 2025-06-10 Thushari Hapuarachchi , Long Dang , Kaiqi Xiong