Related papers: Defending Against Patch-based Backdoor Attacks on …

Backdoor Attacks on Self-Supervised Learning

Large-scale unlabeled data has spurred recent progress in self-supervised learning methods that learn rich visual representations. State-of-the-art self-supervised methods for learning representations from images (e.g., MoCo, BYOL, MSF) use…

Computer Vision and Pattern Recognition · Computer Science 2022-06-10 Aniruddha Saha , Ajinkya Tejankar , Soroush Abbasi Koohpayegani , Hamed Pirsiavash

An Embarrassingly Simple Backdoor Attack on Self-supervised Learning

As a new paradigm in machine learning, self-supervised learning (SSL) is capable of learning high-quality representations of complex data without relying on labels. In addition to eliminating the need for labeled data, research has found…

Cryptography and Security · Computer Science 2023-08-15 Changjiang Li , Ren Pang , Zhaohan Xi , Tianyu Du , Shouling Ji , Yuan Yao , Ting Wang

Erasing Self-Supervised Learning Backdoor by Cluster Activation Masking

Self-Supervised Learning (SSL) is an effective paradigm for learning representations from unlabeled data, such as text, images, and videos. However, researchers have recently found that SSL is vulnerable to backdoor attacks. The attacker…

Computer Vision and Pattern Recognition · Computer Science 2024-11-04 Shengsheng Qian , Dizhan Xue , Yifei Wang , Shengjie Zhang , Huaiwen Zhang , Changsheng Xu

The Perils of Learning From Unlabeled Data: Backdoor Attacks on Semi-supervised Learning

Semi-supervised machine learning (SSL) is gaining popularity as it reduces the cost of training ML models. It does so by using very small amounts of (expensive, well-inspected) labeled data and large amounts of (cheap, non-inspected)…

Cryptography and Security · Computer Science 2022-11-02 Virat Shejwalkar , Lingjuan Lyu , Amir Houmansadr

Invisible Backdoor Attack against Self-supervised Learning

Self-supervised learning (SSL) models are vulnerable to backdoor attacks. Existing backdoor attacks that are effective in SSL often involve noticeable triggers, like colored patches or visible noise, which are vulnerable to human…

Computer Vision and Pattern Recognition · Computer Science 2025-04-04 Hanrong Zhang , Zhenting Wang , Boheng Li , Fulin Lin , Tingxu Han , Mingyu Jin , Chenlu Zhan , Mengnan Du , Hongwei Wang , Shiqing Ma

Filter, Obstruct and Dilute: Defending Against Backdoor Attacks on Semi-Supervised Learning

Recent studies have verified that semi-supervised learning (SSL) is vulnerable to data poisoning backdoor attacks. Even a tiny fraction of contaminated training data is sufficient for adversaries to manipulate up to 90\% of the test outputs…

Machine Learning · Computer Science 2025-02-11 Xinrui Wang , Chuanxing Geng , Wenhai Wan , Shao-yuan Li , Songcan Chen

Towards Adversarial Robustness And Backdoor Mitigation in SSL

Self-Supervised Learning (SSL) has shown great promise in learning representations from unlabeled data. The power of learning representations without the need for human annotations has made SSL a widely used technique in real-world…

Computer Vision and Pattern Recognition · Computer Science 2024-09-17 Aryan Satpathy , Nilaksh Singh , Dhruva Rajwade , Somesh Kumar

Towards Understanding How Self-training Tolerates Data Backdoor Poisoning

Recent studies on backdoor attacks in model training have shown that polluting a small portion of training data is sufficient to produce incorrect manipulated predictions on poisoned test-time data while maintaining high clean accuracy in…

Machine Learning · Computer Science 2023-01-24 Soumyadeep Pal , Ren Wang , Yuguang Yao , Sijia Liu

Variance-Based Defense Against Blended Backdoor Attacks

Backdoor attacks represent a subtle yet effective class of cyberattacks targeting AI models, primarily due to their stealthy nature. The model behaves normally on clean data but exhibits malicious behavior only when the attacker embeds a…

Machine Learning · Computer Science 2025-09-29 Sujeevan Aseervatham , Achraf Kerzazi , Younès Bennani

Backdoor Defense through Self-Supervised and Generative Learning

Backdoor attacks change a small portion of training data by introducing hand-crafted triggers and rewiring the corresponding labels towards a desired target class. Training on such data injects a backdoor which causes malicious inference in…

Machine Learning · Computer Science 2024-09-05 Ivan Sabolić , Ivan Grubišić , Siniša Šegvić

Client-Side Patching against Backdoor Attacks in Federated Learning

Federated learning is a versatile framework for training models in decentralized environments. However, the trust placed in clients makes federated learning vulnerable to backdoor attacks launched by malicious participants. While many…

Cryptography and Security · Computer Science 2024-12-23 Borja Molina-Coronado

Rethinking Backdoor Data Poisoning Attacks in the Context of Semi-Supervised Learning

Semi-supervised learning methods can train high-accuracy machine learning models with a fraction of the labeled training samples required for traditional supervised learning. Such methods do not typically involve close review of the…

Machine Learning · Computer Science 2022-12-07 Marissa Connor , Vincent Emanuele

Backdoor Defense via Decoupling the Training Process

Recent studies have revealed that deep neural networks (DNNs) are vulnerable to backdoor attacks, where attackers embed hidden backdoors in the DNN model by poisoning a few training samples. The attacked model behaves normally on benign…

Cryptography and Security · Computer Science 2022-02-09 Kunzhe Huang , Yiming Li , Baoyuan Wu , Zhan Qin , Kui Ren

Beating Backdoor Attack at Its Own Game

Deep neural networks (DNNs) are vulnerable to backdoor attack, which does not affect the network's performance on clean data but would manipulate the network behavior once a trigger pattern is added. Existing defense methods have greatly…

Machine Learning · Computer Science 2025-04-08 Min Liu , Alberto Sangiovanni-Vincentelli , Xiangyu Yue

How to Craft Backdoors with Unlabeled Data Alone?

Relying only on unlabeled data, Self-supervised learning (SSL) can learn rich features in an economical and scalable way. As the drive-horse for building foundation models, SSL has received a lot of attention recently with wide…

Machine Learning · Computer Science 2024-04-24 Yifei Wang , Wenhan Ma , Stefanie Jegelka , Yisen Wang

Reverse Engineering Imperceptible Backdoor Attacks on Deep Neural Networks for Detection and Training Set Cleansing

Backdoor data poisoning is an emerging form of adversarial attack usually against deep neural network image classifiers. The attacker poisons the training set with a relatively small set of images from one (or several) source class(es),…

Machine Learning · Computer Science 2020-10-16 Zhen Xiang , David J. Miller , George Kesidis

SecureSplit: Mitigating Backdoor Attacks in Split Learning

Split Learning (SL) offers a framework for collaborative model training that respects data privacy by allowing participants to share the same dataset while maintaining distinct feature sets. However, SL is susceptible to backdoor attacks,…

Cryptography and Security · Computer Science 2026-01-27 Zhihao Dou , Dongfei Cui , Weida Wang , Anjun Gao , Yueyang Quan , Mengyao Ma , Viet Vo , Guangdong Bai , Zhuqing Liu , Minghong Fang

Training set cleansing of backdoor poisoning by self-supervised representation learning

A backdoor or Trojan attack is an important type of data poisoning attack against deep neural network (DNN) classifiers, wherein the training dataset is poisoned with a small number of samples that each possess the backdoor pattern (usually…

Machine Learning · Computer Science 2023-03-15 H. Wang , S. Karami , O. Dia , H. Ritter , E. Emamjomeh-Zadeh , J. Chen , Z. Xiang , D. J. Miller , G. Kesidis

Semantic Shield: Defending Vision-Language Models Against Backdooring and Poisoning via Fine-grained Knowledge Alignment

In recent years there has been enormous interest in vision-language models trained using self-supervised objectives. However, the use of large-scale datasets scraped from the web for training also makes these models vulnerable to potential…

Computer Vision and Pattern Recognition · Computer Science 2024-11-26 Alvi Md Ishmam , Christopher Thomas

Mitigating Backdoor Attack by Injecting Proactive Defensive Backdoor

Data-poisoning backdoor attacks are serious security threats to machine learning models, where an adversary can manipulate the training dataset to inject backdoors into models. In this paper, we focus on in-training backdoor defense, aiming…

Cryptography and Security · Computer Science 2024-10-16 Shaokui Wei , Hongyuan Zha , Baoyuan Wu