Related papers: Prototype Guided Backdoor Defense

Stealthy and Adjustable Text-Guided Backdoor Attacks on Multimodal Pretrained Models

Multimodal pretrained models are vulnerable to backdoor attacks, yet most existing methods rely on visual or multimodal triggers, which are impractical since visually embedded triggers rarely occur in real-world data. To overcome this…

Cryptography and Security · Computer Science 2026-04-08 Yiyang Zhang , Chaojian Yu , Ziming Hong , Yuanjie Shao , Qinmu Peng , Tongliang Liu , Xinge You

Mitigating Backdoor Attack by Injecting Proactive Defensive Backdoor

Data-poisoning backdoor attacks are serious security threats to machine learning models, where an adversary can manipulate the training dataset to inject backdoors into models. In this paper, we focus on in-training backdoor defense, aiming…

Cryptography and Security · Computer Science 2024-10-16 Shaokui Wei , Hongyuan Zha , Baoyuan Wu

Variance-Based Defense Against Blended Backdoor Attacks

Backdoor attacks represent a subtle yet effective class of cyberattacks targeting AI models, primarily due to their stealthy nature. The model behaves normally on clean data but exhibits malicious behavior only when the attacker embeds a…

Machine Learning · Computer Science 2025-09-29 Sujeevan Aseervatham , Achraf Kerzazi , Younès Bennani

Proactive Disentangled Modeling of Trigger-Object Pairings for Backdoor Defense

Deep neural networks (DNNs) and generative AI (GenAI) are increasingly vulnerable to backdoor attacks, where adversaries embed triggers into inputs to cause models to misclassify or misinterpret target labels. Beyond traditional…

Computer Vision and Pattern Recognition · Computer Science 2025-08-05 Kyle Stein , Andrew A. Mahyari , Guillermo Francia , Eman El-Sheikh

Detecting Backdoor Attacks via Similarity in Semantic Communication Systems

Semantic communication systems, which leverage Generative AI (GAI) to transmit semantic meaning rather than raw data, are poised to revolutionize modern communications. However, they are vulnerable to backdoor attacks, a type of poisoning…

Cryptography and Security · Computer Science 2025-02-07 Ziyang Wei , Yili Jiang , Jiaqi Huang , Fangtian Zhong , Sohan Gyawali

Towards Backdoor Attacks and Defense in Robust Machine Learning Models

The introduction of robust optimisation has pushed the state-of-the-art in defending against adversarial attacks. Notably, the state-of-the-art projected gradient descent (PGD)-based training method has been shown to be universally and…

Computer Vision and Pattern Recognition · Computer Science 2023-01-12 Ezekiel Soremekun , Sakshi Udeshi , Sudipta Chattopadhyay

Prototype-Guided Robust Learning against Backdoor Attacks

Backdoor attacks poison the training data, causing the model to behave normally on clean inputs but predict attacker-chosen labels when trigger patterns are embedded into the input samples. Defending against such attacks is highly…

Cryptography and Security · Computer Science 2026-04-28 Wei Guo , Maura Pintor , Ambra Demontis , Battista Biggio

Robust Image Classification: Defensive Strategies against FGSM and PGD Adversarial Attacks

Adversarial attacks, particularly the Fast Gradient Sign Method (FGSM) and Projected Gradient Descent (PGD) pose significant threats to the robustness of deep learning models in image classification. This paper explores and refines defense…

Cryptography and Security · Computer Science 2025-05-15 Hetvi Waghela , Jaydip Sen , Sneha Rakshit

An Invisible Backdoor Attack Based On Semantic Feature

Backdoor attacks have severely threatened deep neural network (DNN) models in the past several years. These attacks can occur in almost every stage of the deep learning pipeline. Although the attacked model behaves normally on benign…

Computer Vision and Pattern Recognition · Computer Science 2024-05-21 Yangming Chen

SegPGD: An Effective and Efficient Adversarial Attack for Evaluating and Boosting Segmentation Robustness

Deep neural network-based image classifications are vulnerable to adversarial perturbations. The image classifications can be easily fooled by adding artificial small and imperceptible perturbations to input images. As one of the most…

Computer Vision and Pattern Recognition · Computer Science 2023-08-16 Jindong Gu , Hengshuang Zhao , Volker Tresp , Philip Torr

Backdoor Defense through Self-Supervised and Generative Learning

Backdoor attacks change a small portion of training data by introducing hand-crafted triggers and rewiring the corresponding labels towards a desired target class. Training on such data injects a backdoor which causes malicious inference in…

Machine Learning · Computer Science 2024-09-05 Ivan Sabolić , Ivan Grubišić , Siniša Šegvić

Semantic Shield: Defending Vision-Language Models Against Backdooring and Poisoning via Fine-grained Knowledge Alignment

In recent years there has been enormous interest in vision-language models trained using self-supervised objectives. However, the use of large-scale datasets scraped from the web for training also makes these models vulnerable to potential…

Computer Vision and Pattern Recognition · Computer Science 2024-11-26 Alvi Md Ishmam , Christopher Thomas

Test-Time Backdoor Defense via Detecting and Repairing

Deep neural networks have played a crucial part in many critical domains, such as autonomous driving, face recognition, and medical diagnosis. However, deep neural networks are facing security threats from backdoor attacks and can be…

Cryptography and Security · Computer Science 2023-11-30 Jiyang Guan , Jian Liang , Ran He

XGBD: Explanation-Guided Graph Backdoor Detection

Backdoor attacks pose a significant security risk to graph learning models. Backdoors can be embedded into the target model by inserting backdoor triggers into the training dataset, causing the model to make incorrect predictions when the…

Cryptography and Security · Computer Science 2023-08-09 Zihan Guan , Mengnan Du , Ninghao Liu

PSBD: Prediction Shift Uncertainty Unlocks Backdoor Detection

Deep neural networks are susceptible to backdoor attacks, where adversaries manipulate model predictions by inserting malicious samples into the training data. Currently, there is still a significant challenge in identifying suspicious…

Machine Learning · Computer Science 2025-04-17 Wei Li , Pin-Yu Chen , Sijia Liu , Ren Wang

Countering Backdoor Attacks in Image Recognition: A Survey and Evaluation of Mitigation Strategies

The widespread adoption of deep learning across various industries has introduced substantial challenges, particularly in terms of model explainability and security. The inherent complexity of deep learning models, while contributing to…

Cryptography and Security · Computer Science 2025-01-08 Kealan Dunnett , Reza Arablouei , Dimity Miller , Volkan Dedeoglu , Raja Jurdak

Backdoor Defense via Deconfounded Representation Learning

Deep neural networks (DNNs) are recently shown to be vulnerable to backdoor attacks, where attackers embed hidden backdoors in the DNN model by injecting a few poisoned examples into the training dataset. While extensive efforts have been…

Artificial Intelligence · Computer Science 2023-03-14 Zaixi Zhang , Qi Liu , Zhicai Wang , Zepu Lu , Qingyong Hu

Stealthy Targeted Backdoor Attacks against Image Captioning

In recent years, there has been an explosive growth in multimodal learning. Image captioning, a classical multimodal task, has demonstrated promising applications and attracted extensive research attention. However, recent studies have…

Cryptography and Security · Computer Science 2024-06-11 Wenshu Fan , Hongwei Li , Wenbo Jiang , Meng Hao , Shui Yu , Xiao Zhang

Hidden Trigger Backdoor Attacks

With the success of deep learning algorithms in various domains, studying adversarial attacks to secure deep models in real world applications has become an important research topic. Backdoor attacks are a form of adversarial attacks on…

Computer Vision and Pattern Recognition · Computer Science 2019-12-24 Aniruddha Saha , Akshayvarun Subramanya , Hamed Pirsiavash

Towards Unified Robustness Against Both Backdoor and Adversarial Attacks

Deep Neural Networks (DNNs) are known to be vulnerable to both backdoor and adversarial attacks. In the literature, these two types of attacks are commonly treated as distinct robustness problems and solved separately, since they belong to…

Computer Vision and Pattern Recognition · Computer Science 2024-05-29 Zhenxing Niu , Yuyao Sun , Qiguang Miao , Rong Jin , Gang Hua