Related papers: An Embarrassingly Simple Backdoor Attack on Self-s…

The Perils of Learning From Unlabeled Data: Backdoor Attacks on Semi-supervised Learning

Semi-supervised machine learning (SSL) is gaining popularity as it reduces the cost of training ML models. It does so by using very small amounts of (expensive, well-inspected) labeled data and large amounts of (cheap, non-inspected)…

Cryptography and Security · Computer Science 2022-11-02 Virat Shejwalkar , Lingjuan Lyu , Amir Houmansadr

Towards Adversarial Robustness And Backdoor Mitigation in SSL

Self-Supervised Learning (SSL) has shown great promise in learning representations from unlabeled data. The power of learning representations without the need for human annotations has made SSL a widely used technique in real-world…

Computer Vision and Pattern Recognition · Computer Science 2024-09-17 Aryan Satpathy , Nilaksh Singh , Dhruva Rajwade , Somesh Kumar

Invisible Backdoor Attack against Self-supervised Learning

Self-supervised learning (SSL) models are vulnerable to backdoor attacks. Existing backdoor attacks that are effective in SSL often involve noticeable triggers, like colored patches or visible noise, which are vulnerable to human…

Computer Vision and Pattern Recognition · Computer Science 2025-04-04 Hanrong Zhang , Zhenting Wang , Boheng Li , Fulin Lin , Tingxu Han , Mingyu Jin , Chenlu Zhan , Mengnan Du , Hongwei Wang , Shiqing Ma

How to Craft Backdoors with Unlabeled Data Alone?

Relying only on unlabeled data, Self-supervised learning (SSL) can learn rich features in an economical and scalable way. As the drive-horse for building foundation models, SSL has received a lot of attention recently with wide…

Machine Learning · Computer Science 2024-04-24 Yifei Wang , Wenhan Ma , Stefanie Jegelka , Yisen Wang

Defending Against Patch-based Backdoor Attacks on Self-Supervised Learning

Recently, self-supervised learning (SSL) was shown to be vulnerable to patch-based data poisoning backdoor attacks. It was shown that an adversary can poison a small part of the unlabeled data so that when a victim trains an SSL model on…

Computer Vision and Pattern Recognition · Computer Science 2023-04-05 Ajinkya Tejankar , Maziar Sanjabi , Qifan Wang , Sinong Wang , Hamed Firooz , Hamed Pirsiavash , Liang Tan

Self-supervised Learning is More Robust to Dataset Imbalance

Self-supervised learning (SSL) is a scalable way to learn general visual representations since it learns without labels. However, large-scale unlabeled datasets in the wild often have long-tailed label distributions, where we know little…

Machine Learning · Computer Science 2022-05-24 Hong Liu , Jeff Z. HaoChen , Adrien Gaidon , Tengyu Ma

Backdoor Attacks on Self-Supervised Learning

Large-scale unlabeled data has spurred recent progress in self-supervised learning methods that learn rich visual representations. State-of-the-art self-supervised methods for learning representations from images (e.g., MoCo, BYOL, MSF) use…

Computer Vision and Pattern Recognition · Computer Science 2022-06-10 Aniruddha Saha , Ajinkya Tejankar , Soroush Abbasi Koohpayegani , Hamed Pirsiavash

Filter, Obstruct and Dilute: Defending Against Backdoor Attacks on Semi-Supervised Learning

Recent studies have verified that semi-supervised learning (SSL) is vulnerable to data poisoning backdoor attacks. Even a tiny fraction of contaminated training data is sufficient for adversaries to manipulate up to 90\% of the test outputs…

Machine Learning · Computer Science 2025-02-11 Xinrui Wang , Chuanxing Geng , Wenhai Wan , Shao-yuan Li , Songcan Chen

Robust Deep Semi-Supervised Learning: A Brief Introduction

Semi-supervised learning (SSL) is the branch of machine learning that aims to improve learning performance by leveraging unlabeled data when labels are insufficient. Recently, SSL with deep models has proven to be successful on standard…

Machine Learning · Computer Science 2022-11-15 Lan-Zhe Guo , Zhi Zhou , Yu-Feng Li

On the Difficulty of Defending Contrastive Learning against Backdoor Attacks

Recent studies have shown that contrastive learning, like supervised learning, is highly vulnerable to backdoor attacks wherein malicious functions are injected into target models, only to be activated by specific triggers. However, thus…

Cryptography and Security · Computer Science 2023-12-15 Changjiang Li , Ren Pang , Bochuan Cao , Zhaohan Xi , Jinghui Chen , Shouling Ji , Ting Wang

SSL-Cleanse: Trojan Detection and Mitigation in Self-Supervised Learning

Self-supervised learning (SSL) is a prevalent approach for encoding data representations. Using a pre-trained SSL image encoder and subsequently training a downstream classifier, impressive performance can be achieved on various tasks with…

Cryptography and Security · Computer Science 2024-07-18 Mengxin Zheng , Jiaqi Xue , Zihao Wang , Xun Chen , Qian Lou , Lei Jiang , Xiaofeng Wang

Semi-Supervised Learning with Scarce Annotations

While semi-supervised learning (SSL) algorithms provide an efficient way to make use of both labelled and unlabelled data, they generally struggle when the number of annotated samples is very small. In this work, we consider the problem of…

Computer Vision and Pattern Recognition · Computer Science 2020-04-23 Sylvestre-Alvise Rebuffi , Sebastien Ehrhardt , Kai Han , Andrea Vedaldi , Andrew Zisserman

Reverse Engineering Self-Supervised Learning

Self-supervised learning (SSL) is a powerful tool in machine learning, but understanding the learned representations and their underlying mechanisms remains a challenge. This paper presents an in-depth empirical analysis of SSL-trained…

Machine Learning · Computer Science 2023-06-01 Ido Ben-Shaul , Ravid Shwartz-Ziv , Tomer Galanti , Shai Dekel , Yann LeCun

Erasing Self-Supervised Learning Backdoor by Cluster Activation Masking

Self-Supervised Learning (SSL) is an effective paradigm for learning representations from unlabeled data, such as text, images, and videos. However, researchers have recently found that SSL is vulnerable to backdoor attacks. The attacker…

Computer Vision and Pattern Recognition · Computer Science 2024-11-04 Shengsheng Qian , Dizhan Xue , Yifei Wang , Shengjie Zhang , Huaiwen Zhang , Changsheng Xu

Making Self-supervised Learning Robust to Spurious Correlation via Learning-speed Aware Sampling

Self-supervised learning (SSL) has emerged as a powerful technique for learning rich representations from unlabeled data. The data representations are able to capture many underlying attributes of data, and be useful in downstream…

Machine Learning · Computer Science 2023-12-01 Weicheng Zhu , Sheng Liu , Carlos Fernandez-Granda , Narges Razavian

Rethinking Backdoor Data Poisoning Attacks in the Context of Semi-Supervised Learning

Semi-supervised learning methods can train high-accuracy machine learning models with a fraction of the labeled training samples required for traditional supervised learning. Such methods do not typically involve close review of the…

Machine Learning · Computer Science 2022-12-07 Marissa Connor , Vincent Emanuele

Rethinking Self-Supervised Learning: Small is Beautiful

Self-supervised learning (SSL), in particular contrastive learning, has made great progress in recent years. However, a common theme in these methods is that they inherit the learning paradigm from the supervised deep learning scenario.…

Computer Vision and Pattern Recognition · Computer Science 2021-03-26 Yun-Hao Cao , Jianxin Wu

A Survey on Self-supervised Learning: Algorithms, Applications, and Future Trends

Deep supervised learning algorithms typically require a large volume of labeled data to achieve satisfactory performance. However, the process of collecting and labeling such data can be expensive and time-consuming. Self-supervised…

Machine Learning · Computer Science 2024-07-16 Jie Gui , Tuo Chen , Jing Zhang , Qiong Cao , Zhenan Sun , Hao Luo , Dacheng Tao

On the Difficulty of Defending Self-Supervised Learning against Model Extraction

Self-Supervised Learning (SSL) is an increasingly popular ML paradigm that trains models to transform complex inputs into representations without relying on explicit labels. These representations encode similarity structures that enable…

Machine Learning · Computer Science 2022-06-30 Adam Dziedzic , Nikita Dhawan , Muhammad Ahmad Kaleem , Jonas Guan , Nicolas Papernot

A Closer Look at Benchmarking Self-Supervised Pre-training with Image Classification

Self-supervised learning (SSL) is a machine learning approach where the data itself provides supervision, eliminating the need for external labels. The model is forced to learn about the data structure or context by solving a pretext task.…

Computer Vision and Pattern Recognition · Computer Science 2024-07-19 Markus Marks , Manuel Knott , Neehar Kondapaneni , Elijah Cole , Thijs Defraeye , Fernando Perez-Cruz , Pietro Perona