English
Related papers

Related papers: SAT: Improving Semi-Supervised Text Classification…

200 papers

Semi-supervised learning has been an effective paradigm for leveraging unlabeled data to reduce the reliance on labeled data. We propose CoMatch, a new semi-supervised learning method that unifies dominant approaches and addresses their…

Machine Learning · Computer Science 2021-03-04 Junnan Li , Caiming Xiong , Steven Hoi

We propose UnMixMatch, a semi-supervised learning framework which can learn effective representations from unconstrained unlabelled data in order to scale up performance. Most existing semi-supervised methods rely on the assumption that…

Machine Learning · Computer Science 2024-01-17 Shuvendu Roy , Ali Etemad

We investigate a strategy for improving the efficiency of contrastive learning of visual representations by leveraging a small amount of supervised information during pre-training. We propose a semi-supervised loss, SuNCEt, based on…

Machine Learning · Computer Science 2020-12-03 Mahmoud Assran , Nicolas Ballas , Lluis Castrejon , Michael Rabbat

State-of-the-art deep neural networks require large-scale labeled training data that is often expensive to obtain or not available for many tasks. Weak supervision in the form of domain-specific rules has been shown to be useful in such…

Computation and Language · Computer Science 2021-04-13 Giannis Karamanolakis , Subhabrata Mukherjee , Guoqing Zheng , Ahmed Hassan Awadallah

Self-supervised learning is a machine learning approach that generates implicit labels by learning underlined patterns and extracting discriminative features from unlabeled data without manual labelling. Contrastive learning introduces the…

Computer Vision and Pattern Recognition · Computer Science 2025-10-13 Asifullah Khan , Laiba Asmatullah , Anza Malik , Shahzaib Khan , Hamna Asif

Self-supervised pre-training using unlabeled data is widely used in automatic speech recognition. In this paper, we propose a new self-supervised pre-training approach to dealing with heterogeneous data. Instead of mixing all the data and…

Machine Learning · Computer Science 2025-09-10 Xiaodong Cui , A F M Saif , Brian Kingsbury , Tianyi Chen

Although attention mechanisms have become fundamental components of deep learning models, they are vulnerable to perturbations, which may degrade the prediction performance and model interpretability. Adversarial training (AT) for attention…

Computation and Language · Computer Science 2022-12-27 Shunsuke Kitada , Hitoshi Iyatomi

The lack of labeled data is a common challenge in speech classification tasks, particularly those requiring extensive subjective assessment, such as cognitive state classification. In this work, we propose a Semi-Supervised Learning (SSL)…

Audio and Speech Processing · Electrical Eng. & Systems 2025-05-01 Yuanchao Li , Zixing Zhang , Jing Han , Peter Bell , Catherine Lai

Constructing an organized dataset comprised of a large number of images and several captions for each image is a laborious task, which requires vast human effort. On the other hand, collecting a large number of images and sentences…

Computer Vision and Pattern Recognition · Computer Science 2019-11-22 Dong-Jin Kim , Jinsoo Choi , Tae-Hyun Oh , In So Kweon

While semi-supervised learning (SSL) algorithms provide an efficient way to make use of both labelled and unlabelled data, they generally struggle when the number of annotated samples is very small. In this work, we consider the problem of…

Computer Vision and Pattern Recognition · Computer Science 2020-04-23 Sylvestre-Alvise Rebuffi , Sebastien Ehrhardt , Kai Han , Andrea Vedaldi , Andrew Zisserman

Self-supervised sentence representation learning is the task of constructing an embedding space for sentences without relying on human annotation efforts. One straightforward approach is to finetune a pretrained language model (PLM) with a…

In medical image analysis, semi-supervised learning is an effective method to extract knowledge from a small amount of labeled data and a large amount of unlabeled data. This paper focuses on a popular pipeline known as self learning, and…

Computer Vision and Pattern Recognition · Computer Science 2020-08-10 Xinyue Huo , Lingxi Xie , Jianzhong He , Zijie Yang , Qi Tian

Semi- and weakly-supervised learning have recently attracted considerable attention in the object detection literature since they can alleviate the cost of annotation needed to successfully train deep learning models. State-of-art…

Computer Vision and Pattern Recognition · Computer Science 2022-06-20 Akhil Meethal , Marco Pedersoli , Zhongwen Zhu , Francisco Perdigon Romero , Eric Granger

Pre-training and self-training are two approaches to semi-supervised learning. The comparison between pre-training and self-training has been explored. However, the previous works led to confusing findings: self-training outperforms…

Computation and Language · Computer Science 2024-09-05 Yiheng Wang , Jiayu Lin , Zuoquan Lin

Pre-trained Transformer-based speech models have shown striking performance when fine-tuned on various downstream tasks such as automatic speech recognition and spoken language identification (SLID). However, the problem of domain mismatch…

Computation and Language · Computer Science 2023-12-13 Mohammed Maqsood Shaik , Dietrich Klakow , Badr M. Abdullah

In this work, we propose a simple yet effective semi-supervised learning approach called Augmented Distribution Alignment. We reveal that an essential sampling bias exists in semi-supervised learning due to the limited number of labeled…

Computer Vision and Pattern Recognition · Computer Science 2019-08-20 Qin Wang , Wen Li , Luc Van Gool

Until recently, the number of public real-world text images was insufficient for training scene text recognizers. Therefore, most modern training methods rely on synthetic data and operate in a fully supervised manner. Nevertheless, the…

Computer Vision and Pattern Recognition · Computer Science 2022-05-10 Aviad Aberdam , Roy Ganz , Shai Mazor , Ron Litman

This paper introduces SelfMatch, a semi-supervised learning method that combines the power of contrastive self-supervised learning and consistency regularization. SelfMatch consists of two stages: (1) self-supervised pre-training based on…

Machine Learning · Computer Science 2021-01-19 Byoungjip Kim , Jinho Choo , Yeong-Dae Kwon , Seongho Joe , Seungjai Min , Youngjune Gwon

Treating texts as images, combining prompts with textual labels for prompt tuning, and leveraging the alignment properties of CLIP have been successfully applied in zero-shot multi-label image recognition. Nonetheless, relying solely on…

Computer Vision and Pattern Recognition · Computer Science 2024-07-09 Haonan Xu , Dian Chao , Xiangyu Wu , Zhonghua Wan , Yang Yang

Entailment has been recognized as an important metric for evaluating natural language understanding (NLU) models, and recent studies have found that entailment pretraining benefits weakly supervised fine-tuning. In this work, we design a…

Computation and Language · Computer Science 2023-05-30 Jiaxin Ge , Hongyin Luo , Yoon Kim , James Glass