Related papers: SAT: Improving Semi-Supervised Text Classification…

200 papers

Unsupervised representation learning algorithms such as word2vec and ELMo improve the accuracy of many supervised NLP models, mainly because they can take advantage of large amounts of unlabeled text. However, the supervised models only…

Computation and Language · Computer Science · 2018-09-25 · Kevin Clark, Minh-Thang Luong, Christopher D. Manning, Quoc V. Le

Deep neural networks are gaining increasing popularity for the classic text classification task, due to their strong expressive power and reduced need for feature engineering. Despite such attractiveness, neural text classification…

Information Retrieval · Computer Science · 2018-09-13 · Yu Meng, Jiaming Shen, Chao Zhang, Jiawei Han

In this paper, we investigate self-supervised pre-training methods for document text recognition. Nowadays, large unlabeled datasets can be collected for many research tasks, including text recognition, but it is costly to annotate them.…

Computer Vision and Pattern Recognition · Computer Science · 2024-05-02 · Martin Kišš, Michal Hradiš

We consider the task of semi-supervised semantic segmentation, where we aim to produce pixel-wise semantic object masks given only a small number of human-labeled training examples. We focus on iterative self-training methods in which we…

Computer Vision and Pattern Recognition · Computer Science · 2022-05-02 · Eu Wern Teh, Terrance DeVries, Brendan Duke, Ruowei Jiang, Parham Aarabi, Graham W. Taylor

Text classification aims to effectively categorize documents into pre-defined categories. Traditional methods for text classification often rely on large amounts of manually annotated training data, making the process time-consuming and…

Computation and Language · Computer Science · 2023-11-02 · Daniel Hajialigol, Hanwen Liu, Xuan Wang

Pseudo-labelling is a popular technique in unsupervised domain adaptation for semantic segmentation. However, pseudo labels are noisy and inevitably have confirmation bias due to the discrepancy between source and target domains and…

Computer Vision and Pattern Recognition · Computer Science · 2022-04-15 · Wanyu Xu, Zengmao Wang, Wei Bian

Self-supervised learning (SSL) has emerged as a promising paradigm that presents supervisory signals to real-world problems, bypassing the extensive cost of manual labeling. Consequently, self-supervised anomaly detection (SSAD) has seen a…

Machine Learning · Computer Science · 2025-07-22 · Jaemin Yoo, Lingxiao Zhao, Leman Akoglu

Human-centric visual analysis plays a pivotal role in diverse applications, including surveillance, healthcare, and human-computer interaction. With the emergence of large-scale unlabeled human image datasets, there is an increasing need…

Computer Vision and Pattern Recognition · Computer Science · 2026-01-21 · Mingshuang Luo, Ruibing Hou, Bo Chao, Hong Chang, Zimo Liu, Yaowei Wang, Shiguang Shan

Learning from large amounts of unsupervised data and a small amount of supervision is an important open problem in computer vision. We propose a new semi-supervised learning method, Semantic Positives via Pseudo-Labels (SemPPL), that…

Computer Vision and Pattern Recognition · Computer Science · 2024-01-11 · Matko Bošnjak, Pierre H. Richemond, Nenad Tomasev, Florian Strub, Jacob C. Walker, Felix Hill, Lars Holger Buesing, Razvan Pascanu, Charles Blundell, Jovana Mitrovic

Self-training is a standard approach to semi-supervised learning where the learner's own predictions on unlabeled data are used as supervision during training. In this paper, we reinterpret this label assignment process as an optimal…

Machine Learning · Computer Science · 2021-06-15 · Kai Sheng Tai, Peter Bailis, Gregory Valiant

Self-supervised representation learning can mitigate the limitations in recognition tasks with few manually labeled data but abundant unlabeled data, a common scenario in sound event research. In this work, we explore unsupervised…

Sound · Computer Science · 2020-11-17 · Eduardo Fonseca, Diego Ortego, Kevin McGuinness, Noel E. O'Connor, Xavier Serra

In order to train robust deep learning models, large amounts of labeled data are required. However, in the absence of such large repositories of labeled data, unlabeled data can be exploited for the same purpose. Semi-supervised learning aims to…

Machine Learning · Computer Science · 2021-07-20 · Soumyadeep Ghosh, Sanjay Kumar, Janu Verma, Awanish Kumar

Predicting the trajectories of surrounding objects is a critical task for self-driving vehicles and many other autonomous systems. Recent works demonstrate that adversarial attacks on trajectory prediction, where small crafted perturbations…

Machine Learning · Computer Science · 2023-03-22 · Ruochen Jiao, Xiangguo Liu, Takami Sato, Qi Alfred Chen, Qi Zhu

This paper presents a study on semi-supervised learning to solve the visual attribute prediction problem. In many applications of vision algorithms, the precise recognition of visual attributes of objects is important but still challenging.…

Computer Vision and Pattern Recognition · Computer Science · 2020-07-15 · Minchul Shin

Semi-supervised learning on class-imbalanced data, although a realistic problem, has been understudied. While existing semi-supervised learning (SSL) methods are known to perform poorly on minority classes, we find that they still generate…

Computer Vision and Pattern Recognition · Computer Science · 2021-06-18 · Chen Wei, Kihyuk Sohn, Clayton Mellina, Alan Yuille, Fan Yang

Existing solutions to zero-shot text classification either conduct prompting with pre-trained language models, which is sensitive to the choices of templates, or rely on large-scale annotated data of relevant tasks for meta-tuning. In this…

Computation and Language · Computer Science · 2023-05-26 · Chaoqun Liu, Wenxuan Zhang, Guizhen Chen, Xiaobao Wu, Anh Tuan Luu, Chip Hong Chang, Lidong Bing

Micro-Action Recognition (MAR) aims to classify subtle human actions in video. However, annotating MAR datasets is particularly challenging due to the subtlety of actions. To this end, we introduce the setting of Semi-Supervised MAR…

Computer Vision and Pattern Recognition · Computer Science · 2025-04-11 · Yan Zhang, Lechao Cheng, Yaxiong Wang, Zhun Zhong, Meng Wang

Self-training is a well-known approach for semi-supervised learning. It consists of iteratively assigning pseudo-labels to unlabeled data for which the model is confident and treating them as labeled examples. For neural networks, softmax…

Machine Learning · Computer Science · 2024-04-04 · Ambroise Odonnat, Vasilii Feofanov, Ievgen Redko

Semi-supervised multi-label learning (SSMLL) is a powerful framework for leveraging unlabeled data to reduce the expensive cost of collecting precise multi-label annotations. Unlike semi-supervised learning, one cannot select the most…

Machine Learning · Computer Science · 2024-12-30 · Jia-Hao Xiao, Ming-Kun Xie, Heng-Bo Fan, Gang Niu, Masashi Sugiyama, Sheng-Jun Huang

Recent advancements in Deep and Self-Supervised Learning (SSL) have led to substantial improvements in Speech Emotion Recognition (SER) performance, reaching unprecedented levels. However, obtaining sufficient amounts of accurately labeled…

Computation and Language · Computer Science · 2025-02-25 · Bulat Khaertdinov, Pedro Jeuris, Annanda Sousa, Enrique Hortal