Related papers: Confidence Regularized Self-Training

An Uncertainty-Aware Pseudo-Label Selection Framework using Regularized Conformal Prediction

Consistency regularization-based methods are prevalent in semi-supervised learning (SSL) algorithms due to their exceptional performance. However, they mainly depend on domain-specific data augmentations, which are not usable in domains…

Machine Learning · Computer Science 2023-09-29 Matin Moezzi

LST: Lexicon-Guided Self-Training for Few-Shot Text Classification

Self-training provides an effective means of using an extremely small amount of labeled data to create pseudo-labels for unlabeled data. Many state-of-the-art self-training approaches hinge on different regularization methods to prevent…

Computation and Language · Computer Science 2022-02-08 Hazel Kim , Jaeman Son , Yo-Sub Han

Two-phase Pseudo Label Densification for Self-training based Domain Adaptation

Recently, deep self-training approaches emerged as a powerful solution to the unsupervised domain adaptation. The self-training scheme involves iterative processing of target data; it generates target pseudo labels and retrains the network.…

Computer Vision and Pattern Recognition · Computer Science 2020-12-10 Inkyu Shin , Sanghyun Woo , Fei Pan , InSo Kweon

Continuous Pseudo-Labeling from the Start

Self-training (ST), or pseudo-labeling has sparked significant interest in the automatic speech recognition (ASR) community recently because of its success in harnessing unlabeled data. Unlike prior semi-supervised learning approaches that…

Machine Learning · Computer Science 2023-04-11 Dan Berrebbi , Ronan Collobert , Samy Bengio , Navdeep Jaitly , Tatiana Likhomanenko

Revisiting Self-Training with Regularized Pseudo-Labeling for Tabular Data

Recent progress in semi- and self-supervised learning has caused a rift in the long-held belief about the need for an enormous amount of labeled data for machine learning and the irrelevancy of unlabeled data. Although it has been…

Machine Learning · Computer Science 2023-03-14 Minwook Kim , Juseong Kim , Giltae Song

Cycle Self-Training for Domain Adaptation

Mainstream approaches for unsupervised domain adaptation (UDA) learn domain-invariant representations to narrow the domain shift. Recently, self-training has been gaining momentum in UDA, which exploits unlabeled target data by training…

Machine Learning · Computer Science 2021-11-01 Hong Liu , Jianmin Wang , Mingsheng Long

Contrastive Regularization for Semi-Supervised Learning

Consistency regularization on label predictions becomes a fundamental technique in semi-supervised learning, but it still requires a large number of training iterations for high performance. In this study, we analyze that the consistency…

Machine Learning · Computer Science 2022-06-10 Doyup Lee , Sungwoong Kim , Ildoo Kim , Yeongjae Cheon , Minsu Cho , Wook-Shin Han

CAST: Cluster-Aware Self-Training for Tabular Data via Reliable Confidence

Tabular data is one of the most widely used data modalities, encompassing numerous datasets with substantial amounts of unlabeled data. Despite this prevalence, there is a notable lack of simple and versatile methods for utilizing unlabeled…

Machine Learning · Computer Science 2024-08-30 Minwook Kim , Juseong Kim , Ki Beom Kim , Giltae Song

Neighborhood-Regularized Self-Training for Learning with Few Labels

Training deep neural networks (DNNs) with limited supervision has been a popular research topic as it can significantly alleviate the annotation burden. Self-training has been successfully applied in semi-supervised learning tasks, but one…

Machine Learning · Computer Science 2023-02-17 Ran Xu , Yue Yu , Hejie Cui , Xuan Kan , Yanqiao Zhu , Joyce Ho , Chao Zhang , Carl Yang

Neural Networks Against (and For) Self-Training: Classification with Small Labeled and Large Unlabeled Sets

We propose a semi-supervised text classifier based on self-training using one positive and one negative property of neural networks. One of the weaknesses of self-training is the semantic drift problem, where noisy pseudo-labels accumulate…

Computation and Language · Computer Science 2024-01-02 Payam Karisani

Regularizing Self-training for Unsupervised Domain Adaptation via Structural Constraints

Self-training based on pseudo-labels has emerged as a dominant approach for addressing conditional distribution shifts in unsupervised domain adaptation (UDA) for semantic segmentation problems. A notable drawback, however, is that this…

Computer Vision and Pattern Recognition · Computer Science 2023-05-02 Rajshekhar Das , Jonathan Francis , Sanket Vaibhav Mehta , Jean Oh , Emma Strubell , Jose Moura

CReST: A Class-Rebalancing Self-Training Framework for Imbalanced Semi-Supervised Learning

Semi-supervised learning on class-imbalanced data, although a realistic problem, has been under studied. While existing semi-supervised learning (SSL) methods are known to perform poorly on minority classes, we find that they still generate…

Computer Vision and Pattern Recognition · Computer Science 2021-06-18 Chen Wei , Kihyuk Sohn , Clayton Mellina , Alan Yuille , Fan Yang

Self-Training and Adversarial Background Regularization for Unsupervised Domain Adaptive One-Stage Object Detection

Deep learning-based object detectors have shown remarkable improvements. However, supervised learning-based methods perform poorly when the train data and the test data have different distributions. To address the issue, domain adaptation…

Computer Vision and Pattern Recognition · Computer Science 2019-09-04 Seunghyeon Kim , Jaehoon Choi , Taekyung Kim , Changick Kim

Consistency Regularization for Generalizable Source-free Domain Adaptation

Source-free domain adaptation (SFDA) aims to adapt a well-trained source model to an unlabelled target domain without accessing the source dataset, making it applicable in a variety of real-world scenarios. Existing SFDA methods ONLY assess…

Computer Vision and Pattern Recognition · Computer Science 2023-10-10 Longxiang Tang , Kai Li , Chunming He , Yulun Zhang , Xiu Li

Statistical and Algorithmic Insights for Semi-supervised Learning with Self-training

Self-training is a classical approach in semi-supervised learning which is successfully applied to a variety of machine learning problems. Self-training algorithm generates pseudo-labels for the unlabeled examples and progressively refines…

Machine Learning · Computer Science 2020-06-22 Samet Oymak , Talha Cihad Gulcu

The Role of Pseudo-labels in Self-training Linear Classifiers on High-dimensional Gaussian Mixture Data

Self-training (ST) is a simple yet effective semi-supervised learning method. However, why and how ST improves generalization performance by using potentially erroneous pseudo-labels is still not well understood. To deepen the understanding…

Machine Learning · Statistics 2024-05-08 Takashi Takahashi

Self-Training Meets Consistency: Improving LLMs' Reasoning with Consistency-Driven Rationale Evaluation

Self-training approach for large language models (LLMs) improves reasoning abilities by training the models on their self-generated rationales. Previous approaches have labeled rationales that produce correct answers for a given question as…

Machine Learning · Computer Science 2025-02-07 Jaehyeok Lee , Keisuke Sakaguchi , JinYeong Bak

Debiased Self-Training for Semi-Supervised Learning

Deep neural networks achieve remarkable performances on a wide range of tasks with the aid of large-scale labeled datasets. Yet these datasets are time-consuming and labor-exhaustive to obtain on realistic tasks. To mitigate the requirement…

Machine Learning · Computer Science 2022-11-10 Baixu Chen , Junguang Jiang , Ximei Wang , Pengfei Wan , Jianmin Wang , Mingsheng Long

Confidence Adaptive Regularization for Deep Learning with Noisy Labels

Recent studies on the memorization effects of deep neural networks on noisy labels show that the networks first fit the correctly-labeled training samples before memorizing the mislabeled samples. Motivated by this early-learning…

Machine Learning · Computer Science 2021-09-07 Yangdi Lu , Yang Bo , Wenbo He

Rethinking Self-training for Semi-supervised Landmark Detection: A Selection-free Approach

Self-training is a simple yet effective method for semi-supervised learning, during which pseudo-label selection plays an important role for handling confirmation bias. Despite its popularity, applying self-training to landmark detection…

Computer Vision and Pattern Recognition · Computer Science 2024-09-17 Haibo Jin , Haoxuan Che , Hao Chen