Related papers: A Benchmark Generative Probabilistic Model for Wea…

Multi-Agent VLMs Guided Self-Training with PNU Loss for Low-Resource Offensive Content Detection

Accurate detection of offensive content on social media demands high-quality labeled data; however, such data is often scarce due to the low prevalence of offensive instances and the high cost of manual annotation. To address this…

Machine Learning · Computer Science 2025-11-19 Han Wang , Deyi Ji , Junyu Lu , Lanyun Zhu , Hailong Zhang , Haiyang Wu , Liqun Liu , Peng Shu , Roy Ka-Wei Lee

High-fidelity Pseudo-labels for Boosting Weakly-Supervised Segmentation

Image-level weakly-supervised semantic segmentation (WSSS) reduces the usually vast data annotation cost by surrogate segmentation masks during training. The typical approach involves training an image classification network using global…

Computer Vision and Pattern Recognition · Computer Science 2024-06-10 Arvi Jonnarth , Yushan Zhang , Michael Felsberg

Pseudo Label Selection is a Decision Problem

Pseudo-Labeling is a simple and effective approach to semi-supervised learning. It requires criteria that guide the selection of pseudo-labeled data. The latter have been shown to crucially affect pseudo-labeling's generalization…

Machine Learning · Computer Science 2023-09-27 Julian Rodemann

PLMCL: Partial-Label Momentum Curriculum Learning for Multi-Label Image Classification

Multi-label image classification aims to predict all possible labels in an image. It is usually formulated as a partial-label learning problem, given the fact that it could be expensive in practice to annotate all labels in every training…

Computer Vision and Pattern Recognition · Computer Science 2022-08-23 Rabab Abdelfattah , Xin Zhang , Zhenyao Wu , Xinyi Wu , Xiaofeng Wang , Song Wang

Sparse Learning and Class Probability Estimation with Weighted Support Vector Machines

Classification and probability estimation are fundamental tasks with broad applications across modern machine learning and data science, spanning fields such as biology, medicine, engineering, and computer science. Recent development of…

Methodology · Statistics 2026-03-25 Liyun Zeng , Hao Helen Zhang

Pseudo-Labeling and Confirmation Bias in Deep Semi-Supervised Learning

Semi-supervised learning, i.e. jointly learning from labeled and unlabeled samples, is an active research topic due to its key role on relaxing human supervision. In the context of image classification, recent advances to learn from…

Computer Vision and Pattern Recognition · Computer Science 2020-06-30 Eric Arazo , Diego Ortego , Paul Albert , Noel E. O'Connor , Kevin McGuinness

Boosting Semi-Supervised Learning by bridging high and low-confidence predictions

Pseudo-labeling is a crucial technique in semi-supervised learning (SSL), where artificial labels are generated for unlabeled data by a trained model, allowing for the simultaneous training of labeled and unlabeled data in a supervised…

Computer Vision and Pattern Recognition · Computer Science 2023-08-16 Khanh-Binh Nguyen , Joon-Sung Yang

Progressive Representative Labeling for Deep Semi-Supervised Learning

Deep semi-supervised learning (SSL) has experienced significant attention in recent years, to leverage a huge amount of unlabeled data to improve the performance of deep learning with limited labeled data. Pseudo-labeling is a popular…

Computer Vision and Pattern Recognition · Computer Science 2021-08-16 Xiaopeng Yan , Riquan Chen , Litong Feng , Jingkang Yang , Huabin Zheng , Wayne Zhang

Generative Pseudo-Labeling for Pre-Ranking with LLMs

Pre-ranking is a critical stage in industrial recommendation systems, tasked with efficiently scoring thousands of recalled items for downstream ranking. A key challenge is the train-serving discrepancy: pre-ranking models are trained only…

Information Retrieval · Computer Science 2026-02-25 Junyu Bi , Xinting Niu , Daixuan Cheng , Kun Yuan , Tao Wang , Binbin Cao , Jian Wu , Yuning Jiang

In all LikelihoodS: How to Reliably Select Pseudo-Labeled Data for Self-Training in Semi-Supervised Learning

Self-training is a simple yet effective method within semi-supervised learning. The idea is to iteratively enhance training data by adding pseudo-labeled data. Its generalization performance heavily depends on the selection of these…

Machine Learning · Statistics 2023-03-03 Julian Rodemann , Christoph Jansen , Georg Schollmeyer , Thomas Augustin

Feedback-Driven Pseudo-Label Reliability Assessment: Redefining Thresholding for Semi-Supervised Semantic Segmentation

Semi-supervised learning leverages unlabeled data to enhance model performance, addressing the limitations of fully supervised approaches. Among its strategies, pseudo-supervision has proven highly effective, typically relying on one or…

Computer Vision and Pattern Recognition · Computer Science 2025-05-13 Negin Ghamsarian , Sahar Nasirihaghighi , Klaus Schoeffmann , Raphael Sznitman

MLSL: Multi-Level Self-Supervised Learning for Domain Adaptation with Spatially Independent and Semantically Consistent Labeling

Most of the recent Deep Semantic Segmentation algorithms suffer from large generalization errors, even when powerful hierarchical representation models based on convolutional neural networks have been employed. This could be attributed to…

Computer Vision and Pattern Recognition · Computer Science 2019-10-01 Javed Iqbal , Mohsen Ali

Text Prompt with Normality Guidance for Weakly Supervised Video Anomaly Detection

Weakly supervised video anomaly detection (WSVAD) is a challenging task. Generating fine-grained pseudo-labels based on weak-label and then self-training a classifier is currently a promising solution. However, since the existing methods…

Computer Vision and Pattern Recognition · Computer Science 2024-04-15 Zhiwei Yang , Jing Liu , Peng Wu

Semi-supervised Salient Object Detection with Effective Confidence Estimation

The success of existing salient object detection models relies on a large pixel-wise labeled training dataset, which is time-consuming and expensive to obtain. We study semi-supervised salient object detection, with access to a small number…

Computer Vision and Pattern Recognition · Computer Science 2023-11-28 Jiawei Liu , Jing Zhang , Nick Barnes

The Word is Mightier than the Label: Learning without Pointillistic Labels using Data Programming

Most advanced supervised Machine Learning (ML) models rely on vast amounts of point-by-point labelled training examples. Hand-labelling vast amounts of data may be tedious, expensive, and error-prone. Recently, some studies have explored…

Machine Learning · Computer Science 2021-08-27 Chufan Gao , Mononito Goswami

Can VLM Pseudo-Labels Train a Time-Series QA Model That Outperforms the VLM?

Time-series question answering (TSQA) tasks face significant challenges due to the lack of labeled data. Alternatively, with recent advancements in large-scale models, vision-language models (VLMs) have demonstrated the potential to analyze…

Machine Learning · Computer Science 2025-10-01 Takuya Fujimura , Kota Dohi , Natsuo Yamashita , Yohei Kawaguchi

Leveraging Instance Features for Label Aggregation in Programmatic Weak Supervision

Programmatic Weak Supervision (PWS) has emerged as a widespread paradigm to synthesize training labels efficiently. The core component of PWS is the label model, which infers true labels by aggregating the outputs of multiple noisy…

Machine Learning · Computer Science 2022-10-11 Jieyu Zhang , Linxin Song , Alexander Ratner

Semi-Supervised Few-Shot Learning with Prototypical Random Walks

Recent progress has shown that few-shot learning can be improved with access to unlabelled data, known as semi-supervised few-shot learning(SS-FSL). We introduce an SS-FSL approach, dubbed as Prototypical Random Walk Networks(PRWN), built…

Machine Learning · Computer Science 2021-02-10 Ahmed Ayyad , Yuchen Li , Nassir Navab , Shadi Albarqouni , Mohamed Elhoseiny

SemiNLL: A Framework of Noisy-Label Learning by Semi-Supervised Learning

Deep learning with noisy labels is a challenging task. Recent prominent methods that build on a specific sample selection (SS) strategy and a specific semi-supervised learning (SSL) model achieved state-of-the-art performance. Intuitively,…

Machine Learning · Computer Science 2020-12-03 Zhuowei Wang , Jing Jiang , Bo Han , Lei Feng , Bo An , Gang Niu , Guodong Long

Semi-Supervised Lifelong Language Learning

Lifelong learning aims to accumulate knowledge and alleviate catastrophic forgetting when learning tasks sequentially. However, existing lifelong language learning methods only focus on the supervised learning setting. Unlabeled data, which…

Computation and Language · Computer Science 2022-11-24 Yingxiu Zhao , Yinhe Zheng , Bowen Yu , Zhiliang Tian , Dongkyu Lee , Jian Sun , Haiyang Yu , Yongbin Li , Nevin L. Zhang