Related papers: A Benchmark Generative Probabilistic Model for Wea…

Learning Hyper Label Model for Programmatic Weak Supervision

To reduce the human annotation efforts, the programmatic weak supervision (PWS) paradigm abstracts weak supervision sources as labeling functions (LFs) and involves a label model to aggregate the output of multiple LFs to produce training…

Machine Learning · Computer Science 2023-03-09 Renzhi Wu , Shen-En Chen , Jieyu Zhang , Xu Chu

Creating Training Sets via Weak Indirect Supervision

Creating labeled training sets has become one of the major roadblocks in machine learning. To address this, recent \emph{Weak Supervision (WS)} frameworks synthesize training labels from multiple potentially noisy supervision sources.…

Machine Learning · Computer Science 2022-03-16 Jieyu Zhang , Bohan Wang , Xiangchen Song , Yujing Wang , Yaming Yang , Jing Bai , Alexander Ratner

Learning with Partial Labels from Semi-supervised Perspective

Partial Label (PL) learning refers to the task of learning from the partially labeled data, where each training instance is ambiguously equipped with a set of candidate labels but only one is valid. Advances in the recent deep PL learning…

Machine Learning · Computer Science 2022-12-01 Ximing Li , Yuanzhi Jiang , Changchun Li , Yiyuan Wang , Jihong Ouyang

Generative Modeling Helps Weak Supervision (and Vice Versa)

Many promising applications of supervised machine learning face hurdles in the acquisition of labeled data in sufficient quantity and quality, creating an expensive bottleneck. To overcome such limitations, techniques that do not depend on…

Machine Learning · Computer Science 2023-03-14 Benedikt Boecking , Nicholas Roberts , Willie Neiswanger , Stefano Ermon , Frederic Sala , Artur Dubrawski

Pseudo-labelling meets Label Smoothing for Noisy Partial Label Learning

We motivate weakly supervised learning as an effective learning paradigm for problems where curating perfectly annotated datasets is expensive and may require domain expertise such as fine-grained classification. We focus on Partial Label…

Computer Vision and Pattern Recognition · Computer Science 2025-06-09 Darshana Saravanan , Naresh Manwani , Vineet Gandhi

Learning to Rank from Samples of Variable Quality

Training deep neural networks requires many training samples, but in practice, training labels are expensive to obtain and may be of varying quality, as some may be from trusted expert labelers while others might be from heuristics or other…

Information Retrieval · Computer Science 2018-06-25 Mostafa Dehghani , Jaap Kamps

Seq-UPS: Sequential Uncertainty-aware Pseudo-label Selection for Semi-Supervised Text Recognition

This paper looks at semi-supervised learning (SSL) for image-based text recognition. One of the most popular SSL approaches is pseudo-labeling (PL). PL approaches assign labels to unlabeled data before re-training the model with a…

Computer Vision and Pattern Recognition · Computer Science 2022-10-10 Gaurav Patel , Jan Allebach , Qiang Qiu

A Survey on Programmatic Weak Supervision

Labeling training data has become one of the major roadblocks to using machine learning. Among various weak supervision paradigms, programmatic weak supervision (PWS) has achieved remarkable success in easing the manual labeling bottleneck…

Machine Learning · Computer Science 2022-02-15 Jieyu Zhang , Cheng-Yu Hsieh , Yue Yu , Chao Zhang , Alexander Ratner

Convex and Scalable Weakly Labeled SVMs

In this paper, we study the problem of learning from weakly labeled data, where labels of the training examples are incomplete. This includes, for example, (i) semi-supervised learning where labels are partially known; (ii) multi-instance…

Machine Learning · Computer Science 2020-07-07 Yu-Feng Li , Ivor W. Tsang , James T. Kwok , Zhi-Hua Zhou

Probabilistic Neural Network Training for Semi-Supervised Classifiers

In this paper, we propose another version of help-training approach by employing a Probabilistic Neural Network (PNN) that improves the performance of the main discriminative classifier in the semi-supervised strategy. We introduce the…

Machine Learning · Computer Science 2015-09-07 Hamidreza Farhidzadeh

Losses over Labels: Weakly Supervised Learning via Direct Loss Construction

Owing to the prohibitive costs of generating large amounts of labeled data, programmatic weak supervision is a growing paradigm within machine learning. In this setting, users design heuristics that provide noisy labels for subsets of the…

Machine Learning · Computer Science 2023-10-06 Dylan Sam , J. Zico Kolter

Fusing Conditional Submodular GAN and Programmatic Weak Supervision

Programmatic Weak Supervision (PWS) and generative models serve as crucial tools that enable researchers to maximize the utility of existing datasets without resorting to laborious data gathering and manual annotation processes. PWS uses…

Computer Vision and Pattern Recognition · Computer Science 2023-12-19 Kumar Shubham , Pranav Sastry , Prathosh AP

Learning from Multiple Noisy Partial Labelers

Programmatic weak supervision creates models without hand-labeled training data by combining the outputs of heuristic labelers. Existing frameworks make the restrictive assumption that labelers output a single class label. Enabling users to…

Machine Learning · Computer Science 2022-03-28 Peilin Yu , Tiffany Ding , Stephen H. Bach

Active WeaSuL: Improving Weak Supervision with Active Learning

The availability of labelled data is one of the main limitations in machine learning. We can alleviate this using weak supervision: a framework that uses expert-defined rules $\boldsymbol{\lambda}$ to estimate probabilistic labels…

Machine Learning · Computer Science 2021-05-03 Samantha Biegel , Rafah El-Khatib , Luiz Otavio Vilas Boas Oliveira , Max Baak , Nanne Aben

Weakly Supervised Pseudo-Label assisted Learning for ALS Point Cloud Semantic Segmentation

Competitive point cloud semantic segmentation results usually rely on a large amount of labeled data. However, data annotation is a time-consuming and labor-intensive task, particularly for three-dimensional point cloud data. Thus,…

Computer Vision and Pattern Recognition · Computer Science 2021-05-06 Puzuo Wang , Wei Yao

Weakly Supervised Label Learning Flows

Supervised learning usually requires a large amount of labelled data. However, attaining ground-truth labels is costly for many tasks. Alternatively, weakly supervised methods learn with cheap weak signals that only approximately label some…

Machine Learning · Computer Science 2024-11-26 You Lu , Wenzhuo Song , Chidubem Arachie , Bert Huang

Pseudo strong labels for large scale weakly supervised audio tagging

Large-scale audio tagging datasets inevitably contain imperfect labels, such as clip-wise annotated (temporally weak) tags with no exact on- and offsets, due to a high manual labeling cost. This work proposes pseudo strong labels (PSL), a…

Sound · Computer Science 2022-04-29 Heinrich Dinkel , Zhiyong Yan , Yongqing Wang , Junbo Zhang , Yujun Wang

Pseudo-Labeling Based Practical Semi-Supervised Meta-Training for Few-Shot Learning

Most existing few-shot learning (FSL) methods require a large amount of labeled data in meta-training, which is a major limit. To reduce the requirement of labels, a semi-supervised meta-training (SSMT) setting has been proposed for FSL,…

Computer Vision and Pattern Recognition · Computer Science 2025-05-20 Xingping Dong , Tianran Ouyang , Shengcai Liao , Bo Du , Ling Shao

Bridging Weakly-Supervised Learning and VLM Distillation: Noisy Partial Label Learning for Efficient Downstream Adaptation

In the context of noisy partial label learning (NPLL), each training sample is associated with a set of candidate labels annotated by multiple noisy annotators. With the emergence of high-performance pre-trained vision-language models…

Computer Vision and Pattern Recognition · Computer Science 2026-01-30 Qian-Wei Wang , Yaguang Song , Shu-Tao Xia

PRBoost: Prompt-Based Rule Discovery and Boosting for Interactive Weakly-Supervised Learning

Weakly-supervised learning (WSL) has shown promising results in addressing label scarcity on many NLP tasks, but manually designing a comprehensive, high-quality labeling rule set is tedious and difficult. We study interactive…

Computation and Language · Computer Science 2022-03-21 Rongzhi Zhang , Yue Yu , Pranav Shetty , Le Song , Chao Zhang