Related papers: A Benchmark Generative Probabilistic Model for Wea…

Generating Self-Guided Dense Annotations for Weakly Supervised Semantic Segmentation

Learning semantic segmentation models under image-level supervision is far more challenging than under fully supervised setting. Without knowing the exact pixel-label correspondence, most weakly-supervised methods rely on external models to…

Computer Vision and Pattern Recognition · Computer Science 2018-10-17 Zi-Yi Ke , Chiou-Ting Hsu

Semi-Supervised Learning in the Few-Shot Zero-Shot Scenario

Semi-Supervised Learning (SSL) is a framework that utilizes both labeled and unlabeled data to enhance model performance. Conventional SSL methods operate under the assumption that labeled and unlabeled data share the same label space.…

Computer Vision and Pattern Recognition · Computer Science 2023-11-16 Noam Fluss , Guy Hacohen , Daphna Weinshall

Training Complex Models with Multi-Task Weak Supervision

As machine learning models continue to increase in complexity, collecting large hand-labeled training sets has become one of the biggest roadblocks in practice. Instead, weaker forms of supervision that provide noisier but cheaper labels…

Machine Learning · Statistics 2018-12-10 Alexander Ratner , Braden Hancock , Jared Dunnmon , Frederic Sala , Shreyash Pandey , Christopher Ré

From Lazy to Prolific: Tackling Missing Labels in Open Vocabulary Extreme Classification by Positive-Unlabeled Sequence Learning

Open-vocabulary Extreme Multi-label Classification (OXMC) extends traditional XMC by allowing prediction beyond an extremely large, predefined label set (typically $10^3$ to $10^{12}$ labels), addressing the dynamic nature of real-world…

Information Retrieval · Computer Science 2025-01-10 Ranran Haoran Zhang , Bensu Uçar , Soumik Dey , Hansi Wu , Binbin Li , Rui Zhang

A Weakly Supervised Learning Approach based on Spectral Graph-Theoretic Grouping

In this study, a spectral graph-theoretic grouping strategy for weakly supervised classification is introduced, where a limited number of labelled samples and a larger set of unlabelled samples are used to construct a larger annotated…

Machine Learning · Computer Science 2015-08-04 Tameem Adel , Alexander Wong , Daniel Stashuk

Prompt-based Pseudo-labeling Strategy for Sample-Efficient Semi-Supervised Extractive Summarization

Semi-supervised learning (SSL) is a widely used technique in scenarios where labeled data is scarce and unlabeled data is abundant. While SSL is popular for image and text classification, it is relatively underexplored for the task of…

Computation and Language · Computer Science 2024-07-03 Gaurav Sahu , Olga Vechtomova , Issam H. Laradji

Generative Semi-supervised Learning with Meta-Optimized Synthetic Samples

Semi-supervised learning (SSL) is a promising approach for training deep classification models using labeled and unlabeled datasets. However, existing SSL methods rely on a large unlabeled dataset, which may not always be available in many…

Machine Learning · Computer Science 2023-09-29 Shin'ya Yamaguchi

Structured Prediction with Partial Labelling through the Infimum Loss

Annotating datasets is one of the main costs in nowadays supervised learning. The goal of weak supervision is to enable models to learn using only forms of labelling which are cheaper to collect, as partial labelling. This is a type of…

Machine Learning · Computer Science 2021-02-02 Vivien Cabannes , Alessandro Rudi , Francis Bach

Improving Weakly Supervised Temporal Action Localization by Bridging Train-Test Gap in Pseudo Labels

The task of weakly supervised temporal action localization targets at generating temporal boundaries for actions of interest, meanwhile the action category should also be classified. Pseudo-label-based methods, which serve as an effective…

Computer Vision and Pattern Recognition · Computer Science 2023-04-18 Jingqiu Zhou , Linjiang Huang , Liang Wang , Si Liu , Hongsheng Li

Webly Supervised Image Classification with Self-Contained Confidence

This paper focuses on webly supervised learning (WSL), where datasets are built by crawling samples from the Internet and directly using search queries as web labels. Although WSL benefits from fast and low-cost data collection, noises in…

Computer Vision and Pattern Recognition · Computer Science 2020-08-28 Jingkang Yang , Litong Feng , Weirong Chen , Xiaopeng Yan , Huabin Zheng , Ping Luo , Wayne Zhang

Learning Dependency Structures for Weak Supervision Models

Labeling training data is a key bottleneck in the modern machine learning pipeline. Recent weak supervision approaches combine labels from multiple noisy sources by estimating their accuracies without access to ground truth labels; however,…

Machine Learning · Statistics 2019-03-15 Paroma Varma , Frederic Sala , Ann He , Alexander Ratner , Christopher Ré

Towards Supervised Performance on Speaker Verification with Self-Supervised Learning by Leveraging Large-Scale ASR Models

Recent advancements in Self-Supervised Learning (SSL) have shown promising results in Speaker Verification (SV). However, narrowing the performance gap with supervised systems remains an ongoing challenge. Several studies have observed that…

Audio and Speech Processing · Electrical Eng. & Systems 2025-06-25 Victor Miara , Theo Lepage , Reda Dehak

Integrated Weak Learning

We introduce Integrated Weak Learning, a principled framework that integrates weak supervision into the training process of machine learning models. Our approach jointly trains the end-model and a label model that aggregates multiple…

Machine Learning · Computer Science 2022-06-22 Peter Hayes , Mingtian Zhang , Raza Habib , Jordan Burgess , Emine Yilmaz , David Barber

Boosting Semi-Supervised Semantic Segmentation with Probabilistic Representations

Recent breakthroughs in semi-supervised semantic segmentation have been developed through contrastive learning. In prevalent pixel-wise contrastive learning solutions, the model maps pixels to deterministic representations and regularizes…

Computer Vision and Pattern Recognition · Computer Science 2022-12-19 Haoyu Xie , Changqi Wang , Mingkai Zheng , Minjing Dong , Shan You , Chong Fu , Chang Xu

Semi-Supervised Few-Shot Adaptation of Vision-Language Models

Vision-language models (VLMs) pre-trained on large, heterogeneous data sources are becoming increasingly popular, providing rich multi-modal embeddings that enable efficient transfer to new tasks. A particularly relevant application is…

Computer Vision and Pattern Recognition · Computer Science 2026-03-04 Julio Silva-Rodríguez , Ender Konukoglu

FRACTAL: Fine-Grained Scoring from Aggregate Text Labels

Large language models (LLMs) are being increasingly tuned to power complex generation tasks such as writing, fact-seeking, querying and reasoning. Traditionally, human or model feedback for evaluating and further tuning LLM performance has…

Computation and Language · Computer Science 2024-04-09 Yukti Makhija , Priyanka Agrawal , Rishi Saket , Aravindan Raghuveer

Making Large Language Models Better Data Creators

Although large language models (LLMs) have advanced the state-of-the-art in NLP significantly, deploying them for downstream applications is still challenging due to cost, responsiveness, control, or concerns around privacy and security. As…

Computation and Language · Computer Science 2023-11-01 Dong-Ho Lee , Jay Pujara , Mohit Sewak , Ryen W. White , Sujay Kumar Jauhar

LLMaAA: Making Large Language Models as Active Annotators

Prevalent supervised learning methods in natural language processing (NLP) are notoriously data-hungry, which demand large amounts of high-quality annotated data. In practice, acquiring such data is a costly endeavor. Recently, the superior…

Computation and Language · Computer Science 2023-11-01 Ruoyu Zhang , Yanzeng Li , Yongliang Ma , Ming Zhou , Lei Zou

Evaluating Self-Supervised Speech Models via Text-Based LLMS

Self-Supervised Learning (SSL) has gained traction for its ability to learn rich representations with low labeling costs, applicable across diverse downstream tasks. However, assessing the downstream-task performance remains challenging due…

Sound · Computer Science 2025-10-07 Takashi Maekaku , Keita Goto , Jinchuan Tian , Yusuke Shinohara , Shinji Watanabe

Learning from Similarity-Confidence and Confidence-Difference

In practical machine learning applications, it is often challenging to assign accurate labels to data, and increasing the number of labeled instances is often limited. In such cases, Weakly Supervised Learning (WSL), which enables training…

Machine Learning · Computer Science 2026-03-24 Tomoya Tate , Kosuke Sugiyama , Masato Uchida