Related papers: Multi-Resolution Weak Supervision for Sequential D…

Training Complex Models with Multi-Task Weak Supervision

As machine learning models continue to increase in complexity, collecting large hand-labeled training sets has become one of the biggest roadblocks in practice. Instead, weaker forms of supervision that provide noisier but cheaper labels…

Machine Learning · Statistics 2018-12-10 Alexander Ratner , Braden Hancock , Jared Dunnmon , Frederic Sala , Shreyash Pandey , Christopher Ré

A Novel Technique for Robust Training of Deep Networks With Multisource Weak Labeled Remote Sensing Data

Deep learning has gained broad interest in remote sensing image scene classification thanks to the effectiveness of deep neural networks in extracting the semantics from complex data. However, deep networks require large amounts of training…

Computer Vision and Pattern Recognition · Computer Science 2025-10-08 Gianmarco Perantoni , Lorenzo Bruzzone

Learning Dependency Structures for Weak Supervision Models

Labeling training data is a key bottleneck in the modern machine learning pipeline. Recent weak supervision approaches combine labels from multiple noisy sources by estimating their accuracies without access to ground truth labels; however,…

Machine Learning · Statistics 2019-03-15 Paroma Varma , Frederic Sala , Ann He , Alexander Ratner , Christopher Ré

Data Consistency for Weakly Supervised Learning

In many applications, training machine learning models involves using large amounts of human-annotated data. Obtaining precise labels for the data is expensive. Instead, training with weak supervision provides a low-cost alternative. We…

Machine Learning · Computer Science 2022-02-09 Chidubem Arachie , Bert Huang

Integrated Weak Learning

We introduce Integrated Weak Learning, a principled framework that integrates weak supervision into the training process of machine learning models. Our approach jointly trains the end-model and a label model that aggregates multiple…

Machine Learning · Computer Science 2022-06-22 Peter Hayes , Mingtian Zhang , Raza Habib , Jordan Burgess , Emine Yilmaz , David Barber

Constrained Labeling for Weakly Supervised Learning

Curation of large fully supervised datasets has become one of the major roadblocks for machine learning. Weak supervision provides an alternative to supervised learning by training with cheap, noisy, and possibly correlated labeling…

Machine Learning · Computer Science 2021-06-01 Chidubem Arachie , Bert Huang

Self-supervised Multi-scale Consistency for Weakly Supervised Segmentation Learning

Collecting large-scale medical datasets with fine-grained annotations is time-consuming and requires experts. For this reason, weakly supervised learning aims at optimising machine learning models using weaker forms of annotations, such as…

Computer Vision and Pattern Recognition · Computer Science 2021-08-27 Gabriele Valvano , Andrea Leo , Sotirios A. Tsaftaris

Meta-Learning for Neural Relation Classification with Distant Supervision

Distant supervision provides a means to create a large number of weakly labeled data at low cost for relation classification. However, the resulting labeled instances are very noisy, containing data with wrong labels. Many approaches have…

Computation and Language · Computer Science 2020-10-27 Zhenzhen Li , Jian-Yun Nie , Benyou Wang , Pan Du , Yuhan Zhang , Lixin Zou , Dongsheng Li

Training Subset Selection for Weak Supervision

Existing weak supervision approaches use all the data covered by weak signals to train a classifier. We show both theoretically and empirically that this is not always optimal. Intuitively, there is a tradeoff between the amount of…

Machine Learning · Statistics 2023-03-08 Hunter Lang , Aravindan Vijayaraghavan , David Sontag

Weak Supervision with Incremental Source Accuracy Estimation

Motivated by the desire to generate labels for real-time data we develop a method to estimate the dependency structure and accuracy of weak supervision sources incrementally. Our method first estimates the dependency structure associated…

Machine Learning · Computer Science 2022-05-12 Richard Gresham Correro

Towards Theoretical Understanding of Weak Supervision for Information Retrieval

Neural network approaches have recently shown to be effective in several information retrieval (IR) tasks. However, neural approaches often require large volumes of training data to perform effectively, which is not always available. To…

Information Retrieval · Computer Science 2018-06-14 Hamed Zamani , W. Bruce Croft

Mitigating Noisy Supervision Using Synthetic Samples with Soft Labels

Noisy labels are ubiquitous in real-world datasets, especially in the large-scale ones derived from crowdsourcing and web searching. It is challenging to train deep neural networks with noisy datasets since the networks are prone to…

Computer Vision and Pattern Recognition · Computer Science 2024-06-26 Yangdi Lu , Wenbo He

End-to-End Weak Supervision

Aggregating multiple sources of weak supervision (WS) can ease the data-labeling bottleneck prevalent in many machine learning applications, by replacing the tedious manual collection of ground truth labels. Current state of the art…

Machine Learning · Computer Science 2021-12-01 Salva Rühling Cachay , Benedikt Boecking , Artur Dubrawski

An Adaptive Method for Weak Supervision with Drifting Data

We introduce an adaptive method with formal quality guarantees for weak supervision in a non-stationary setting. Our goal is to infer the unknown labels of a sequence of data by using weak supervision sources that provide independent noisy…

Machine Learning · Computer Science 2025-05-05 Alessio Mazzetto , Reza Esfandiarpoor , Akash Singirikonda , Eli Upfal , Stephen H. Bach

Multi-View Video-Based Learning: Leveraging Weak Labels for Frame-Level Perception

For training a video-based action recognition model that accepts multi-view video, annotating frame-level labels is tedious and difficult. However, it is relatively easy to annotate sequence-level labels. This kind of coarse annotations are…

Computer Vision and Pattern Recognition · Computer Science 2024-03-20 Vijay John , Yasutomo Kawanishi

The limited availability of ground truth relevance labels has been a major impediment to the application of supervised methods to ad-hoc retrieval. As a result, unsupervised scoring methods, such as BM25, remain strong competitors to deep…

Information Retrieval · Computer Science 2019-07-23 Dany Haddad , Joydeep Ghosh

Automating Weak Label Generation for Data Programming with Clinicians in the Loop

Large Deep Neural Networks (DNNs) are often data hungry and need high-quality labeled data in copious amounts for learning to converge. This is a challenge in the field of medicine since high quality labeled data is often scarce. Data…

Machine Learning · Computer Science 2024-07-12 Jean Park , Sydney Pugh , Kaustubh Sridhar , Mengyu Liu , Navish Yarna , Ramneet Kaur , Souradeep Dutta , Elena Bernardis , Oleg Sokolsky , Insup Lee

Semi-supervised Deep Learning for Image Classification with Distribution Mismatch: A Survey

Deep learning methodologies have been employed in several different fields, with an outstanding success in image recognition applications, such as material quality control, medical imaging, autonomous driving, etc. Deep learning models rely…

Computer Vision and Pattern Recognition · Computer Science 2022-03-11 Saul Calderon-Ramirez , Shengxiang Yang , David Elizondo

Reliable Weakly Supervised Learning: Maximize Gain and Maintain Safeness

Weakly supervised data are widespread and have attracted much attention. However, since label quality is often difficult to guarantee, sometimes the use of weakly supervised data will lead to unsatisfactory performance, i.e., performance…

Machine Learning · Computer Science 2019-04-23 Lan-Zhe Guo , Yu-Feng Li , Ming Li , Jin-Feng Yi , Bo-Wen Zhou , Zhi-Hua Zhou

Semi-Supervised Learning for Sparsely-Labeled Sequential Data: Application to Healthcare Video Processing

Labeled data is a critical resource for training and evaluating machine learning models. However, many real-life datasets are only partially labeled. We propose a semi-supervised machine learning training strategy to improve event detection…

Computer Vision and Pattern Recognition · Computer Science 2022-10-05 Florian Dubost , Erin Hong , Nandita Bhaskhar , Siyi Tang , Daniel Rubin , Christopher Lee-Messer