Related papers: Fidelity-Weighted Learning

Learning to Rank from Samples of Variable Quality

Training deep neural networks requires many training samples, but in practice, training labels are expensive to obtain and may be of varying quality, as some may be from trusted expert labelers while others might be from heuristics or other…

Information Retrieval · Computer Science · 2018-06-25 · Mostafa Dehghani, Jaap Kamps

Avoiding Your Teacher's Mistakes: Training Neural Networks with Controlled Weak Supervision

Training deep neural networks requires massive amounts of training data, but for many tasks only limited labeled data is available. This makes weak supervision attractive, using weak or noisy signals like the output of heuristic methods or…

Machine Learning · Computer Science · 2017-12-08 · Mostafa Dehghani, Aliaksei Severyn, Sascha Rothe, Jaap Kamps

Weakly Supervised Label Learning Flows

Supervised learning usually requires a large amount of labelled data. However, attaining ground-truth labels is costly for many tasks. Alternatively, weakly supervised methods learn with cheap weak signals that only approximately label some…

Machine Learning · Computer Science · 2024-11-26 · You Lu, Wenzhuo Song, Chidubem Arachie, Bert Huang

Learning to Learn from Weak Supervision by Full Supervision

In this paper, we propose a method for training neural networks when we have a large set of data with weak labels and a small amount of data with true labels. In our proposed model, we train two neural networks: a target network, the…

Machine Learning · Statistics · 2017-12-01 · Mostafa Dehghani, Aliaksei Severyn, Sascha Rothe, Jaap Kamps

(FL)$^2$: Overcoming Few Labels in Federated Semi-Supervised Learning

Federated Learning (FL) is a distributed machine learning framework that trains accurate global models while preserving clients' privacy-sensitive data. However, most FL approaches assume that clients possess labeled data, which is often…

Machine Learning · Computer Science · 2024-11-01 · Seungjoo Lee, Thanh-Long V. Le, Jaemin Shin, Sung-Ju Lee

Revisiting Pretraining for Semi-Supervised Learning in the Low-Label Regime

Semi-supervised learning (SSL) addresses the lack of labeled data by exploiting large unlabeled data through pseudolabeling. However, in the extremely low-label regime, pseudo labels could be incorrect, a.k.a. the confirmation bias, and the…

Computer Vision and Pattern Recognition · Computer Science · 2022-05-09 · Xun Xu, Jingyi Liao, Lile Cai, Manh Cuong Nguyen, Kangkang Lu, Wanyue Zhang, Yasin Yazici, Chuan Sheng Foo

A Novel Technique for Robust Training of Deep Networks With Multisource Weak Labeled Remote Sensing Data

Deep learning has gained broad interest in remote sensing image scene classification thanks to the effectiveness of deep neural networks in extracting the semantics from complex data. However, deep networks require large amounts of training…

Computer Vision and Pattern Recognition · Computer Science · 2025-10-08 · Gianmarco Perantoni, Lorenzo Bruzzone

Learning across label confidence distributions using Filtered Transfer Learning

Performance of neural network models relies on the availability of large datasets with minimal levels of uncertainty. Transfer Learning (TL) models have been proposed to resolve the issue of small dataset size by letting the model train on…

Machine Learning · Computer Science · 2020-06-05 · Seyed Ali Madani Tonekaboni, Andrew E. Brereton, Zhaleh Safikhani, Andreas Windemuth, Benjamin Haibe-Kains, Stephen MacKinnon

Weaker Than You Think: A Critical Look at Weakly Supervised Learning

Weakly supervised learning is a popular approach for training machine learning models in low-resource settings. Instead of requesting high-quality yet costly human annotations, it allows training models with noisy annotations obtained from…

Computation and Language · Computer Science · 2023-09-19 · Dawei Zhu, Xiaoyu Shen, Marius Mosbach, Andreas Stephan, Dietrich Klakow

Self-semi-supervised Learning to Learn from NoisyLabeled Data

The remarkable success of today's deep neural networks highly depends on a massive number of correctly labeled data. However, it is rather costly to obtain high-quality human-labeled data, leading to the active research area of training…

Machine Learning · Computer Science · 2020-11-04 · Jiacheng Wang, Yue Ma, Shuang Gao

Semi-supervised Learning using Robust Loss

The amount of manually labeled data is limited in medical applications, so semi-supervised learning and automatic labeling strategies can be an asset for training deep neural networks. However, the quality of the automatically generated…

Machine Learning · Computer Science · 2022-03-04 · Wenhui Cui, Haleh Akrami, Anand A. Joshi, Richard M. Leahy

Training Subset Selection for Weak Supervision

Existing weak supervision approaches use all the data covered by weak signals to train a classifier. We show both theoretically and empirically that this is not always optimal. Intuitively, there is a tradeoff between the amount of…

Machine Learning · Statistics · 2023-03-08 · Hunter Lang, Aravindan Vijayaraghavan, David Sontag

Robust Partial-Label Learning by Leveraging Class Activation Values

Real-world training data is often noisy; for example, human annotators assign conflicting class labels to the same instances. Partial-label learning (PLL) is a weakly supervised learning paradigm that allows training classifiers in this…

Machine Learning · Computer Science · 2025-10-27 · Tobias Fuchs, Florian Kalinke

FlexSSL: A Generic and Efficient Framework for Semi-Supervised Learning

Semi-supervised learning holds great promise for many real-world applications, due to its ability to leverage both unlabeled and expensive labeled data. However, most semi-supervised learning algorithms still heavily rely on the limited…

Machine Learning · Computer Science · 2023-12-29 · Huiling Qin, Xianyuan Zhan, Yuanxun Li, Yu Zheng

Frugal Reinforcement-based Active Learning

Most of the existing learning models, particularly deep neural networks, are reliant on large datasets whose hand-labeling is expensive and time demanding. A current trend is to make the learning of these models frugal and less dependent on…

Computer Vision and Pattern Recognition · Computer Science · 2022-12-12 · Sebastien Deschamps, Hichem Sahbi

Iterative Label Improvement: Robust Training by Confidence Based Filtering and Dataset Partitioning

State-of-the-art, high capacity deep neural networks not only require large amounts of labelled training data, they are also highly susceptible to label errors in this data, typically resulting in large efforts and costs and therefore…

Machine Learning · Computer Science · 2020-07-20 · Christian Haase-Schütz, Rainer Stal, Heinz Hertlein, Bernhard Sick

Reliable Weakly Supervised Learning: Maximize Gain and Maintain Safeness

Weakly supervised data are widespread and have attracted much attention. However, since label quality is often difficult to guarantee, sometimes the use of weakly supervised data will lead to unsatisfactory performance, i.e., performance…

Machine Learning · Computer Science · 2019-04-23 · Lan-Zhe Guo, Yu-Feng Li, Ming Li, Jin-Feng Yi, Bo-Wen Zhou, Zhi-Hua Zhou

On the Power of Deep but Naive Partial Label Learning

Partial label learning (PLL) is a class of weakly supervised learning where each training instance consists of a data and a set of candidate labels containing a unique ground truth label. To tackle this problem, a majority of current…

Machine Learning · Computer Science · 2021-02-09 · Junghoon Seo, Joon Suk Huh

Empowering HWNs with Efficient Data Labeling: A Clustered Federated Semi-Supervised Learning Approach

Clustered Federated Multitask Learning (CFL) has gained considerable attention as an effective strategy for overcoming statistical challenges, particularly when dealing with non independent and identically distributed (non IID) data across…

Networking and Internet Architecture · Computer Science · 2024-01-22 · Moqbel Hamood, Abdullatif Albaseer, Mohamed Abdallah, Ala Al-Fuqaha

Self-Training with Weak Supervision

State-of-the-art deep neural networks require large-scale labeled training data that is often expensive to obtain or not available for many tasks. Weak supervision in the form of domain-specific rules has been shown to be useful in such…

Computation and Language · Computer Science · 2021-04-13 · Giannis Karamanolakis, Subhabrata Mukherjee, Guoqing Zheng, Ahmed Hassan Awadallah