Related papers: Learning to Robustly Aggregate Labeling Functions …

Semi-Supervised Data Programming with Subset Selection

The paradigm of data programming, which uses weak supervision in the form of rules/labelling functions, and semi-supervised learning, which augments small amounts of labelled data with a large unlabelled dataset, have shown great promise in…

Machine Learning · Computer Science 2021-06-15 Ayush Maheshwari , Oishik Chatterjee , KrishnaTeja Killamsetty , Ganesh Ramakrishnan , Rishabh Iyer

Weakly Supervised Label Learning Flows

Supervised learning usually requires a large amount of labelled data. However, attaining ground-truth labels is costly for many tasks. Alternatively, weakly supervised methods learn with cheap weak signals that only approximately label some…

Machine Learning · Computer Science 2024-11-26 You Lu , Wenzhuo Song , Chidubem Arachie , Bert Huang

Semi-supervised Learning using Robust Loss

The amount of manually labeled data is limited in medical applications, so semi-supervised learning and automatic labeling strategies can be an asset for training deep neural networks. However, the quality of the automatically generated…

Machine Learning · Computer Science 2022-03-04 Wenhui Cui , Haleh Akrami , Anand A. Joshi , Richard M. Leahy

FlexSSL : A Generic and Efficient Framework for Semi-Supervised Learning

Semi-supervised learning holds great promise for many real-world applications, due to its ability to leverage both unlabeled and expensive labeled data. However, most semi-supervised learning algorithms still heavily rely on the limited…

Machine Learning · Computer Science 2023-12-29 Huiling Qin , Xianyuan Zhan , Yuanxun Li , Yu Zheng

Robust Feature Learning Against Noisy Labels

Supervised learning of deep neural networks heavily relies on large-scale datasets annotated by high-quality labels. In contrast, mislabeled samples can significantly degrade the generalization of models and result in memorizing samples,…

Computer Vision and Pattern Recognition · Computer Science 2023-07-11 Tsung-Ming Tai , Yun-Jie Jhang , Wen-Jyi Hwang

Communication-Efficient Robust Federated Learning with Noisy Labels

Federated learning (FL) is a promising privacy-preserving machine learning paradigm over distributed located data. In FL, the data is kept locally by each user. This protects the user privacy, but also makes the server difficult to verify…

Machine Learning · Computer Science 2022-06-14 Junyi Li , Jian Pei , Heng Huang

Refining Labeling Functions with Limited Labeled Data

Programmatic weak supervision (PWS) significantly reduces human effort for labeling data by combining the outputs of user-provided labeling functions (LFs) on unlabeled datapoints. However, the quality of the generated labels depends…

Machine Learning · Computer Science 2025-06-05 Chenjie Li , Amir Gilad , Boris Glavic , Zhengjie Miao , Sudeepa Roy

Data Programming using Continuous and Quality-Guided Labeling Functions

Scarcity of labeled data is a bottleneck for supervised learning models. A paradigm that has evolved for dealing with this problem is data programming. An existing data programming paradigm allows human supervision to be provided as a set…

Machine Learning · Computer Science 2019-11-25 Oishik Chatterjee , Ganesh Ramakrishnan , Sunita Sarawagi

SELF: Learning to Filter Noisy Labels with Self-Ensembling

Deep neural networks (DNNs) have been shown to over-fit a dataset when being trained with noisy labels for a long enough time. To overcome this problem, we present a simple and effective method self-ensemble label filtering (SELF) to…

Computer Vision and Pattern Recognition · Computer Science 2019-10-07 Duc Tam Nguyen , Chaithanya Kumar Mummadi , Thi Phuong Nhung Ngo , Thi Hoai Phuong Nguyen , Laura Beggel , Thomas Brox

Meta Label Correction for Noisy Label Learning

Leveraging weak or noisy supervision for building effective machine learning models has long been an important research problem. Its importance has further increased recently due to the growing need for large-scale datasets to train deep…

Machine Learning · Computer Science 2021-08-09 Guoqing Zheng , Ahmed Hassan Awadallah , Susan Dumais

Learning from Noisy Labels via Self-Taught On-the-Fly Meta Loss Rescaling

Correct labels are indispensable for training effective machine learning models. However, creating high-quality labels is expensive, and even professionally labeled data contains errors and ambiguities. Filtering and denoising can be…

Computation and Language · Computer Science 2024-12-18 Michael Heck , Christian Geishauser , Nurul Lubis , Carel van Niekerk , Shutong Feng , Hsien-Chin Lin , Benjamin Matthias Ruppik , Renato Vukovic , Milica Gašić

Reliable Label Bootstrapping for Semi-Supervised Learning

Reducing the amount of labels required to train convolutional neural networks without performance degradation is key to effectively reduce human annotation efforts. We propose Reliable Label Bootstrapping (ReLaB), an unsupervised…

Computer Vision and Pattern Recognition · Computer Science 2021-02-26 Paul Albert , Diego Ortego , Eric Arazo , Noel E. O'Connor , Kevin McGuinness

Meta-learning of semi-supervised learning from tasks with heterogeneous attribute spaces

We propose a meta-learning method for semi-supervised learning that learns from multiple tasks with heterogeneous attribute spaces. The existing semi-supervised meta-learning methods assume that all tasks share the same attribute space,…

Machine Learning · Computer Science 2023-11-10 Tomoharu Iwata , Atsutoshi Kumagai

Revisiting Meta-Learning with Noisy Labels: Reweighting Dynamics and Theoretical Guarantees

Learning with noisy labels remains challenging because over-parameterized networks memorize corrupted supervision. Meta-learning-based sample reweighting mitigates this by using a small clean subset to guide training, yet its behavior and…

Machine Learning · Computer Science 2025-10-15 Yiming Zhang , Chester Holtz , Gal Mishne , Alex Cloninger

Learning Hyper Label Model for Programmatic Weak Supervision

To reduce the human annotation efforts, the programmatic weak supervision (PWS) paradigm abstracts weak supervision sources as labeling functions (LFs) and involves a label model to aggregate the output of multiple LFs to produce training…

Machine Learning · Computer Science 2023-03-09 Renzhi Wu , Shen-En Chen , Jieyu Zhang , Xu Chu

Learning to Learn from Noisy Labeled Data

Despite the success of deep neural networks (DNNs) in image classification tasks, the human-level performance relies on massive training data with high-quality manual annotations, which are expensive and time-consuming to collect. There…

Machine Learning · Computer Science 2019-04-15 Junnan Li , Yongkang Wong , Qi Zhao , Mohan Kankanhalli

Self-semi-supervised Learning to Learn from NoisyLabeled Data

The remarkable success of today's deep neural networks highly depends on a massive number of correctly labeled data. However, it is rather costly to obtain high-quality human-labeled data, leading to the active research area of training…

Machine Learning · Computer Science 2020-11-04 Jiacheng Wang , Yue Ma , Shuang Gao

Distilling Effective Supervision from Severe Label Noise

Collecting large-scale data with clean labels for supervised training of neural networks is practically challenging. Although noisy labels are usually cheap to acquire, existing methods suffer a lot from label noise. This paper targets at…

Machine Learning · Computer Science 2020-06-16 Zizhao Zhang , Han Zhang , Sercan O. Arik , Honglak Lee , Tomas Pfister

MetaLabelNet: Learning to Generate Soft-Labels from Noisy-Labels

Real-world datasets commonly have noisy labels, which negatively affects the performance of deep neural networks (DNNs). In order to address this problem, we propose a label noise robust learning algorithm, in which the base classifier is…

Machine Learning · Computer Science 2022-07-13 Görkem Algan , Ilkay Ulusoy

APAM: Adaptive Pre-training and Adaptive Meta Learning in Language Model for Noisy Labels and Long-tailed Learning

Practical natural language processing (NLP) tasks are commonly long-tailed with noisy labels. Those problems challenge the generalization and robustness of complex models such as Deep Neural Networks (DNNs). Some commonly used resampling…

Computation and Language · Computer Science 2023-05-04 Sunyi Chi , Bo Dong , Yiming Xu , Zhenyu Shi , Zheng Du