English
Related papers

Related papers: Interactive Label Cleaning with Example-based Expl…

200 papers

In learning tasks with label noise, improving model robustness against overfitting is a pivotal challenge because the model eventually memorizes labels, including the noisy ones. Identifying the samples with noisy labels and preventing the…

Machine Learning · Computer Science 2023-09-28 Reihaneh Torkzadehmahani , Reza Nasirigerdeh , Daniel Rueckert , Georgios Kaissis

Imperfections in data annotation, known as label noise, are detrimental to the training of machine learning models and have an often-overlooked confounding effect on the assessment of model performance. Nevertheless, employing experts to…

Deep models trained with noisy labels are prone to over-fitting and struggle in generalization. Most existing solutions are based on an ideal assumption that the label noise is class-conditional, i.e., instances of the same class share the…

Computer Vision and Pattern Recognition · Computer Science 2022-08-01 Ganlong Zhao , Guanbin Li , Yipeng Qin , Feng Liu , Yizhou Yu

Label noise is ubiquitous in various machine learning scenarios such as self-labeling with model predictions and erroneous data annotation. Many existing approaches are based on heuristics such as sample losses, which might not be flexible…

Machine Learning · Computer Science 2022-12-29 Zhihao Wang , Zongyu Lin , Peiqi Liu , Guidong ZHeng , Junjie Wen , Xianxin Chen , Yujun Chen , Zhilin Yang

We propose a method for jointly inferring labels across a collection of data samples, where each sample consists of an observation and a prior belief about the label. By implicitly assuming the existence of a generative model for which a…

Machine Learning · Computer Science 2022-06-22 Esther Rolf , Nikolay Malkin , Alexandros Graikos , Ana Jojic , Caleb Robinson , Nebojsa Jojic

Deep learning models rely heavily on large volumes of labeled data to achieve high performance. However, real-world datasets often contain noisy labels due to human error, ambiguity, or resource constraints during the annotation process.…

Computer Vision and Pattern Recognition · Computer Science 2024-12-09 Gouranga Bala , Anuj Gupta , Subrat Kumar Behera , Amit Sethi

In this paper we propose novel methodologies to construct Support Vector Machine -based classifiers that takes into account that label noises occur in the training sample. We propose different alternatives based on solving Mixed Integer…

Machine Learning · Computer Science 2020-04-22 Víctor Blanco , Alberto Japón , Justo Puerto

High-quality labels are expensive to obtain for many machine learning tasks, such as medical image classification tasks. Therefore, probabilistic (weak) labels produced by weak supervision tools are used to seed a process in which…

Databases · Computer Science 2021-07-27 Yinjun Wu , James Weimer , Susan B. Davidson

Several works in computer vision have demonstrated the effectiveness of active learning for adapting the recognition model when new unlabeled data becomes available. Most of these works consider that labels obtained from the annotator are…

Computer Vision and Pattern Recognition · Computer Science 2020-10-20 Sudipta Paul , Shivkumar Chandrasekaran , B. S. Manjunath , Amit K. Roy-Chowdhury

Label noise, commonly found in real-world datasets, has a detrimental impact on a model's generalization. To effectively detect incorrectly labeled instances, previous works have mostly relied on distinguishable training signals, such as…

Machine Learning · Computer Science 2024-05-31 Suyeon Kim , Dongha Lee , SeongKu Kang , Sukang Chae , Sanghwan Jang , Hwanjo Yu

Semi-supervised learning based methods are current SOTA solutions to the noisy-label learning problem, which rely on learning an unsupervised label cleaner first to divide the training samples into a labeled set for clean data and an…

Computer Vision and Pattern Recognition · Computer Science 2022-12-22 Jingjia Huang , Yuanqi Chen , Jiashi Feng , Xinglong Wu

Label noise will degenerate the performance of deep learning algorithms because deep neural networks easily overfit label errors. Let X and Y denote the instance and clean label, respectively. When Y is a cause of X, according to which many…

Machine Learning · Statistics 2022-06-06 Yu Yao , Tongliang Liu , Mingming Gong , Bo Han , Gang Niu , Kun Zhang

Learning from noisy labels (LNL) is crucial in deep learning, in which one of the approaches is to identify clean-label samples from poorly-annotated datasets. Such an identification is challenging because the conventional LNL problem,…

Machine Learning · Computer Science 2025-09-26 Cuong Nguyen , Thanh-Toan Do , Gustavo Carneiro

Learning from corrupted labels is very common in real-world machine-learning applications. Memorizing such noisy labels could affect the learning of the model, leading to sub-optimal performances. In this work, we propose a novel framework…

Machine Learning · Computer Science 2023-12-20 Yu Wang , Xin Xin , Zaiqiao Meng , Joemon Jose , Fuli Feng

Label noise - incorrect labels assigned to observations - can substantially degrade the performance of supervised classifiers. This paper proposes a label noise cleaning method based on Bernoulli random sampling. We show that the mean label…

Methodology · Statistics 2026-03-17 Yuxin Liu , Xiong Jin , Yang Han

Learning with noisy labels has aroused much research interest since data annotations, especially for large-scale datasets, may be inevitably imperfect. Recent approaches resort to a semi-supervised learning problem by dividing training…

Computer Vision and Pattern Recognition · Computer Science 2022-07-20 Kai Wang , Xiangyu Peng , Shuo Yang , Jianfei Yang , Zheng Zhu , Xinchao Wang , Yang You

Representing a true label as a one-hot vector is a common practice in training text classification models. However, the one-hot representation may not adequately reflect the relation between the instances and labels, as labels are often not…

Computation and Language · Computer Science 2020-12-10 Biyang Guo , Songqiao Han , Xiao Han , Hailiang Huang , Ting Lu

Learning from noisy data has attracted much attention, where most methods focus on closed-set label noise. However, a more common scenario in the real world is the presence of both open-set and closed-set noise. Existing methods typically…

Machine Learning · Computer Science 2024-02-26 Wenhai Wan , Xinrui Wang , Ming-Kun Xie , Shao-Yuan Li , Sheng-Jun Huang , Songcan Chen

Partial multi-label learning and complementary multi-label learning are two popular weakly supervised multi-label classification paradigms that aim to alleviate the high annotation costs of collecting precisely annotated multi-label data.…

Machine Learning · Computer Science 2026-02-26 Wei Wang , Tianhao Ma , Ming-Kun Xie , Gang Niu , Masashi Sugiyama

Learning with Noisy labels (LNL) poses a significant challenge for the Machine Learning community. Some of the most widely used approaches that select as clean samples for which the model itself (the in-training model) has high confidence,…

Computer Vision and Pattern Recognition · Computer Science 2024-09-17 Chen Feng , Georgios Tzimiropoulos , Ioannis Patras
‹ Prev 1 2 3 10 Next ›