Related papers: Labelling as an unsupervised learning problem

Clustering Unclustered Data: Unsupervised Binary Labeling of Two Datasets Having Different Class Balances

We consider the unsupervised learning problem of assigning labels to unlabeled data. A naive approach is to use clustering methods, but this works well only when data is properly clustered and each cluster corresponds to an underlying…

Machine Learning · Computer Science 2013-05-02 Marthinus Christoffel du Plessis , Masashi Sugiyama

Learning from Noisy Label Distributions

In this paper, we consider a novel machine learning problem, that is, learning a classifier from noisy label distributions. In this problem, each instance with a feature vector belongs to at least one group. Then, instead of the true label…

Machine Learning · Computer Science 2017-08-17 Yuya Yoshikawa

Exploiting Context for Robustness to Label Noise in Active Learning

Several works in computer vision have demonstrated the effectiveness of active learning for adapting the recognition model when new unlabeled data becomes available. Most of these works consider that labels obtained from the annotator are…

Computer Vision and Pattern Recognition · Computer Science 2020-10-20 Sudipta Paul , Shivkumar Chandrasekaran , B. S. Manjunath , Amit K. Roy-Chowdhury

Label Ranking through Nonparametric Regression

Label Ranking (LR) corresponds to the problem of learning a hypothesis that maps features to rankings over a finite set of labels. We adopt a nonparametric regression approach to LR and obtain theoretical performance guarantees for this…

Machine Learning · Computer Science 2022-02-11 Dimitris Fotakis , Alkis Kalavasis , Eleni Psaroudaki

Neural Relation Graph: A Unified Framework for Identifying Label Noise and Outlier Data

Diagnosing and cleaning data is a crucial step for building robust machine learning systems. However, identifying problems within large-scale datasets with real-world distributions is challenging due to the presence of complex issues such…

Machine Learning · Computer Science 2023-10-31 Jang-Hyun Kim , Sangdoo Yun , Hyun Oh Song

Topic Model Based Multi-Label Classification from the Crowd

Multi-label classification is a common supervised machine learning problem where each instance is associated with multiple classes. The key challenge in this problem is learning the correlations between the classes. An additional challenge…

Machine Learning · Computer Science 2016-04-05 Divya Padmanabhan , Satyanath Bhat , Shirish Shevade , Y. Narahari

Crowd Labeling: a survey

Recently, there has been a burst in the number of research projects on human computation via crowdsourcing. Multiple choice (or labeling) questions could be referred to as a common type of problem which is solved by this approach. As an…

Artificial Intelligence · Computer Science 2014-09-04 Jafar Muhammadi , Hamid Reza Rabiee , Abbas Hosseini

Learning from Noisy Labels with Distillation

The ability of learning from noisy labels is very useful in many visual recognition tasks, as a vast amount of data with noisy labels are relatively easy to obtain. Traditionally, the label noises have been treated as statistical outliers,…

Computer Vision and Pattern Recognition · Computer Science 2017-04-11 Yuncheng Li , Jianchao Yang , Yale Song , Liangliang Cao , Jiebo Luo , Li-Jia Li

Learning with Noisy labels via Self-supervised Adversarial Noisy Masking

Collecting large-scale datasets is crucial for training deep models, annotating the data, however, inevitably yields noisy labels, which poses challenges to deep learning algorithms. Previous efforts tend to mitigate this problem via…

Computer Vision and Pattern Recognition · Computer Science 2023-02-16 Yuanpeng Tu , Boshen Zhang , Yuxi Li , Liang Liu , Jian Li , Jiangning Zhang , Yabiao Wang , Chengjie Wang , Cai Rong Zhao

Label Noise Types and Their Effects on Deep Learning

The recent success of deep learning is mostly due to the availability of big datasets with clean annotations. However, gathering a cleanly annotated dataset is not always feasible due to practical challenges. As a result, label noise is a…

Computer Vision and Pattern Recognition · Computer Science 2020-03-25 Görkem Algan , İlkay Ulusoy

Fuzzy Label: From Concept to Its Application in Label Learning

Label learning is a fundamental task in machine learning that aims to construct intelligent models using labeled data, encompassing traditional single-label and multi-label classification models. Traditional methods typically rely on…

Machine Learning · Computer Science 2025-11-11 Chenxi Luoa , Zhuangzhuang Zhaoa , Zhaohong Denga , Te Zhangb

Protocols for Learning Classifiers on Distributed Data

We consider the problem of learning classifiers for labeled data that has been distributed across several nodes. Our goal is to find a single classifier, with small approximation error, across all datasets while minimizing the communication…

Machine Learning · Statistics 2012-03-06 Hal Daume , Jeff M. Phillips , Avishek Saha , Suresh Venkatasubramanian

Learning with Noisy Labels: Interconnection of Two Expectation-Maximizations

Labor-intensive labeling becomes a bottleneck in developing computer vision algorithms based on deep learning. For this reason, dealing with imperfect labels has increasingly gained attention and has become an active field of study. We…

Computer Vision and Pattern Recognition · Computer Science 2024-01-10 Heewon Kim , Hyun Sung Chang , Kiho Cho , Jaeyun Lee , Bohyung Han

Image Classification with Deep Learning in the Presence of Noisy Labels: A Survey

Image classification systems recently made a giant leap with the advancement of deep neural networks. However, these systems require an excessive amount of labeled data to be adequately trained. Gathering a correctly annotated dataset is…

Machine Learning · Computer Science 2021-01-19 Görkem Algan , Ilkay Ulusoy

Label Name is Mantra: Unifying Point Cloud Segmentation across Heterogeneous Datasets

Point cloud segmentation is a fundamental task in 3D vision that serves a wide range of applications. Although great progresses have been made these years, its practical usability is still limited by the availability of training data.…

Computer Vision and Pattern Recognition · Computer Science 2023-03-21 Yixun Liang , Hao He , Shishi Xiao , Hao Lu , Yingcong Chen

Learning Causal Transition Matrix for Instance-dependent Label Noise

Noisy labels are both inevitable and problematic in machine learning methods, as they negatively impact models' generalization ability by causing overfitting. In the context of learning with noise, the transition matrix plays a crucial role…

Machine Learning · Computer Science 2025-03-26 Jiahui Li , Tai-Wei Chang , Kun Kuang , Ximing Li , Long Chen , Jun Zhou

Towards the Identifiability in Noisy Label Learning: A Multinomial Mixture Modelling Approach

Learning from noisy labels (LNL) is crucial in deep learning, in which one of the approaches is to identify clean-label samples from poorly-annotated datasets. Such an identification is challenging because the conventional LNL problem,…

Machine Learning · Computer Science 2025-09-26 Cuong Nguyen , Thanh-Toan Do , Gustavo Carneiro

Probabilistic Decoupling of Labels in Classification

In this paper we develop a principled, probabilistic, unified approach to non-standard classification tasks, such as semi-supervised, positive-unlabelled, multi-positive-unlabelled and noisy-label learning. We train a classifier on the…

Machine Learning · Computer Science 2020-06-17 Jeppe Nørregaard , Lars Kai Hansen

In-Context Learning with Noisy Labels

In-context learning refers to the emerging ability of large language models (LLMs) to perform a target task without additional training, utilizing demonstrations of the task. Recent studies aim to enhance in-context learning performance by…

Computation and Language · Computer Science 2024-12-02 Junyong Kang , Donghyun Son , Hwanjun Song , Buru Chang

Multi-Label Bayesian Active Learning with Inter-Label Relationships

The primary challenge of multi-label active learning, differing it from multi-class active learning, lies in assessing the informativeness of an indefinite number of labels while also accounting for the inherited label correlation. Existing…

Machine Learning · Computer Science 2025-09-05 Yuanyuan Qi , Jueqing Lu , Xiaohao Yang , Joanne Enticott , Lan Du