Related papers: Multi-class Classification from Multiple Unlabeled…

Reliable Semi-Supervised Learning when Labels are Missing at Random

Semi-supervised learning methods are motivated by the availability of large datasets with unlabeled features in addition to labeled data. Unlabeled data is, however, not guaranteed to improve classification performance and has in fact been…

Machine Learning · Statistics 2019-10-25 Xiuming Liu, Dave Zachariah, Johan Wågberg, Thomas B. Schön

Mitigating Overfitting in Supervised Classification from Two Unlabeled Datasets: A Consistent Risk Correction Approach

The recently proposed unlabeled-unlabeled (UU) classification method allows us to train a binary classifier only from two unlabeled datasets with different class priors. Since this method is based on the empirical risk minimization, it…

Machine Learning · Computer Science 2020-04-01 Nan Lu, Tianyi Zhang, Gang Niu, Masashi Sugiyama

Modeling Multiple Annotator Expertise in the Semi-Supervised Learning Scenario

Learning algorithms normally assume that there is at most one annotation or label per data point. However, in some scenarios, such as medical diagnosis and on-line collaboration, multiple annotations may be available. In either case,…

Machine Learning · Computer Science 2012-03-19 Yan Yan, Romer Rosales, Glenn Fung, Jennifer Dy

Towards Imbalanced Large Scale Multi-label Classification with Partially Annotated Labels

Multi-label classification is a widely encountered problem in daily life, where an instance can be associated with multiple classes. In theory, this is a supervised learning method that requires a large amount of labeling. However,…

Computer Vision and Pattern Recognition · Computer Science 2023-08-02 Xin Zhang, Yuqi Song, Fei Zuo, Xiaofeng Wang

Rethinking the Value of Labels for Improving Class-Imbalanced Learning

Real-world data often exhibits long-tailed distributions with heavy class imbalance, posing great challenges for deep recognition models. We identify a persisting dilemma on the value of labels in the context of imbalanced learning: on the…

Machine Learning · Computer Science 2020-09-29 Yuzhe Yang, Zhi Xu

Semi-Supervised Deep Learning Using Improved Unsupervised Discriminant Projection

Deep learning demands a huge amount of well-labeled data to train the network parameters. How to use the least amount of labeled data to obtain the desired classification accuracy is of great practical significance, because for many…

Machine Learning · Computer Science 2019-12-20 Xiao Han, Zihao Wang, Enmei Tu, Gunnam Suryanarayana, Jie Yang

Semi-Supervised Collective Classification via Hybrid Label Regularization

Many classification problems involve data instances that are interlinked with each other, such as webpages connected by hyperlinks. Techniques for "collective classification" (CC) often increase accuracy for such data graphs, but usually…

Machine Learning · Computer Science 2012-07-03 Luke McDowell, David Aha

Regularization With Stochastic Transformations and Perturbations for Deep Semi-Supervised Learning

Effective convolutional neural networks are trained on large sets of labeled data. However, creating large labeled datasets is a very costly and time-consuming task. Semi-supervised learning uses unlabeled data to train a model with higher…

Computer Vision and Pattern Recognition · Computer Science 2016-06-16 Mehdi Sajjadi, Mehran Javanmardi, Tolga Tasdizen

Multi-class Classification without Multi-class Labels

This work presents a new strategy for multi-class classification that requires no class-specific labels, but instead leverages pairwise similarity between examples, which is a weaker form of annotation. The proposed method, meta…

Machine Learning · Computer Science 2019-01-04 Yen-Chang Hsu, Zhaoyang Lv, Joel Schlosser, Phillip Odom, Zsolt Kira

Semi-Supervised Learning with Meta-Gradient

In this work, we propose a simple yet effective meta-learning algorithm in semi-supervised learning. We notice that most existing consistency-based approaches suffer from overfitting and limited model generalization ability, especially when…

Machine Learning · Computer Science 2021-03-18 Xin-Yu Zhang, Taihong Xiao, Haolin Jia, Ming-Ming Cheng, Ming-Hsuan Yang

Estimating the Accuracies of Multiple Classifiers Without Labeled Data

In various situations one is given only the predictions of multiple classifiers over a large unlabeled test data. This scenario raises the following questions: Without any labeled data and without any a-priori knowledge about the…

Machine Learning · Statistics 2014-10-31 Ariel Jaffe, Boaz Nadler, Yuval Kluger

Mutual Exclusivity Loss for Semi-Supervised Deep Learning

In this paper we consider the problem of semi-supervised learning with deep Convolutional Neural Networks (ConvNets). Semi-supervised learning is motivated by the observation that unlabeled data is cheap and can be used to improve the…

Computer Vision and Pattern Recognition · Computer Science 2016-06-13 Mehdi Sajjadi, Mehran Javanmardi, Tolga Tasdizen

Clustering Unclustered Data: Unsupervised Binary Labeling of Two Datasets Having Different Class Balances

We consider the unsupervised learning problem of assigning labels to unlabeled data. A naive approach is to use clustering methods, but this works well only when data is properly clustered and each cluster corresponds to an underlying…

Machine Learning · Computer Science 2013-05-02 Marthinus Christoffel du Plessis, Masashi Sugiyama

Multi-Complementary and Unlabeled Learning for Arbitrary Losses and Models

A weakly-supervised learning framework named as complementary-label learning has been proposed recently, where each sample is equipped with a single complementary label that denotes one of the classes the sample does not belong to. However,…

Machine Learning · Statistics 2020-07-24 Yuzhou Cao, Shuqi Liu, Yitian Xu

Poisoning the Unlabeled Dataset of Semi-Supervised Learning

Semi-supervised machine learning models learn from a (small) set of labeled training examples, and a (large) set of unlabeled training examples. State-of-the-art models can reach within a few percentage points of fully-supervised training,…

Machine Learning · Computer Science 2021-08-11 Nicholas Carlini

Semi-Unsupervised Learning: Clustering and Classifying using Ultra-Sparse Labels

In semi-supervised learning for classification, it is assumed that every ground truth class of data is present in the small labelled dataset. Many real-world sparsely-labelled datasets are plausibly not of this type. It could easily be the…

Machine Learning · Statistics 2021-01-11 Matthew Willetts, Stephen J Roberts, Christopher C Holmes

Unsupervised Supervised Learning II: Training Margin Based Classifiers without Labels

Many popular linear classifiers, such as logistic regression, boosting, or SVM, are trained by optimizing a margin-based risk function. Traditionally, these risk functions are computed based on a labeled dataset. We develop a novel…

Machine Learning · Computer Science 2010-07-23 Krishnakumar Balasubramanian, Pinar Donmez, Guy Lebanon

Learning from Complementary Labels

Collecting labeled data is costly and thus a critical bottleneck in real-world classification tasks. To mitigate this problem, we propose a novel setting, namely learning from complementary labels for multi-class classification. A…

Machine Learning · Statistics 2017-11-15 Takashi Ishida, Gang Niu, Weihua Hu, Masashi Sugiyama

Risk-Consistent Multiclass Learning from Random Label-Subset Membership Queries

Obtaining accurate class labels is often costly or unreliable, and may also be limited by privacy or other practical conditions. Compared with asking an annotator to provide the exact class, it is often easier to ask whether the true label…

Machine Learning · Computer Science 2026-05-11 Jiaxu Su, Junpeng Li, Changchun Hua, Yana Yang

Multi-class Probabilistic Bounds for Self-learning

Self-learning is a classical approach for learning with both labeled and unlabeled observations which consists in giving pseudo-labels to unlabeled training instances with a confidence score over a predetermined threshold. At the same time,…

Machine Learning · Computer Science 2021-09-30 Vasilii Feofanov, Emilie Devijver, Massih-Reza Amini