Related papers: Exploratory Learning

Semi-Supervised Learning in the Few-Shot Zero-Shot Scenario

Semi-Supervised Learning (SSL) is a framework that utilizes both labeled and unlabeled data to enhance model performance. Conventional SSL methods operate under the assumption that labeled and unlabeled data share the same label space.…

Computer Vision and Pattern Recognition · Computer Science 2023-11-16 Noam Fluss , Guy Hacohen , Daphna Weinshall

Exploratory Machine Learning with Unknown Unknowns

In conventional supervised learning, a training dataset is given with ground-truth labels from a known label set, and the learned model will classify unseen instances to known labels. This paper studies a new problem setting in which there…

Machine Learning · Computer Science 2024-06-03 Peng Zhao , Jia-Wei Shan , Yu-Jie Zhang , Zhi-Hua Zhou

Informative missingness and its implications in semi-supervised learning

Semi-supervised learning (SSL) constructs classifiers using both labelled and unlabelled data. It leverages information from labelled samples, whose acquisition is often costly or labour-intensive, together with unlabelled data to enhance…

Machine Learning · Statistics 2025-12-29 Jinran Wu , You-Gan Wang , Geoffrey J. McLachlan

Robust Semi-Supervised Learning for Self-learning Open-World Classes

Existing semi-supervised learning (SSL) methods assume that labeled and unlabeled data share the same class space. However, in real-world applications, unlabeled data always contain classes not present in the labeled set, which may cause…

Machine Learning · Computer Science 2024-01-17 Wenjuan Xi , Xin Song , Weili Guo , Yang Yang

Unsupervised Selective Labeling for More Effective Semi-Supervised Learning

Given an unlabeled dataset and an annotation budget, we study how to selectively label a fixed number of instances so that semi-supervised learning (SSL) on such a partially labeled dataset is most effective. We focus on selecting the right…

Machine Learning · Computer Science 2023-08-24 Xudong Wang , Long Lian , Stella X. Yu

Semi-supervised Predictive Clustering Trees for (Hierarchical) Multi-label Classification

Semi-supervised learning (SSL) is a common approach to learning predictive models using not only labeled examples, but also unlabeled examples. While SSL for the simple tasks of classification and regression has received a lot of attention…

Machine Learning · Computer Science 2024-04-02 Jurica Levatić , Michelangelo Ceci , Dragi Kocev , Sašo Džeroski

Semi-supervised Learning with Contrastive Predicative Coding

Semi-supervised learning (SSL) provides a powerful framework for leveraging unlabeled data when labels are limited or expensive to obtain. SSL algorithms based on deep neural networks have recently proven successful on standard benchmark…

Machine Learning · Computer Science 2019-05-28 Jiaxing Wang , Yin Zheng , Xiaoshuang Chen , Junzhou Huang , Jian Cheng

Multi-Task Curriculum Framework for Open-Set Semi-Supervised Learning

Semi-supervised learning (SSL) has been proposed to leverage unlabeled data for training powerful models when only limited labeled data is available. While existing SSL methods assume that samples in the labeled and unlabeled data share the…

Computer Vision and Pattern Recognition · Computer Science 2020-07-23 Qing Yu , Daiki Ikami , Go Irie , Kiyoharu Aizawa

Semi-Supervised Learning with Declaratively Specified Entropy Constraints

We propose a technique for declaratively specifying strategies for semi-supervised learning (SSL). The proposed method can be used to specify ensembles of semi-supervised learning, as well as agreement constraints and entropic…

Machine Learning · Computer Science 2018-05-21 Haitian Sun , William W. Cohen , Lidong Bing

Semi-Supervised Learning with Scarce Annotations

While semi-supervised learning (SSL) algorithms provide an efficient way to make use of both labelled and unlabelled data, they generally struggle when the number of annotated samples is very small. In this work, we consider the problem of…

Computer Vision and Pattern Recognition · Computer Science 2020-04-23 Sylvestre-Alvise Rebuffi , Sebastien Ehrhardt , Kai Han , Andrea Vedaldi , Andrew Zisserman

Robust Semi-Supervised Learning in Open Environments

Semi-supervised learning (SSL) aims to improve performance by exploiting unlabeled data when labels are scarce. Conventional SSL studies typically assume close environments where important factors (e.g., label, feature, distribution)…

Machine Learning · Computer Science 2024-12-25 Lan-Zhe Guo , Lin-Han Jia , Jie-Jing Shao , Yu-Feng Li

OpenLDN: Learning to Discover Novel Classes for Open-World Semi-Supervised Learning

Semi-supervised learning (SSL) is one of the dominant approaches to address the annotation bottleneck of supervised learning. Recent SSL methods can effectively leverage a large repository of unlabeled data to improve performance while…

Computer Vision and Pattern Recognition · Computer Science 2022-07-29 Mamshad Nayeem Rizve , Navid Kardan , Salman Khan , Fahad Shahbaz Khan , Mubarak Shah

Semi-Supervised Learning of Classifiers from a Statistical Perspective: A Brief Review

There has been increasing attention to semi-supervised learning (SSL) approaches in machine learning to forming a classifier in situations where the training data for a classifier consists of a limited number of classified observations but…

Machine Learning · Statistics 2021-11-10 Daniel Ahfock , Geoffrey J. McLachlan

Semi-Supervised Empirical Risk Minimization: Using unlabeled data to improve prediction

We present a general methodology for using unlabeled data to design semi supervised learning (SSL) variants of the Empirical Risk Minimization (ERM) learning process. Focusing on generalized linear regression, we analyze of the…

Machine Learning · Statistics 2022-03-08 Oren Yuval , Saharon Rosset

Unlabeled Data vs. Pre-trained Knowledge: Rethinking SSL in the Era of Large Models

Semi-supervised learning (SSL) alleviates the cost of data labeling process by exploiting unlabeled data and has achieved promising results. Meanwhile, with the development of large foundation models, exploiting pre-trained models becomes a…

Machine Learning · Computer Science 2025-10-28 Song-Lin Lv , Rui Zhu , Tong Wei , Yu-Feng Li , Lan-Zhe Guo

Complementing Semi-Supervised Learning with Uncertainty Quantification

The problem of fully supervised classification is that it requires a tremendous amount of annotated data, however, in many datasets a large portion of data is unlabeled. To alleviate this problem semi-supervised learning (SSL) leverages the…

Machine Learning · Computer Science 2022-07-26 Ehsan Kazemi

A Survey on Self-supervised Learning: Algorithms, Applications, and Future Trends

Deep supervised learning algorithms typically require a large volume of labeled data to achieve satisfactory performance. However, the process of collecting and labeling such data can be expensive and time-consuming. Self-supervised…

Machine Learning · Computer Science 2024-07-16 Jie Gui , Tuo Chen , Jing Zhang , Qiong Cao , Zhenan Sun , Hao Luo , Dacheng Tao

Class-Imbalanced Semi-Supervised Learning

Semi-Supervised Learning (SSL) has achieved great success in overcoming the difficulties of labeling and making full use of unlabeled data. However, SSL has a limited assumption that the numbers of samples in different classes are balanced,…

Machine Learning · Computer Science 2020-02-18 Minsung Hyun , Jisoo Jeong , Nojun Kwak

Exploration and Exploitation of Unlabeled Data for Open-Set Semi-Supervised Learning

In this paper, we address a complex but practical scenario in semi-supervised learning (SSL) named open-set SSL, where unlabeled data contain both in-distribution (ID) and out-of-distribution (OOD) samples. Unlike previous methods that only…

Computer Vision and Pattern Recognition · Computer Science 2023-07-03 Ganlong Zhao , Guanbin Li , Yipeng Qin , Jinjin Zhang , Zhenhua Chai , Xiaolin Wei , Liang Lin , Yizhou Yu

Improving realistic semi-supervised learning with doubly robust estimation

A major challenge in Semi-Supervised Learning (SSL) is the limited information available about the class distribution in the unlabeled data. In many real-world applications this arises from the prevalence of long-tailed distributions, where…

Machine Learning · Computer Science 2025-02-04 Khiem Pham , Charles Herrmann , Ramin Zabih