Related papers: Exploratory Machine Learning with Unknown Unknowns

Identifying Unknown Unknowns in the Open World: Representations and Policies for Guided Exploration

Predictive models deployed in the real world may assign incorrect labels to instances with high confidence. Such errors or unknown unknowns are rooted in model incompleteness, and typically arise because of the mismatch between training…

Artificial Intelligence · Computer Science 2016-12-13 Himabindu Lakkaraju , Ece Kamar , Rich Caruana , Eric Horvitz

Managing the unknown: a survey on Open Set Recognition and tangential areas

In real-world scenarios classification models are often required to perform robustly when predicting samples belonging to classes that have not appeared during its training stage. Open Set Recognition addresses this issue by devising models…

Machine Learning · Computer Science 2024-01-08 Marcos Barcina-Blanco , Jesus L. Lobo , Pablo Garcia-Bringas , Javier Del Ser

Exploratory Learning

In multiclass semi-supervised learning (SSL), it is sometimes the case that the number of classes present in the data is not known, and hence no labeled examples are provided for some classes. In this paper we present variants of well-known…

Machine Learning · Computer Science 2013-07-02 Bhavana Dalvi , William W. Cohen , Jamie Callan

Toward Supervised Anomaly Detection

Anomaly detection is being regarded as an unsupervised learning task as anomalies stem from adversarial or unlikely events with unknown distributions. However, the predictive performance of purely unsupervised anomaly detection often fails…

Machine Learning · Computer Science 2014-01-27 Nico Goernitz , Marius Micha Kloft , Konrad Rieck , Ulf Brefeld

Toward Deep Supervised Anomaly Detection: Reinforcement Learning from Partially Labeled Anomaly Data

We consider the problem of anomaly detection with a small set of partially labeled anomaly examples and a large-scale unlabeled dataset. This is a common scenario in many important applications. Existing related methods either exclusively…

Machine Learning · Computer Science 2021-06-11 Guansong Pang , Anton van den Hengel , Chunhua Shen , Longbing Cao

Addressing Missing Sources with Adversarial Support-Matching

When trained on diverse labeled data, machine learning models have proven themselves to be a powerful tool in all facets of society. However, due to budget limitations, deliberate or non-deliberate censorship, and other problems during data…

Machine Learning · Statistics 2022-03-25 Thomas Kehrenberg , Myles Bartlett , Viktoriia Sharmanska , Novi Quadrianto

Improvability Through Semi-Supervised Learning: A Survey of Theoretical Results

Semi-supervised learning is a setting in which one has labeled and unlabeled data available. In this survey we explore different types of theoretical results when one uses unlabeled data in classification and regression tasks. Most methods…

Machine Learning · Computer Science 2020-07-31 Alexander Mey , Marco Loog

Active Learning Using Uncertainty Information

Many active learning methods belong to the retraining-based approaches, which select one unlabeled instance, add it to the training set with its possible labels, retrain the classification model, and evaluate the criteria that we base our…

Machine Learning · Statistics 2017-03-01 Yazhou Yang , Marco Loog

Unknown Examples & Machine Learning Model Generalization

Over the past decades, researchers and ML practitioners have come up with better and better ways to build, understand and improve the quality of ML models, but mostly under the key assumption that the training data is distributed…

Machine Learning · Computer Science 2019-10-14 Yeounoh Chung , Peter J. Haas , Eli Upfal , Tim Kraska

Reinforcement Learning Based Multi-modal Feature Fusion Network for Novel Class Discovery

With the development of deep learning techniques, supervised learning has achieved performances surpassing those of humans. Researchers have designed numerous corresponding models for different data modalities, achieving excellent results…

Artificial Intelligence · Computer Science 2023-08-29 Qiang Li , Qiuyang Ma , Weizhi Nie , Anan Liu

Are labels informative in semi-supervised learning? -- Estimating and leveraging the missing-data mechanism

Semi-supervised learning is a powerful technique for leveraging unlabeled data to improve machine learning models, but it can be affected by the presence of ``informative'' labels, which occur when some classes are more likely to be labeled…

Machine Learning · Statistics 2023-02-16 Aude Sportisse , Hugo Schmutz , Olivier Humbert , Charles Bouveyron , Pierre-Alexandre Mattei

Reliable Semi-Supervised Learning when Labels are Missing at Random

Semi-supervised learning methods are motivated by the availability of large datasets with unlabeled features in addition to labeled data. Unlabeled data is, however, not guaranteed to improve classification performance and has in fact been…

Machine Learning · Statistics 2019-10-25 Xiuming Liu , Dave Zachariah , Johan Wågberg , Thomas B. Schön

Open-Set Representation Learning through Combinatorial Embedding

Visual recognition tasks are often limited to dealing with a small subset of classes simply because the labels for the remaining classes are unavailable. We are interested in identifying novel concepts in a dataset through representation…

Computer Vision and Pattern Recognition · Computer Science 2023-03-17 Geeho Kim , Junoh Kang , Bohyung Han

Unsupervised Embedding Learning from Uncertainty Momentum Modeling

Existing popular unsupervised embedding learning methods focus on enhancing the instance-level local discrimination of the given unlabeled images by exploring various negative data. However, the existed sample outliers which exhibit large…

Computer Vision and Pattern Recognition · Computer Science 2021-07-20 Jiahuan Zhou , Yansong Tang , Bing Su , Ying Wu

Detecting the Unexpected via Image Resynthesis

Classical semantic segmentation methods, including the recent deep learning ones, assume that all classes observed at test time have been seen during training. In this paper, we tackle the more realistic scenario where unexpected objects of…

Computer Vision and Pattern Recognition · Computer Science 2019-04-18 Krzysztof Lis , Krishna Nakka , Pascal Fua , Mathieu Salzmann

Informed Decision-Making through Advancements in Open Set Recognition and Unknown Sample Detection

Machine learning-based techniques open up many opportunities and improvements to derive deeper and more practical insights from data that can help businesses make informed decisions. However, the majority of these techniques focus on the…

Machine Learning · Computer Science 2024-05-10 Atefeh Mahdavi , Marco Carvalho

Open-set learning with augmented categories by exploiting unlabelled data

Novel categories are commonly defined as those unobserved during training but present during testing. However, partially labelled training datasets can contain unlabelled training samples that belong to novel categories, meaning these can…

Machine Learning · Statistics 2023-11-01 Emile R. Engelbrecht , Johan A. du Preez

Unreasonable effectiveness of unsupervised learning in identifying Majorana topology

In unsupervised learning, the training data for deep learning does not come with any labels, thus forcing the algorithm to discover hidden patterns in the data for discerning useful information. This, in principle, could be a powerful tool…

Disordered Systems and Neural Networks · Physics 2025-12-17 Jacob Taylor , Haining Pan , Sankar Das Sarma

Unsupervised feature learning with discriminative encoder

In recent years, deep discriminative models have achieved extraordinary performance on supervised learning tasks, significantly outperforming their generative counterparts. However, their success relies on the presence of a large amount of…

Computer Vision and Pattern Recognition · Computer Science 2017-09-05 Gaurav Pandey , Ambedkar Dukkipati

Learning to Learn in a Semi-Supervised Fashion

To address semi-supervised learning from both labeled and unlabeled data, we present a novel meta-learning scheme. We particularly consider that labeled and unlabeled data share disjoint ground truth label sets, which can be seen tasks like…

Computer Vision and Pattern Recognition · Computer Science 2020-08-26 Yun-Chun Chen , Chao-Te Chou , Yu-Chiang Frank Wang