Related papers: Negative Confidence-Aware Weakly Supervised Binary…

Lifting Weak Supervision To Structured Prediction

Weak supervision (WS) is a rich set of techniques that produce pseudolabels by aggregating easily obtained but potentially noisy label estimates from a variety of sources. WS is theoretically well understood for binary classification, where…

Machine Learning · Computer Science 2022-11-28 Harit Vishwakarma , Nicholas Roberts , Frederic Sala

Binary Classification from Positive Data with Skewed Confidence

Positive-confidence (Pconf) classification [Ishida et al., 2018] is a promising weakly-supervised learning method which trains a binary classifier only from positive data equipped with confidence. However, in practice, the confidence may be…

Machine Learning · Statistics 2020-01-30 Kazuhiko Shinoda , Hirotaka Kaji , Masashi Sugiyama

Classification from Positive and Biased Negative Data with Skewed Labeled Posterior Probability

The binary classification problem has a situation where only biased data are observed in one of the classes. In this paper, we propose a new method to approach the positive and biased negative (PbN) classification problem, which is a weakly…

Methodology · Statistics 2025-10-28 Shotaro Watanabe , Hidetoshi Matsui

Binary Classification with Confidence Difference

Recently, learning with soft labels has been shown to achieve better performance than learning with hard labels in terms of model generalization, calibration, and robustness. However, collecting pointwise labeling confidence for all…

Machine Learning · Computer Science 2023-10-10 Wei Wang , Lei Feng , Yuchen Jiang , Gang Niu , Min-Ling Zhang , Masashi Sugiyama

Binary Classification with Positive Labeling Sources

To create a large amount of training labels for machine learning models effectively and efficiently, researchers have turned to Weak Supervision (WS), which uses programmatic labeling sources rather than manual annotation. Existing works of…

Machine Learning · Computer Science 2022-08-04 Jieyu Zhang , Yujing Wang , Yaming Yang , Yang Luo , Alexander Ratner

Correct and Weight: A Simple Yet Effective Loss for Implicit Feedback Recommendation

Learning from implicit feedback has become the standard paradigm for modern recommender systems. However, this setting is fraught with the persistent challenge of false negatives, where unobserved user-item interactions are not necessarily…

Information Retrieval · Computer Science 2026-01-09 Minglei Yin , Chuanbo Hu , Bin Liu , Neil Zhenqiang Gong , Yanfang , Ye , Xin Li

Learning from Similarity-Confidence and Confidence-Difference

In practical machine learning applications, it is often challenging to assign accurate labels to data, and increasing the number of labeled instances is often limited. In such cases, Weakly Supervised Learning (WSL), which enables training…

Machine Learning · Computer Science 2026-03-24 Tomoya Tate , Kosuke Sugiyama , Masato Uchida

Importance Reweighting for Biquality Learning

The field of Weakly Supervised Learning (WSL) has recently seen a surge of popularity, with numerous papers addressing different types of "supervision deficiencies", namely: poor quality, non adaptability, and insufficient quantity of…

Machine Learning · Computer Science 2021-09-28 Pierre Nodet , Vincent Lemaire , Alexis Bondu , Antoine Cornuéjols

Classification from Positive, Unlabeled and Biased Negative Data

In binary classification, there are situations where negative (N) data are too diverse to be fully labeled and we often resort to positive-unlabeled (PU) learning in these scenarios. However, collecting a non-representative N set that…

Machine Learning · Computer Science 2019-07-16 Yu-Guan Hsieh , Gang Niu , Masashi Sugiyama

Learning from Similarity-Confidence Data

Weakly supervised learning has drawn considerable attention recently to reduce the expensive time and labor consumption of labeling massive data. In this paper, we investigate a novel weakly supervised learning problem of learning from…

Machine Learning · Statistics 2021-02-16 Yuzhou Cao , Lei Feng , Yitian Xu , Bo An , Gang Niu , Masashi Sugiyama

AutoWS: Automated Weak Supervision Framework for Text Classification

Creating large, good quality labeled data has become one of the major bottlenecks for developing machine learning applications. Multiple techniques have been developed to either decrease the dependence of labeled data (zero/few-shot…

Computation and Language · Computer Science 2023-02-08 Abhinav Bohra , Huy Nguyen , Devashish Khatwani

Generalized Weak Supervision for Neural Information Retrieval

Neural ranking models (NRMs) have demonstrated effective performance in several information retrieval (IR) tasks. However, training NRMs often requires large-scale training data, which is difficult and expensive to obtain. To address this…

Information Retrieval · Computer Science 2023-04-19 Yen-Chieh Lien , Hamed Zamani , W. Bruce Croft

CW-BASS: Confidence-Weighted Boundary-Aware Learning for Semi-Supervised Semantic Segmentation

Semi-supervised semantic segmentation (SSSS) aims to improve segmentation performance by utilizing large amounts of unlabeled data with limited labeled samples. Existing methods often suffer from coupling, where over-reliance on initial…

Computer Vision and Pattern Recognition · Computer Science 2025-04-10 Ebenezer Tarubinga , Jenifer Kalafatovich , Seong-Whan Lee

Unlocking the Hidden Treasures: Enhancing Recommendations with Unlabeled Data

Collaborative filtering (CF) stands as a cornerstone in recommender systems, yet effectively leveraging the massive unlabeled data presents a significant challenge. Current research focuses on addressing the challenge of unlabeled data by…

Information Retrieval · Computer Science 2024-12-25 Yuhan Zhao , Rui Chen , Qilong Han , Hongtao Song , Li Chen

Weakly Supervised Classification Using Group-Level Labels

In many applications, finding adequate labeled data to train predictive models is a major challenge. In this work, we propose methods to use group-level binary labels as weak supervision to train instance-level binary classification models.…

Machine Learning · Computer Science 2021-08-18 Guruprasad Nayak , Rahul Ghosh , Xiaowei Jia , Vipin Kumar

Learning from Similarity/Dissimilarity and Pairwise Comparison

This paper addresses binary classification in scenarios where obtaining explicit instance level labels is impractical, by exploiting multiple weak labels defined on instance pairs. The existing SconfConfDiff classification framework relies…

Machine Learning · Computer Science 2026-03-23 Tomoya Tate , Kosuke Sugiyama , Masato Uchida

Positive Unlabeled Contrastive Learning

Self-supervised pretraining on unlabeled data followed by supervised fine-tuning on labeled data is a popular paradigm for learning from limited labeled examples. We extend this paradigm to the classical positive unlabeled (PU) setting,…

Machine Learning · Computer Science 2024-04-01 Anish Acharya , Sujay Sanghavi , Li Jing , Bhargav Bhushanam , Dhruv Choudhary , Michael Rabbat , Inderjit Dhillon

Binary Classification from Positive-Confidence Data

Can we learn a binary classifier from only positive data, without any negative data or unlabeled data? We show that if one can equip positive data with confidence (positive-confidence), one can successfully learn a binary classifier, which…

Machine Learning · Statistics 2018-11-29 Takashi Ishida , Gang Niu , Masashi Sugiyama

Rethinking Weak Supervision in Helping Contrastive Learning

Contrastive learning has shown outstanding performances in both supervised and unsupervised learning, and has recently been introduced to solve weakly supervised learning problems such as semi-supervised learning and noisy label learning.…

Machine Learning · Computer Science 2023-06-08 Jingyi Cui , Weiran Huang , Yifei Wang , Yisen Wang

Bayesian Negative Sampling for Recommendation

How to sample high quality negative instances from unlabeled data, i.e., negative sampling, is important for training implicit collaborative filtering and contrastive learning models. Although previous studies have proposed some approaches…

Information Retrieval · Computer Science 2022-07-12 Bin Liu , Bang Wang