English

InstanT: Semi-supervised Learning with Instance-dependent Thresholds

Machine Learning 2023-10-31 v1 Artificial Intelligence Computer Vision and Pattern Recognition Machine Learning

Abstract

Semi-supervised learning (SSL) has been a fundamental challenge in machine learning for decades. The primary family of SSL algorithms, known as pseudo-labeling, involves assigning pseudo-labels to confident unlabeled instances and incorporating them into the training set. Therefore, the selection criteria of confident instances are crucial to the success of SSL. Recently, there has been growing interest in the development of SSL methods that use dynamic or adaptive thresholds. Yet, these methods typically apply the same threshold to all samples, or use class-dependent thresholds for instances belonging to a certain class, while neglecting instance-level information. In this paper, we propose the study of instance-dependent thresholds, which has the highest degree of freedom compared with existing methods. Specifically, we devise a novel instance-dependent threshold function for all unlabeled instances by utilizing their instance-level ambiguity and the instance-dependent error rates of pseudo-labels, so instances that are more likely to have incorrect pseudo-labels will have higher thresholds. Furthermore, we demonstrate that our instance-dependent threshold function provides a bounded probabilistic guarantee for the correctness of the pseudo-labels it assigns.

Keywords

Cite

@article{arxiv.2310.18910,
  title  = {InstanT: Semi-supervised Learning with Instance-dependent Thresholds},
  author = {Muyang Li and Runze Wu and Haoyu Liu and Jun Yu and Xun Yang and Bo Han and Tongliang Liu},
  journal= {arXiv preprint arXiv:2310.18910},
  year   = {2023}
}

Comments

Accepted as poster for NeurIPS 2023