Related papers: Explicit Mutual Information Maximization for Self-…

More Synergy, Less Redundancy: Exploiting Joint Mutual Information for Self-Supervised Learning

Self-supervised learning (SSL) is now a serious competitor for supervised learning, even though it does not require data annotation. Several baselines have attempted to make SSL models exploit information about data distribution, and less…

Computer Vision and Pattern Recognition · Computer Science 2023-07-04 Salman Mohamadi , Gianfranco Doretto , Donald A. Adjeroh

Understanding Self-Supervised Learning via Latent Distribution Matching

Self-supervised learning (SSL) excels at finding general-purpose latent representations from complex data, yet lacks a unifying theoretical framework that explains the diverse existing methods and guides the design of new ones. We cast SSL…

Machine Learning · Computer Science 2026-05-28 Fabian A Mikulasch , Friedemann Zenke

Rethinking Self-Supervised Learning Within the Framework of Partial Information Decomposition

Self Supervised learning (SSL) has demonstrated its effectiveness in feature learning from unlabeled data. Regarding this success, there have been some arguments on the role that mutual information plays within the SSL framework. Some works…

Computer Vision and Pattern Recognition · Computer Science 2024-12-04 Salman Mohamadi , Gianfranco Doretto , Donald A. Adjeroh

Maximally Useful and Minimally Redundant: The Key to Self Supervised Learning for Imbalanced Data

Contrastive self supervised learning(CSSL) usually makes use of the multi-view assumption which states that all relevant information must be shared between all views. The main objective of CSSL is to maximize the mutual information(MI)…

Computer Vision and Pattern Recognition · Computer Science 2026-04-03 Yash Kumar Sharma , Vineet Padmanabhan

MIM: Mutual Information Machine

We introduce the Mutual Information Machine (MIM), a probabilistic auto-encoder for learning joint distributions over observations and latent variables. MIM reflects three design principles: 1) low divergence, to encourage the encoder and…

Machine Learning · Computer Science 2020-02-24 Micha Livne , Kevin Swersky , David J. Fleet

Analysis of High-dimensional Gaussian Labeled-unlabeled Mixture Model via Message-passing Algorithm

Semi-supervised learning (SSL) is a machine learning methodology that leverages unlabeled data in conjunction with a limited amount of labeled data. Although SSL has been applied in various applications and its effectiveness has been…

Machine Learning · Computer Science 2025-03-14 Xiaosi Gu , Tomoyuki Obuchi

Information-Maximized Soft Variable Discretization for Self-Supervised Image Representation Learning

Self-supervised learning (SSL) has emerged as a crucial technique in image processing, encoding, and understanding, especially for developing today's vision foundation models that utilize large-scale datasets without annotations to enhance…

Computer Vision and Pattern Recognition · Computer Science 2025-01-08 Chuang Niu , Wenjun Xia , Hongming Shan , Ge Wang

Mutual Information Maximization for Effective Lip Reading

Lip reading has received an increasing research interest in recent years due to the rapid development of deep learning and its widespread potential applications. One key point to obtain good performance for the lip reading task depends…

Computer Vision and Pattern Recognition · Computer Science 2020-03-17 Xing Zhao , Shuang Yang , Shiguang Shan , Xilin Chen

Semi-Supervised Empirical Risk Minimization: Using unlabeled data to improve prediction

We present a general methodology for using unlabeled data to design semi supervised learning (SSL) variants of the Empirical Risk Minimization (ERM) learning process. Focusing on generalized linear regression, we analyze of the…

Machine Learning · Statistics 2022-03-08 Oren Yuval , Saharon Rosset

Information theoretic underpinning of self-supervised learning by clustering

Self-supervised learning (SSL) is recognized as an essential tool for building foundation models for Artificial Intelligence applications. The advances in SSL have been made thanks to vigorous arguments about the principles of SSL and…

Machine Learning · Computer Science 2026-05-13 Josef Kittler , Sara Atito , Muhammad Awais

High Mutual Information in Representation Learning with Symmetric Variational Inference

We introduce the Mutual Information Machine (MIM), a novel formulation of representation learning, using a joint distribution over the observations and latent state in an encoder/decoder framework. Our key principles are symmetry and mutual…

Machine Learning · Statistics 2019-10-10 Micha Livne , Kevin Swersky , David J. Fleet

Theoretical Analysis of Submodular Information Measures for Targeted Data Subset Selection

With increasing volume of data being used across machine learning tasks, the capability to target specific subsets of data becomes more important. To aid in this capability, the recently proposed Submodular Mutual Information (SMI) has been…

Machine Learning · Computer Science 2024-10-28 Nathan Beck , Truong Pham , Rishabh Iyer

Meta-Semi: A Meta-learning Approach for Semi-supervised Learning

Deep learning based semi-supervised learning (SSL) algorithms have led to promising results in recent years. However, they tend to introduce multiple tunable hyper-parameters, making them less practical in real SSL scenarios where the…

Machine Learning · Computer Science 2024-10-30 Yulin Wang , Jiayi Guo , Shiji Song , Gao Huang

SeMi: When Imbalanced Semi-Supervised Learning Meets Mining Hard Examples

Semi-Supervised Learning (SSL) can leverage abundant unlabeled data to boost model performance. However, the class-imbalanced data distribution in real-world scenarios poses great challenges to SSL, resulting in performance degradation.…

Computer Vision and Pattern Recognition · Computer Science 2025-01-13 Yin Wang , Zixuan Wang , Hao Lu , Zhen Qin , Hailiang Zhao , Guanjie Cheng , Ge Su , Li Kuang , Mengchu Zhou , Shuiguang Deng

Self-Supervised Learning with Kernel Dependence Maximization

We approach self-supervised learning of image representations from a statistical dependence perspective, proposing Self-Supervised Learning with the Hilbert-Schmidt Independence Criterion (SSL-HSIC). SSL-HSIC maximizes dependence between…

Machine Learning · Statistics 2021-12-06 Yazhe Li , Roman Pogodin , Danica J. Sutherland , Arthur Gretton

On the Out-of-Distribution Generalization of Self-Supervised Learning

In this paper, we focus on the out-of-distribution (OOD) generalization of self-supervised learning (SSL). By analyzing the mini-batch construction during the SSL training phase, we first give one plausible explanation for SSL having OOD…

Machine Learning · Computer Science 2025-05-23 Wenwen Qiang , Jingyao Wang , Zeen Song , Jiangmeng Li , Changwen Zheng

MaxMatch: Semi-Supervised Learning with Worst-Case Consistency

In recent years, great progress has been made to incorporate unlabeled data to overcome the inefficiently supervised problem via semi-supervised learning (SSL). Most state-of-the-art models are based on the idea of pursuing consistent model…

Machine Learning · Computer Science 2022-09-27 Yangbangyan Jiang , Xiaodan Li , Yuefeng Chen , Yuan He , Qianqian Xu , Zhiyong Yang , Xiaochun Cao , Qingming Huang

Semi-Supervised Learning in the Few-Shot Zero-Shot Scenario

Semi-Supervised Learning (SSL) is a framework that utilizes both labeled and unlabeled data to enhance model performance. Conventional SSL methods operate under the assumption that labeled and unlabeled data share the same label space.…

Computer Vision and Pattern Recognition · Computer Science 2023-11-16 Noam Fluss , Guy Hacohen , Daphna Weinshall

Informative missingness and its implications in semi-supervised learning

Semi-supervised learning (SSL) constructs classifiers using both labelled and unlabelled data. It leverages information from labelled samples, whose acquisition is often costly or labour-intensive, together with unlabelled data to enhance…

Machine Learning · Statistics 2025-12-29 Jinran Wu , You-Gan Wang , Geoffrey J. McLachlan

Semi-supervised Learning based on Distributionally Robust Optimization

We propose a novel method for semi-supervised learning (SSL) based on data-driven distributionally robust optimization (DRO) using optimal transport metrics. Our proposed method enhances generalization error by using the unlabeled data to…

Machine Learning · Statistics 2020-04-21 Jose Blanchet , Yang Kang