Related papers: Conditional Mutual Information Constrained Deep Le…

Normalized Conditional Mutual Information Surrogate Loss for Deep Neural Classifiers

In this paper, we propose a novel information theoretic surrogate loss; normalized conditional mutual information (NCMI); as a drop in alternative to the de facto cross-entropy (CE) for training deep neural network (DNN) based classifiers.…

Machine Learning · Computer Science 2026-01-21 Linfeng Ye , Zhixiang Chi , Konstantinos N. Plataniotis , En-hui Yang

Going Beyond Feature Similarity: Effective Dataset Distillation based on Class-Aware Conditional Mutual Information

Dataset distillation (DD) aims to minimize the time and memory consumption needed for training deep neural networks on large datasets, by creating a smaller synthetic dataset that has similar performance to that of the full real dataset.…

Computer Vision and Pattern Recognition · Computer Science 2025-05-20 Xinhao Zhong , Bin Chen , Hao Fang , Xulin Gu , Shu-Tao Xia , En-Hui Yang

Mutual Information Learned Classifiers: an Information-theoretic Viewpoint of Training Deep Learning Classification Systems

Deep learning systems have been reported to acheive state-of-the-art performances in many applications, and one of the keys for achieving this is the existence of well trained classifiers on benchmark datasets which can be used as backbone…

Machine Learning · Computer Science 2022-10-04 Jirong Yi , Qiaosheng Zhang , Zhen Chen , Qiao Liu , Wei Shao

Pruning Deep Convolutional Neural Network Using Conditional Mutual Information

Convolutional Neural Networks (CNNs) achieve high performance in image classification tasks but are challenging to deploy on resource-limited hardware due to their large model sizes. To address this issue, we leverage Mutual Information, a…

Machine Learning · Computer Science 2024-11-28 Tien Vu-Van , Dat Du Thanh , Nguyen Ho , Mai Vu

Neural Estimators for Conditional Mutual Information Using Nearest Neighbors Sampling

The estimation of mutual information (MI) or conditional mutual information (CMI) from a set of samples is a long-standing problem. A recent line of work in this area has leveraged the approximation power of artificial neural networks and…

Information Theory · Computer Science 2021-10-27 Sina Molavipour , Germán Bassi , Mikael Skoglund

CCMI : Classifier based Conditional Mutual Information Estimation

Conditional Mutual Information (CMI) is a measure of conditional dependence between random variables X and Y, given another random variable Z. It can be used to quantify conditional dependence among variables in many data-driven inference…

Machine Learning · Computer Science 2019-06-10 Sudipto Mukherjee , Himanshu Asnani , Sreeram Kannan

C-MI-GAN : Estimation of Conditional Mutual Information using MinMax formulation

Estimation of information theoretic quantities such as mutual information and its conditional variant has drawn interest in recent times owing to their multifaceted applications. Newly proposed neural estimators for these quantities have…

Machine Learning · Computer Science 2020-07-24 Arnab Kumar Mondal , Arnab Bhattacharya , Sudipto Mukherjee , Prathosh AP , Sreeram Kannan , Himanshu Asnani

Reasoning About Generalization via Conditional Mutual Information

We provide an information-theoretic framework for studying the generalization properties of machine learning algorithms. Our framework ties together existing approaches, including uniform convergence bounds and recent methods for adaptive…

Machine Learning · Computer Science 2020-06-22 Thomas Steinke , Lydia Zakynthinou

DeepMI: A Mutual Information Based Framework For Unsupervised Deep Learning of Tasks

In this work, we propose an information theory based framework DeepMI to train deep neural networks (DNN) using Mutual Information (MI). The DeepMI framework is especially targeted but not limited to the learning of real world tasks in an…

Computer Vision and Pattern Recognition · Computer Science 2022-03-07 Ashish Kumar , Laxmidhar Behera

Towards Undistillable Models by Minimizing Conditional Mutual Information

A deep neural network (DNN) is said to be undistillable if, when used as a black-box input-output teacher, it cannot be distilled through knowledge distillation (KD). In this case, the distilled student (referred to as the knockoff student)…

Machine Learning · Computer Science 2025-07-02 Linfeng Ye , Shayan Mohajer Hamidi , En-hui Yang

Conditional Mutual Information-Based Generalization Bound for Meta Learning

Meta-learning optimizes an inductive bias---typically in the form of the hyperparameters of a base-learning algorithm---by observing data from a finite number of related tasks. This paper presents an information-theoretic bound on the…

Machine Learning · Computer Science 2021-02-09 Arezou Rezazadeh , Sharu Theresa Jose , Giuseppe Durisi , Osvaldo Simeone

Leveraging Conditional Mutual Information to Improve Large Language Model Fine-Tuning For Classification

Although large language models (LLMs) have demonstrated remarkable capabilities in recent years, the potential of information theory (IT) to enhance LLM development remains underexplored. This paper introduces the information theoretic…

Computation and Language · Computer Science 2025-05-01 Thanushon Sivakaran , En-Hui Yang

Diffeomorphic Information Neural Estimation

Mutual Information (MI) and Conditional Mutual Information (CMI) are multi-purpose tools from information theory that are able to naturally measure the statistical dependencies between random variables, thus they are usually of central…

Machine Learning · Computer Science 2022-11-22 Bao Duong , Thin Nguyen

Mutual Information Learned Classifiers: an Information-theoretic Viewpoint of Training Deep Learning Classification Systems

Deep learning systems have been reported to achieve state-of-the-art performances in many applications, and a key is the existence of well trained classifiers on benchmark datasets. As a main-stream loss function, the cross entropy can…

Machine Learning · Computer Science 2022-09-22 Jirong Yi , Qiaosheng Zhang , Zhen Chen , Qiao Liu , Wei Shao

Learning Kernel for Conditional Moment-Matching Discrepancy-based Image Classification

Conditional Maximum Mean Discrepancy (CMMD) can capture the discrepancy between conditional distributions by drawing support from nonlinear kernel functions, thus it has been successfully used for pattern classification. However, CMMD does…

Computer Vision and Pattern Recognition · Computer Science 2020-08-25 Chuan-Xian Ren , Pengfei Ge , Dao-Qing Dai , Hong Yan

A Probabilistic Representation of DNNs: Bridging Mutual Information and Generalization

Recently, Mutual Information (MI) has attracted attention in bounding the generalization error of Deep Neural Networks (DNNs). However, it is intractable to accurately estimate the MI in DNNs, thus most previous works have to relax the MI…

Machine Learning · Computer Science 2021-06-21 Xinjie Lan , Kenneth Barner

Conditional Deep Learning for Energy-Efficient and Enhanced Pattern Recognition

Deep learning neural networks have emerged as one of the most powerful classification tools for vision related applications. However, the computational and energy requirements associated with such deep nets can be quite high, and hence…

Computer Vision and Pattern Recognition · Computer Science 2016-01-29 Priyadarshini Panda , Abhronil Sengupta , Kaushik Roy

Contrastive MIM: A Contrastive Mutual Information Framework for Unified Generative and Discriminative Representation Learning

Learning representations that generalize well to unknown downstream tasks is a central challenge in representation learning. Existing approaches such as contrastive learning, self-supervised masking, and denoising auto-encoders address this…

Machine Learning · Computer Science 2025-09-10 Micha Livne

On The Utility of Conditional Generation Based Mutual Information for Characterizing Adversarial Subspaces

Recent studies have found that deep learning systems are vulnerable to adversarial examples; e.g., visually unrecognizable adversarial images can easily be crafted to result in misclassification. The robustness of neural networks has been…

Computer Vision and Pattern Recognition · Computer Science 2018-09-25 Chia-Yi Hsu , Pei-Hsuan Lu , Pin-Yu Chen , Chia-Mu Yu

Contrastive Mutual Information Learning: Toward Robust Representations without Positive-Pair Augmentations

Learning representations that transfer well to diverse downstream tasks remains a central challenge in representation learning. Existing paradigms -- contrastive learning, self-supervised masking, and denoising auto-encoders -- balance this…

Machine Learning · Computer Science 2025-09-29 Micha Livne