English
Related papers

Related papers: Conditional Mutual Information Constrained Deep Le…

200 papers

In this paper, we propose a novel information theoretic surrogate loss; normalized conditional mutual information (NCMI); as a drop in alternative to the de facto cross-entropy (CE) for training deep neural network (DNN) based classifiers.…

Machine Learning · Computer Science 2026-01-21 Linfeng Ye , Zhixiang Chi , Konstantinos N. Plataniotis , En-hui Yang

Dataset distillation (DD) aims to minimize the time and memory consumption needed for training deep neural networks on large datasets, by creating a smaller synthetic dataset that has similar performance to that of the full real dataset.…

Computer Vision and Pattern Recognition · Computer Science 2025-05-20 Xinhao Zhong , Bin Chen , Hao Fang , Xulin Gu , Shu-Tao Xia , En-Hui Yang

Deep learning systems have been reported to acheive state-of-the-art performances in many applications, and one of the keys for achieving this is the existence of well trained classifiers on benchmark datasets which can be used as backbone…

Machine Learning · Computer Science 2022-10-04 Jirong Yi , Qiaosheng Zhang , Zhen Chen , Qiao Liu , Wei Shao

Convolutional Neural Networks (CNNs) achieve high performance in image classification tasks but are challenging to deploy on resource-limited hardware due to their large model sizes. To address this issue, we leverage Mutual Information, a…

Machine Learning · Computer Science 2024-11-28 Tien Vu-Van , Dat Du Thanh , Nguyen Ho , Mai Vu

The estimation of mutual information (MI) or conditional mutual information (CMI) from a set of samples is a long-standing problem. A recent line of work in this area has leveraged the approximation power of artificial neural networks and…

Information Theory · Computer Science 2021-10-27 Sina Molavipour , Germán Bassi , Mikael Skoglund

Conditional Mutual Information (CMI) is a measure of conditional dependence between random variables X and Y, given another random variable Z. It can be used to quantify conditional dependence among variables in many data-driven inference…

Machine Learning · Computer Science 2019-06-10 Sudipto Mukherjee , Himanshu Asnani , Sreeram Kannan

Estimation of information theoretic quantities such as mutual information and its conditional variant has drawn interest in recent times owing to their multifaceted applications. Newly proposed neural estimators for these quantities have…

Machine Learning · Computer Science 2020-07-24 Arnab Kumar Mondal , Arnab Bhattacharya , Sudipto Mukherjee , Prathosh AP , Sreeram Kannan , Himanshu Asnani

We provide an information-theoretic framework for studying the generalization properties of machine learning algorithms. Our framework ties together existing approaches, including uniform convergence bounds and recent methods for adaptive…

Machine Learning · Computer Science 2020-06-22 Thomas Steinke , Lydia Zakynthinou

In this work, we propose an information theory based framework DeepMI to train deep neural networks (DNN) using Mutual Information (MI). The DeepMI framework is especially targeted but not limited to the learning of real world tasks in an…

Computer Vision and Pattern Recognition · Computer Science 2022-03-07 Ashish Kumar , Laxmidhar Behera

A deep neural network (DNN) is said to be undistillable if, when used as a black-box input-output teacher, it cannot be distilled through knowledge distillation (KD). In this case, the distilled student (referred to as the knockoff student)…

Machine Learning · Computer Science 2025-07-02 Linfeng Ye , Shayan Mohajer Hamidi , En-hui Yang

Meta-learning optimizes an inductive bias---typically in the form of the hyperparameters of a base-learning algorithm---by observing data from a finite number of related tasks. This paper presents an information-theoretic bound on the…

Machine Learning · Computer Science 2021-02-09 Arezou Rezazadeh , Sharu Theresa Jose , Giuseppe Durisi , Osvaldo Simeone

Although large language models (LLMs) have demonstrated remarkable capabilities in recent years, the potential of information theory (IT) to enhance LLM development remains underexplored. This paper introduces the information theoretic…

Computation and Language · Computer Science 2025-05-01 Thanushon Sivakaran , En-Hui Yang

Mutual Information (MI) and Conditional Mutual Information (CMI) are multi-purpose tools from information theory that are able to naturally measure the statistical dependencies between random variables, thus they are usually of central…

Machine Learning · Computer Science 2022-11-22 Bao Duong , Thin Nguyen

Deep learning systems have been reported to achieve state-of-the-art performances in many applications, and a key is the existence of well trained classifiers on benchmark datasets. As a main-stream loss function, the cross entropy can…

Machine Learning · Computer Science 2022-09-22 Jirong Yi , Qiaosheng Zhang , Zhen Chen , Qiao Liu , Wei Shao

Conditional Maximum Mean Discrepancy (CMMD) can capture the discrepancy between conditional distributions by drawing support from nonlinear kernel functions, thus it has been successfully used for pattern classification. However, CMMD does…

Computer Vision and Pattern Recognition · Computer Science 2020-08-25 Chuan-Xian Ren , Pengfei Ge , Dao-Qing Dai , Hong Yan

Recently, Mutual Information (MI) has attracted attention in bounding the generalization error of Deep Neural Networks (DNNs). However, it is intractable to accurately estimate the MI in DNNs, thus most previous works have to relax the MI…

Machine Learning · Computer Science 2021-06-21 Xinjie Lan , Kenneth Barner

Deep learning neural networks have emerged as one of the most powerful classification tools for vision related applications. However, the computational and energy requirements associated with such deep nets can be quite high, and hence…

Computer Vision and Pattern Recognition · Computer Science 2016-01-29 Priyadarshini Panda , Abhronil Sengupta , Kaushik Roy

Learning representations that generalize well to unknown downstream tasks is a central challenge in representation learning. Existing approaches such as contrastive learning, self-supervised masking, and denoising auto-encoders address this…

Machine Learning · Computer Science 2025-09-10 Micha Livne

Recent studies have found that deep learning systems are vulnerable to adversarial examples; e.g., visually unrecognizable adversarial images can easily be crafted to result in misclassification. The robustness of neural networks has been…

Computer Vision and Pattern Recognition · Computer Science 2018-09-25 Chia-Yi Hsu , Pei-Hsuan Lu , Pin-Yu Chen , Chia-Mu Yu

Learning representations that transfer well to diverse downstream tasks remains a central challenge in representation learning. Existing paradigms -- contrastive learning, self-supervised masking, and denoising auto-encoders -- balance this…

Machine Learning · Computer Science 2025-09-29 Micha Livne
‹ Prev 1 2 3 10 Next ›